Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1942 |
Symbol | |
ID | 5733831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2353809 |
End bp | 2355551 |
Gene Length | 1743 bp |
Protein Length | 580 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279086 |
Product | hypothetical protein |
Protein accession | YP_001544713 |
Protein GI | 159898466 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0873468 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATTTG TTATTCAACA ATCGAAACTA AACGTAAAAA CTAGCTCGTT ATTCGTCTTT GGTTTGGCGT TATTGCTCTA TACCATAACC CTCTTACCAG GGCTTGGCGG TGGTGATACC GCCGAATTTC AGCGCGTTGG CCCAACCCTT GAGGTCGCCC ATAGCACAGG CTACCCGCTT TATAGCATCC TAACGTGGCT TTGGAGCAAG CTTATTCCAC TTGGGAGTAT TGCTTGGCGG GTTAATCTCT TTTCGGCGTT CGTTGCCGCC CTCAGTTTAA GCAGCTTCAA CCAACTTGCC CAAACGAGCG GCCTTGCCCA ACGTTGGGCC TTGGCTGGCA CTGCCATGCT TGGGCTATCG TCAAGCTTTT GGCAACAGGC TACTCAAGCC GAAATTTATG CGTTTGCGGT GCTTTTGCAG GTCAGCACGC TTGGGCTGAT CGTCGCTTGG TGGCAACAAC GCATACCGTT TTGGCTGCTT GGGCTAGCGG CAGGCCTGAT GCTTGGGCAT CATCGCAGCA GCGTGTTTAT TCTGCCATGG ATGCTGATTG CCTGTTGTTG GCAGCAACGC CCAAGTGTAC GGCAGTGGCT GTTGGCGGTT GGGGCTGGCT GTTTAAGCTT TCTGCCCTAT TTGTATATCG TTTGGCGTGC CCCAGCTTGG CAAAATGGTT GGGCATTGCT TACTGATTAT CTGCTTGGCA GTGCTGGCGG GGCATGGTTT GATCCTCAAC GAGCGCTCGA TCAAGGCTGG CAACGGCTCT TCGAGGTGCA AATCCAGCTG TTTGGGCCAC AATTAACCTG GCTGGGCTTG GGCTTAGCTG GGATTGGCTG GTGGCAGTGT TGGCAAAAAC AACGGCCACT CGCAATAATG CTTGGTGGTA GCTATTGCAG CATTTTGGGC TTTTGTTTGG TCTATTTCGT TGACGATCTG GCGGTGTTTG GGCTGGGAGC CTATGTTGCC CAAGCCTTGT TGATTGGCTT TGGGTTGGCC GCGTTGCCAT GGCCACGCTG GGGGCTGGGC TTGGCCTGCG GCATCAGTCT GTGGTTGGCT TGGCAGACAT GGCCACAAAT TCACCAATCC AATACCGCCC AACCTGAACA ACTTGCCAGA CAACGCTTGG CCGAACCACT CGCTGCCAAT AGCCTCGTGA TTGGTGATGG TTGGAGCATT GAGAGTTTGC GCTATCTGCA AACCGTTGAG CAGCTACGGC CAGATCTCGA ATTTAGCTTT CAAGCCGAGC AGCAACGGAT TCGCGATCAA TTAGCCCAAG GTCGCCAGAT CTATGCCTTG CAGGCGATCC CTGAATTGGG CTTGCAGCAT CGTAAAGTTG GCGATTGGTG GCAGATCGAG AATATTCCTC AGCAGTTTAA GCAACAAGTC GCTATTGTGT GGCAGAATGG TATTCAATTA ACTGGATTTG ATTTACCGGC TCAAGCTCAA GATCAGCTGT TAATTGGGCT ACGTTGGCAA ACCAGCCAGC AATTACCGAG CTTAATTCGC TTCATTCATG TGCTTGATCA AACTGGCCAG TTGGTTGCGC AAACTGATAG CCCAACCAAT CCTAGCAGCG AATTTTGGCC CCTAGCCCAA GCCCAAACCG ACCTATTGGC GGTCAATTTG CCGCAGTATT TGCCAGCAGG CAACTATAGC GTCATTGTCG GCTGGTACGA TTTGGGTGGG CAGCGGATTG CGCTCACTCC TCAGCAAAAT ACCTATCTGC TTGGCATGGT TATGCTCGAT TGA
|
Protein sequence | MQFVIQQSKL NVKTSSLFVF GLALLLYTIT LLPGLGGGDT AEFQRVGPTL EVAHSTGYPL YSILTWLWSK LIPLGSIAWR VNLFSAFVAA LSLSSFNQLA QTSGLAQRWA LAGTAMLGLS SSFWQQATQA EIYAFAVLLQ VSTLGLIVAW WQQRIPFWLL GLAAGLMLGH HRSSVFILPW MLIACCWQQR PSVRQWLLAV GAGCLSFLPY LYIVWRAPAW QNGWALLTDY LLGSAGGAWF DPQRALDQGW QRLFEVQIQL FGPQLTWLGL GLAGIGWWQC WQKQRPLAIM LGGSYCSILG FCLVYFVDDL AVFGLGAYVA QALLIGFGLA ALPWPRWGLG LACGISLWLA WQTWPQIHQS NTAQPEQLAR QRLAEPLAAN SLVIGDGWSI ESLRYLQTVE QLRPDLEFSF QAEQQRIRDQ LAQGRQIYAL QAIPELGLQH RKVGDWWQIE NIPQQFKQQV AIVWQNGIQL TGFDLPAQAQ DQLLIGLRWQ TSQQLPSLIR FIHVLDQTGQ LVAQTDSPTN PSSEFWPLAQ AQTDLLAVNL PQYLPAGNYS VIVGWYDLGG QRIALTPQQN TYLLGMVMLD
|
| |