Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4877 |
Symbol | |
ID | 5736954 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 6212214 |
End bp | 6213245 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641282043 |
Product | UBA/THIF-type NAD/FAD binding protein |
Protein accession | YP_001547635 |
Protein GI | 159901388 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.360367 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCAGTA GTGGGATACC ATCAGCACGC TATAGTCGTC AAACGCGGTT TGCTGGGCTA GGCCAAGCCG GGCAGCAGCG CTTAGCCCAA GCACGAGTGG CGATTGTTGG CTTGGGTGCA ACCGGTAGCA CCATCGCCCA TGCCTTGCTA CGAGCGGGCG TAGGCTATTT GCGGCTAATC GACCGCGATT GGGTTGAGGA GCATAATTTG CCGCGCCAAA GTTTGTATAC CGAGGCTGAT GCTGCTCAGT TAGTGCCCAA AGTTGTGGCT GCCAAAGCCC ATGCCCAGCG CATCAACAGT GCTTGCGACA TTGAGGCGTT GGTGCTCGAT TTACATGCTG GCACGATTGA TCAAGCACTA GCTGGGGTCG ATTTAATTAT GGATGGCAGC GATAGCCTCG AAACCCGCTT GCTGATCAAT CAATGGTGTG TGCGTGAAGG CAAGCCATGG ATTTATAGTG GCGTGTTGGG TGGCCATGGC ATGACCGCAA ATTTTCGGCC CAAGCAAGCC TGTTGGCGCT GTGTTTTTAC GACTTCGCCG GAGCCAGGCA GCATGCCAAC GTGTGAAACC GCTGGCGTGA TTGGGCCAGT CGTGGGTGTT ATTGGCAATT TGGCGGCAAC TGAAGCACTT AAATTGCTCA GTGGGCAAGG CCAAGCCAAC CCCGATTTAT ATATGCTCGA TCTGTGGGCT TGGCAATTTG AGCAGCTGCC GCTGCCAACG CCGCGCCCCG ATTGCCCAGT TTGTGGCTTG CGCCAATTCG ATTTGCTGGA GCAAGATAGT GCGCCAACCC TGAGTTTATG TGGCCGCAAC GCCATCCAAA TTCGGCCACA ACAGCCAATC ACCATGGCCT TAGCCCAATT GGCAGCCCAT TTGCAGCAAG CCGATCTGCG AGTGATTCAA ACCGACTATC TGCTACGCTT TGCGGCTGAA ACCTTGCAGG CCACCGTGTT TCCTGATGGC CGCGTGATTA TCAGCGGCAC CGATGATCCA GCGCTGGCAC GCGGATTTTA CAATCGCTGG ATTAACCATT AA
|
Protein sequence | MPSSGIPSAR YSRQTRFAGL GQAGQQRLAQ ARVAIVGLGA TGSTIAHALL RAGVGYLRLI DRDWVEEHNL PRQSLYTEAD AAQLVPKVVA AKAHAQRINS ACDIEALVLD LHAGTIDQAL AGVDLIMDGS DSLETRLLIN QWCVREGKPW IYSGVLGGHG MTANFRPKQA CWRCVFTTSP EPGSMPTCET AGVIGPVVGV IGNLAATEAL KLLSGQGQAN PDLYMLDLWA WQFEQLPLPT PRPDCPVCGL RQFDLLEQDS APTLSLCGRN AIQIRPQQPI TMALAQLAAH LQQADLRVIQ TDYLLRFAAE TLQATVFPDG RVIISGTDDP ALARGFYNRW INH
|
| |