Gene Haur_4877 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4877 
Symbol 
ID5736954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6212214 
End bp6213245 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content55% 
IMG OID641282043 
ProductUBA/THIF-type NAD/FAD binding protein 
Protein accessionYP_001547635 
Protein GI159901388 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.360367 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCAGTA GTGGGATACC ATCAGCACGC TATAGTCGTC AAACGCGGTT TGCTGGGCTA 
GGCCAAGCCG GGCAGCAGCG CTTAGCCCAA GCACGAGTGG CGATTGTTGG CTTGGGTGCA
ACCGGTAGCA CCATCGCCCA TGCCTTGCTA CGAGCGGGCG TAGGCTATTT GCGGCTAATC
GACCGCGATT GGGTTGAGGA GCATAATTTG CCGCGCCAAA GTTTGTATAC CGAGGCTGAT
GCTGCTCAGT TAGTGCCCAA AGTTGTGGCT GCCAAAGCCC ATGCCCAGCG CATCAACAGT
GCTTGCGACA TTGAGGCGTT GGTGCTCGAT TTACATGCTG GCACGATTGA TCAAGCACTA
GCTGGGGTCG ATTTAATTAT GGATGGCAGC GATAGCCTCG AAACCCGCTT GCTGATCAAT
CAATGGTGTG TGCGTGAAGG CAAGCCATGG ATTTATAGTG GCGTGTTGGG TGGCCATGGC
ATGACCGCAA ATTTTCGGCC CAAGCAAGCC TGTTGGCGCT GTGTTTTTAC GACTTCGCCG
GAGCCAGGCA GCATGCCAAC GTGTGAAACC GCTGGCGTGA TTGGGCCAGT CGTGGGTGTT
ATTGGCAATT TGGCGGCAAC TGAAGCACTT AAATTGCTCA GTGGGCAAGG CCAAGCCAAC
CCCGATTTAT ATATGCTCGA TCTGTGGGCT TGGCAATTTG AGCAGCTGCC GCTGCCAACG
CCGCGCCCCG ATTGCCCAGT TTGTGGCTTG CGCCAATTCG ATTTGCTGGA GCAAGATAGT
GCGCCAACCC TGAGTTTATG TGGCCGCAAC GCCATCCAAA TTCGGCCACA ACAGCCAATC
ACCATGGCCT TAGCCCAATT GGCAGCCCAT TTGCAGCAAG CCGATCTGCG AGTGATTCAA
ACCGACTATC TGCTACGCTT TGCGGCTGAA ACCTTGCAGG CCACCGTGTT TCCTGATGGC
CGCGTGATTA TCAGCGGCAC CGATGATCCA GCGCTGGCAC GCGGATTTTA CAATCGCTGG
ATTAACCATT AA
 
Protein sequence
MPSSGIPSAR YSRQTRFAGL GQAGQQRLAQ ARVAIVGLGA TGSTIAHALL RAGVGYLRLI 
DRDWVEEHNL PRQSLYTEAD AAQLVPKVVA AKAHAQRINS ACDIEALVLD LHAGTIDQAL
AGVDLIMDGS DSLETRLLIN QWCVREGKPW IYSGVLGGHG MTANFRPKQA CWRCVFTTSP
EPGSMPTCET AGVIGPVVGV IGNLAATEAL KLLSGQGQAN PDLYMLDLWA WQFEQLPLPT
PRPDCPVCGL RQFDLLEQDS APTLSLCGRN AIQIRPQQPI TMALAQLAAH LQQADLRVIQ
TDYLLRFAAE TLQATVFPDG RVIISGTDDP ALARGFYNRW INH