Gene Haur_3049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3049 
Symbol 
ID5734921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3850970 
End bp3852388 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content52% 
IMG OID641280193 
Productpyridine nucleotide-disulphide oxidoreductase dimerisation region 
Protein accessionYP_001545815 
Protein GI159899568 
COG category[C] Energy production and conversion 
COG ID[COG1249] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0397378 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGATT TACTGGTAAT TGGTGGCGGC TCGGCGGGGA TTACCTTTGC AAAATTTGGC 
GCTTCGTTGG GGGCAAAAAT TACCGTAATC GAGGCCAACA AGCTCGGTGG CGATTGTACT
TGGACGGGCT GTGTGCCAAG TAAAAGCTTA ATTCACGCTG CCAAAATTGC TCATACCACG
GCAACTGCTG CCCGTTATGG GATCAGCGCC CAGCCAAGCA TCGATTTTGC CGCAGTGATG
GGCTATGTGC ATTCTGTGCA GCAGCAAATT TATCAACACG ATGATGCGCC TGAGGTGCTG
CGTCAAGCGG GTGCACGGGT AATCGAAGGG CGTGCGCGTT TTTATGATGA TCAAACGGTG
GAAGTTAATG GCGAGTTACT ACGAGCCAAG CACTTTTGCA TCGCGACTGG CTCGCATCCC
AAAATTCCGA CGATCCCTGG TTTGGCCGAG GCGGGCTACC TCACCAACGA AGATGTTTTT
TTACTAGAAG CATTGCCCAA GCGAATTGTT GTGCTTGGTG GTGGGCCGAT TGGCTGTGAG
CTAGGCCAAG CACTATTTCG CTTGGGAGCT GAAGTGACGA TTATTCAACA AGGCCCACGC
CTGTTGCCCA AAGATGACCA TGCCATGGGT GCGGCTTTGG CTCAAGCACT TAAATCCGAG
GGCTTACAGC TTTACCTCAA CACCAAAACC CTCAAGGTCG AGTTGCAGGC TGGTGCAAAA
CAGCTCACGA TTCAAACTGC CAATAACCAA CCCCAAACCA TCGTTGCTGA TGCAATTTTA
GTGGCAGCGG GTCGTACCCC GAATCTACAT AATTTGGGTT TGGATGCAGC AGGCATTTTG
TATGACCCTG AACAGCGCAT TCATGTTGAC CACTATTTGC GCACCAGTAA TCCACGAGTG
TTTGCCTGTG GCGATGTGAT TGGGCGCTAT CAATTTACCC ATGTGGCGGC ACAAGAAGCA
GGCTTGGTGC TACGCAATGC ACTTTTTCCA GGCCAAAGCG CCATGAAATA CGAATTAGTG
CCGTGGGCAA CCTTTACCGA CCCCGAAGTT GGCCATGTTG GCTTGAATGA AGACCAAGCG
CGGGCCAAGT ATGGCAGCAG CTTGCGGGTG TATGAATTGC CGTGGAGCGC CAACGACCGT
GCTCGCACCG AGGATGCAAC TCAGGGCTTT ACCAAAATTT TGGCGGTTGG CCGCAAAGAA
CAAATTGTCG GCGTGCATAT TATCGGCCAA GGCGCTGGCG ATATGATCAA TGCCGCAGTG
CTAGCGATGG GCACGGGTGT CAGTGCCTCA AAACTCGGTG GCTTAATTAA TGTTTATCCA
ACCCGCTCGC AAGGTCTCAA AATGACCGCC CAACGCTCGT TTACCCGCTG GCTCGAAAAA
CCATGGCTGC AACGAGCGTT ACGCTGGTAT TTTCGCTAA
 
Protein sequence
MDDLLVIGGG SAGITFAKFG ASLGAKITVI EANKLGGDCT WTGCVPSKSL IHAAKIAHTT 
ATAARYGISA QPSIDFAAVM GYVHSVQQQI YQHDDAPEVL RQAGARVIEG RARFYDDQTV
EVNGELLRAK HFCIATGSHP KIPTIPGLAE AGYLTNEDVF LLEALPKRIV VLGGGPIGCE
LGQALFRLGA EVTIIQQGPR LLPKDDHAMG AALAQALKSE GLQLYLNTKT LKVELQAGAK
QLTIQTANNQ PQTIVADAIL VAAGRTPNLH NLGLDAAGIL YDPEQRIHVD HYLRTSNPRV
FACGDVIGRY QFTHVAAQEA GLVLRNALFP GQSAMKYELV PWATFTDPEV GHVGLNEDQA
RAKYGSSLRV YELPWSANDR ARTEDATQGF TKILAVGRKE QIVGVHIIGQ GAGDMINAAV
LAMGTGVSAS KLGGLINVYP TRSQGLKMTA QRSFTRWLEK PWLQRALRWY FR