Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3049 |
Symbol | |
ID | 5734921 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3850970 |
End bp | 3852388 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280193 |
Product | pyridine nucleotide-disulphide oxidoreductase dimerisation region |
Protein accession | YP_001545815 |
Protein GI | 159899568 |
COG category | [C] Energy production and conversion |
COG ID | [COG1249] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0397378 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGATT TACTGGTAAT TGGTGGCGGC TCGGCGGGGA TTACCTTTGC AAAATTTGGC GCTTCGTTGG GGGCAAAAAT TACCGTAATC GAGGCCAACA AGCTCGGTGG CGATTGTACT TGGACGGGCT GTGTGCCAAG TAAAAGCTTA ATTCACGCTG CCAAAATTGC TCATACCACG GCAACTGCTG CCCGTTATGG GATCAGCGCC CAGCCAAGCA TCGATTTTGC CGCAGTGATG GGCTATGTGC ATTCTGTGCA GCAGCAAATT TATCAACACG ATGATGCGCC TGAGGTGCTG CGTCAAGCGG GTGCACGGGT AATCGAAGGG CGTGCGCGTT TTTATGATGA TCAAACGGTG GAAGTTAATG GCGAGTTACT ACGAGCCAAG CACTTTTGCA TCGCGACTGG CTCGCATCCC AAAATTCCGA CGATCCCTGG TTTGGCCGAG GCGGGCTACC TCACCAACGA AGATGTTTTT TTACTAGAAG CATTGCCCAA GCGAATTGTT GTGCTTGGTG GTGGGCCGAT TGGCTGTGAG CTAGGCCAAG CACTATTTCG CTTGGGAGCT GAAGTGACGA TTATTCAACA AGGCCCACGC CTGTTGCCCA AAGATGACCA TGCCATGGGT GCGGCTTTGG CTCAAGCACT TAAATCCGAG GGCTTACAGC TTTACCTCAA CACCAAAACC CTCAAGGTCG AGTTGCAGGC TGGTGCAAAA CAGCTCACGA TTCAAACTGC CAATAACCAA CCCCAAACCA TCGTTGCTGA TGCAATTTTA GTGGCAGCGG GTCGTACCCC GAATCTACAT AATTTGGGTT TGGATGCAGC AGGCATTTTG TATGACCCTG AACAGCGCAT TCATGTTGAC CACTATTTGC GCACCAGTAA TCCACGAGTG TTTGCCTGTG GCGATGTGAT TGGGCGCTAT CAATTTACCC ATGTGGCGGC ACAAGAAGCA GGCTTGGTGC TACGCAATGC ACTTTTTCCA GGCCAAAGCG CCATGAAATA CGAATTAGTG CCGTGGGCAA CCTTTACCGA CCCCGAAGTT GGCCATGTTG GCTTGAATGA AGACCAAGCG CGGGCCAAGT ATGGCAGCAG CTTGCGGGTG TATGAATTGC CGTGGAGCGC CAACGACCGT GCTCGCACCG AGGATGCAAC TCAGGGCTTT ACCAAAATTT TGGCGGTTGG CCGCAAAGAA CAAATTGTCG GCGTGCATAT TATCGGCCAA GGCGCTGGCG ATATGATCAA TGCCGCAGTG CTAGCGATGG GCACGGGTGT CAGTGCCTCA AAACTCGGTG GCTTAATTAA TGTTTATCCA ACCCGCTCGC AAGGTCTCAA AATGACCGCC CAACGCTCGT TTACCCGCTG GCTCGAAAAA CCATGGCTGC AACGAGCGTT ACGCTGGTAT TTTCGCTAA
|
Protein sequence | MDDLLVIGGG SAGITFAKFG ASLGAKITVI EANKLGGDCT WTGCVPSKSL IHAAKIAHTT ATAARYGISA QPSIDFAAVM GYVHSVQQQI YQHDDAPEVL RQAGARVIEG RARFYDDQTV EVNGELLRAK HFCIATGSHP KIPTIPGLAE AGYLTNEDVF LLEALPKRIV VLGGGPIGCE LGQALFRLGA EVTIIQQGPR LLPKDDHAMG AALAQALKSE GLQLYLNTKT LKVELQAGAK QLTIQTANNQ PQTIVADAIL VAAGRTPNLH NLGLDAAGIL YDPEQRIHVD HYLRTSNPRV FACGDVIGRY QFTHVAAQEA GLVLRNALFP GQSAMKYELV PWATFTDPEV GHVGLNEDQA RAKYGSSLRV YELPWSANDR ARTEDATQGF TKILAVGRKE QIVGVHIIGQ GAGDMINAAV LAMGTGVSAS KLGGLINVYP TRSQGLKMTA QRSFTRWLEK PWLQRALRWY FR
|
| |