Gene Haur_3082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3082 
Symbol 
ID5734954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3890993 
End bp3892261 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content52% 
IMG OID641280226 
ProductNADH dehydrogenase I, D subunit 
Protein accessionYP_001545848 
Protein GI159899601 
COG category[C] Energy production and conversion 
COG ID[COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 
TIGRFAM ID[TIGR01962] NADH dehydrogenase I, D subunit 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGATGC ACGCACATGC GTATCCCGAT ATTGAAATTC CAACACCTTC GCAAATTGCG 
GCGGTGGCAT TAAATAACGA GCCTGATGAA AACGGCGAGC AATCCATGGT GCTCAGCATG
GGGCCACAAC ACCCTTCAAC GCACGGGGTG CTACGGCTAG CCGTCGAACT CAATGGCGAA
AACGTGGTGC AACTGGCTCC CGACGTGGGC TTTTTGCACA CCGGCATCGA AAAAACCATC
GAAACCAAAA CCTACACCAA ATCAATTGTG CTGACCGACC GCATGGATTA CCTTGCGCCG
ATGTCGAACA ATATGGTGTA TGTGGCGGCG ATTGAAAAAT TGATGCAGCA AGATGTGCCA
GAACGCGCCC AAACCTTGCG GGTGTTGTTG CTGGAACTGA CGCGGATCAA CTCGCACTTG
GTTTGGTTGG GCACGCACGC GCTCGACTTG GCCGCGATGA GTGTGTTTTT TTATGCAATG
CGTGAACGTG AGTATATTCT CGATATTTTC GAGATGCTGA CTGGCGCACG TATGATGACC
AGCTATTTCC GCGTTGGTGG TTTGGCATGG GATATTCCAG CCGACTTTAT TCCAACTGTG
GAAGATTTCT TGACCCACTT TTTGCCCAAG GTCGATGAAT ATGAAGATTT GTTGACCAAT
AACTTGTTGT GGAAACAACG CACCAAGGGC GTAGGCGTGG TCAATGCTGC TGATGCAATT
GCCCTCGGCT TGTCGGGCGC GAACCTACGG GCCAGCGGTG TCGATTGGGA TTTGCGCAAA
ACCATGCCCT ACAGCGGCTA CGAAACCTAC CAATTCGATG TACCAGTCGG CCAAGCTGGC
GATATTTACG ATCGCTATCG CTGTCGGGTG CAAGAAATGC GCGAAAGCGT CAAGATTGCC
CGTCAAGCGA TCGAACGCGT CAAGCAAATG CATGGCCAGC CCTATGTGAC CGAAAATCGT
AAGGTTGCGC CACCACCCAA GAGCGAAATT ACCTACAGCA TGGAATCGCT CATTCACCAC
TTTAAGCTGT GGACGGAAGG CTTCCGCCCA CCGCGTGGTT CGGCCTATGC GGCAATCGAA
TCACCACGAG GTGAAATTGG CTGTTATGTG GTGAGCGATG GCACTCCCAA ACCATGGCGT
GTCCACTTCC GCGCACCGTC ATTTATTAAT TTGCAAGCCT TGCCCCACAT CGCCAAAGGC
AAATTGATGG CCGACTTGGT GGCATTGATT GCGAGCATCG ATCCGGTGCT TGGTGAAGTG
GATCGTTAA
 
Protein sequence
MVMHAHAYPD IEIPTPSQIA AVALNNEPDE NGEQSMVLSM GPQHPSTHGV LRLAVELNGE 
NVVQLAPDVG FLHTGIEKTI ETKTYTKSIV LTDRMDYLAP MSNNMVYVAA IEKLMQQDVP
ERAQTLRVLL LELTRINSHL VWLGTHALDL AAMSVFFYAM REREYILDIF EMLTGARMMT
SYFRVGGLAW DIPADFIPTV EDFLTHFLPK VDEYEDLLTN NLLWKQRTKG VGVVNAADAI
ALGLSGANLR ASGVDWDLRK TMPYSGYETY QFDVPVGQAG DIYDRYRCRV QEMRESVKIA
RQAIERVKQM HGQPYVTENR KVAPPPKSEI TYSMESLIHH FKLWTEGFRP PRGSAYAAIE
SPRGEIGCYV VSDGTPKPWR VHFRAPSFIN LQALPHIAKG KLMADLVALI ASIDPVLGEV
DR