Gene EcSMS35_2631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2631 
SymbolhyfD 
ID6147422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2689091 
End bp2690530 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content55% 
IMG OID641617502 
Producthydrogenase 4 subunit D 
Protein accessionYP_001744667 
Protein GI170682661 
COG category[C] Energy production and conversion
[P] Inorganic ion transport and metabolism 
COG ID[COG1009] NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.960654 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAATC TTGCTCTGAC GACGTTATTG CTGCCTTTTA TCGGCGCACT GGTCGTTTCG 
TTTTCGCCAC AACGTCGGGC CGCCGAATGG GGGATTTTGT TCGCCGCGCT GACCACGCTG
TGCATGTTGT CGCTGATCTC CGCGTTTTAT CAGGCCGATA AAGTTGCCGT CACGTTGACG
TTGGTCAACG TGGGGGATGT GGCGTTGTTT GGCCTGGTCA TTGATCGCGT GAGTACGCTG
ATTCTGTTTG TGGTGGTGTT CCTCGGTTTG CTGGTCACGA TCTACTCCAC GGGTTATCTG
ACGGATAAAA ATCGCGAACA CCCGCATAAC GGCACGAATC GTTATTACGC ATTTTTGCTG
GTGTTTATCG GCGCGATGGC GGGACTGGTA CTCTCCTCAA CGCTGCTCGG TCAGTTGTTG
TTTTTTGAAA TTACGGGCGG CTGCTCCTGG GCGTTGATCA GTTATTACCA GAGCGATAAA
GCGCAGCGTT CAGCACTAAA AGCGTTACTT ATCACTCATA TCGGTTCGCT GGGGTTGTAT
CTTGCCGCCG CCACGCTGTT TTTGCAGACC GGAACGTTTG CGCTTAGCGC GATGAGCGAG
TTACACGGCG ACGCACGTTA TCTGGTTTAT GGCGGCATTC TGTTTGCCGC GTGGGGGAAA
TCGGCCCAGC TACCGATGCA AGCGTGGCTA CCGGATGCAA TGGAAGCGCC AACACCGATC
AGCGCCTATC TCCACGCCGC ATCGATGGTG AAAGTGGGCG TTTACATTTT TGCCCGTGCC
ATTATCGACG GCGGCAATAT CCCGCATGTG ATTGGCGGCG TTGGCATGGT TATGGCACTG
GTCACCATTC TTTACGGCTT CCTGATGTAT TTGCCACAGC AGGATATGAA GCGGTTGCTG
GCCTGGTCGA CCATCACTCA ACTTGGCTGG ATGTTTTTCG GCTTGTCGCT CTCCATCTTC
GGCTCGCGGC TGTCGCTGGA GGGCAGCATC GCCTACATCG TCAACCACGC GTTCGCTAAA
AGCCTGTTTT TCCTTGTAGC AGGTGCGCTG AGTTACAGCT GCGGCACGCG CTTGTTGCCG
CGTCTGCGTG GCGTATTGCA CACCCTGCCG TTGCCAGGCG TGGGTTTCTG CGTAGCCGCG
CTGGCGATTA CTGGCGTACC GCCGTTCAAC GGCTTCTTCA GTAAATTCCC GCTGTTTGCT
GCCGGTTTTT CGTTGTCAGT GGAGTACTGG ATCCTGCTGC CCGCCATGAT TCTGCTGATG
ATTGAATCGG TCGCCAGTTT CGCCTGGTTT ATTCGCTGGT TTGGTCGCGT CGTGCCTGGC
AAACCGAGCG AGGCCGTCGC CGATGCCGCA CCGCTGCCAG GATCAATGCG CCTGGTGTTG
ATTGTACTGA TTGTGATGTC GCTGATTTCC AGCGTAATCG CCGCGACCTG GTTGCAGTAA
 
Protein sequence
MENLALTTLL LPFIGALVVS FSPQRRAAEW GILFAALTTL CMLSLISAFY QADKVAVTLT 
LVNVGDVALF GLVIDRVSTL ILFVVVFLGL LVTIYSTGYL TDKNREHPHN GTNRYYAFLL
VFIGAMAGLV LSSTLLGQLL FFEITGGCSW ALISYYQSDK AQRSALKALL ITHIGSLGLY
LAAATLFLQT GTFALSAMSE LHGDARYLVY GGILFAAWGK SAQLPMQAWL PDAMEAPTPI
SAYLHAASMV KVGVYIFARA IIDGGNIPHV IGGVGMVMAL VTILYGFLMY LPQQDMKRLL
AWSTITQLGW MFFGLSLSIF GSRLSLEGSI AYIVNHAFAK SLFFLVAGAL SYSCGTRLLP
RLRGVLHTLP LPGVGFCVAA LAITGVPPFN GFFSKFPLFA AGFSLSVEYW ILLPAMILLM
IESVASFAWF IRWFGRVVPG KPSEAVADAA PLPGSMRLVL IVLIVMSLIS SVIAATWLQ