Gene EcSMS35_2634 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2634 
Symbol 
ID6147381 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2692767 
End bp2694482 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content55% 
IMG OID641617505 
Producthydrogenase-4, G subunit 
Protein accessionYP_001744670 
Protein GI170680711 
COG category[C] Energy production and conversion 
COG ID[COG3261] Ni,Fe-hydrogenase III large subunit
[COG3262] Ni,Fe-hydrogenase III component G 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGTTA ATTCATCGTC AAATCGTGGC GAAGCGATTC TCGCCGCCCT GAAAACGCAG 
TTCCCCGGCG CGGTGCTGGA TGAAGAGCGA CAAACGCCTG AACAGGTCAC CATTACGGTG
AAAATCAATC TGCTGCCTGA CGTTGTGCAT TATCTTTATT ATCAACATGA TGGCTGGCTT
CCAGTCCTGT TTGGCAACGA CGAGCGGACA CTTAACGGTC ATTACGCGGT TTATTATGCC
CTTTCTATGG AAGGGGCCGA AAAATGCTGG ATCGTGGTGA AGGCACTGGT CGATGCCGAC
AGTCGGGAGT TTCCGTCAGT CACACCGCGC GTCCCTGCCG CGGTCTGGGG CGAGCGAGAA
ATTCGCGATA TGTACGGGCT GATTCCGGTT GGCCTGCCGG ATCAGCGTCG GCTGGTGTTG
CCCGATGACT GGCCGGAAGA TATGCATCCG CTGCGCAAAG ACGCGATGGA TTATCGACTG
CGCCCAGAAC CGACGACTGA TACCGAAACG TATCCGTTTA TCAACGAGGG CAACAGCGAT
GCGCGGGTGA TCCCTGTCGG CCCGCTGCAT ATCACCTCCG ATGAACCAGG TCACTTCCGC
TTGTTTGTGG ATGGCGAGCA AATTGTCGAT GCTGATTACC GCCTGTTTTA TGTCCATCGC
GGCATGGAGA AACTGGCAGA AACGCGGATG GGCTACAACG AAGTGACCTT CTTATCCGAC
CGCGTGTGTG GGATTTGCGG TTTTGCCCAC AGTGTGGCCT ATACCAACTC GGTTGAAAAT
GCACTGGGGA TTGAGGTGCC GCAACGAGCG CATACTATTC GCTCGATTCT GCTGGAAGTC
GAACGGCTGC ATAGTCATTT GCTCAACCTT GGCCTCTCCT GCCATTTTGT TGGTTTTGAT
ACCGGCTTTA TGCAATTTTT CCGCGTGCGG GAAAAGTCGA TGACGATGGC GGAATTGCTG
ACCGGGTCGC GTAAAACCTA CGGTCTGAAT CTGATTGGTG GTGTTCGCCG CGATATTCTC
AAAGAGCAAC GTCTGCAAAC GCTGAAACTG GTGCGCGAGA TGCGCGCCGA CGTGTCGGAG
CTGGTAGAGA TGCTGCTTGC CACGCCGAAT ATGGAACAAC GCACTCAGGG CATTGGCATT
CTCGACCGAC AAATCGCCCG TGATTATAGC CCTGTCGGGC CGCTGATCCG CGGCAGTGGT
TTTGCCCGTG ATTTGCGCTT TGATCACCCC TACGCCGACT ACGGTAATAT TCCCAAAACG
CTGTTTACCT TTACCGGCGG CGATGTCTTC TCCCGCGTGA TGGTCCGTGT CAAAGAGACG
TTTGATTCGC TGGCAATGCT GGAATTTGCC CTCGACAACA TGCCGGATAC CCCACTGCTG
ACCGAAGGCT TTAGCTATAA ACCTCACGCA TTCGCGCTGG GCTTTGTTGA AGCGCCACGC
GGTGAAGACG TGCACTGGAG CATGCTCGGT GATAACCAAA AATTGTTCCG CTGGCGCTGT
CGTGCCGCCA CCTACGCCAA CTGGCCAGTG CTGCGTTACA TGCTGCGCGG CAATACCGTT
TCTGACGCAC CGCTGATTAT CGGTAGCCTT GATCCCTGTT ACTCCTGTAC CGACCGTGTG
ACGCTGGTTG ATGTGCGCAA GCGCCAGTCA AAAACCGTGC CGTATAAAGA GATCGAACGC
TACGGCATTG ATCGTAACCG TTCGCCGCTG AAGTAA
 
Protein sequence
MNVNSSSNRG EAILAALKTQ FPGAVLDEER QTPEQVTITV KINLLPDVVH YLYYQHDGWL 
PVLFGNDERT LNGHYAVYYA LSMEGAEKCW IVVKALVDAD SREFPSVTPR VPAAVWGERE
IRDMYGLIPV GLPDQRRLVL PDDWPEDMHP LRKDAMDYRL RPEPTTDTET YPFINEGNSD
ARVIPVGPLH ITSDEPGHFR LFVDGEQIVD ADYRLFYVHR GMEKLAETRM GYNEVTFLSD
RVCGICGFAH SVAYTNSVEN ALGIEVPQRA HTIRSILLEV ERLHSHLLNL GLSCHFVGFD
TGFMQFFRVR EKSMTMAELL TGSRKTYGLN LIGGVRRDIL KEQRLQTLKL VREMRADVSE
LVEMLLATPN MEQRTQGIGI LDRQIARDYS PVGPLIRGSG FARDLRFDHP YADYGNIPKT
LFTFTGGDVF SRVMVRVKET FDSLAMLEFA LDNMPDTPLL TEGFSYKPHA FALGFVEAPR
GEDVHWSMLG DNQKLFRWRC RAATYANWPV LRYMLRGNTV SDAPLIIGSL DPCYSCTDRV
TLVDVRKRQS KTVPYKEIER YGIDRNRSPL K