Gene EcSMS35_2854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2854 
SymbolhypD 
ID6147136 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2928028 
End bp2929149 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content56% 
IMG OID641617723 
Producthydrogenase expression/formation protein HypD 
Protein accessionYP_001744878 
Protein GI170680888 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0409] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00075] hydrogenase expression/formation protein HypD 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.422409 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTTTTG TTGATGAATA TCGCGCGCCG GAACAGGTGA TGCAGTTAAT TGAGCATCTG 
CGCGAACGTG CTTCACATCT CTCTTACACC GCCGAACGCC CTCTGCGGAT TATGGAAGTG
TGTGGCGGTC ATACCCACGC CATTTTTAAA TTCGGCCTCG ACCAGTTGCT GCCGGAAAAC
GTTGAGTTTA TCCACGGTCC CGGTTGCCCG GTGTGTGTAC TGCCGATGGG TAGAATCGAC
ACCTGCGTGG AGATTGCCAG CCATCCGGAA GTCATCTTCT GTACCTTTGG CGACGCCATG
CGCGTACCAG GGAAACAGGG TTCGCTGTTA CAGGCAAAGG CACGCGGTGC CGATGTGCGC
ATCGTTTACT CGCCGATGGA TGCGTTGAAA CTGGCGCAGG AGAATCCAAC CCGCAAAGTG
GTGTTCTTCG GCTTGGGTTT TGAAACCACC ATGCCGACCA CCGCCATCAC TCTGCAACAG
GCAAAAGCCC GCGATGTGCA GAATTTTTAC TTCTTCTGCC AGCACATTAC GCTCATCCCA
ACACTGCGCA GTTTGCTGGA ACAGCCGGAT AACGGTATCG ACGCGTTCCT CGCGCCCGGC
CACGTCAGTA TGGTGATCGG CACTGATGCC TATAATTTTA TCGCCAGCGA TTTTCATCGT
CCGCTGGTGG TGGCTGGTTT CGAACCGCTT GATCTACTGC AAGGCGTGGT CATGCTGGTG
GAGCAGAAAA TAGCGGCCCA CAGCAAGGTA GAGAATCAGT ATCGTCGGGT GGTGCCGGAT
GCCGGTAACC TGCTGGCGCA ACAGGCGATT GCCGATGTGT TCTGTGTCAA CGGCGACAGC
GAATGGCGCG GCTTAGGCGT GATTGAATCT TCTGGTGTAC ACCTGACGCC GGATTATCAA
CGATTCGATG CCGAAGCGCA TTTCCGCCCG GCACCGCAGC AGGTCTGCGA TGACCCGCGC
GCACGTTGTG GCGAAGTATT AACGGGCAAA TGTAAGCCGC ATCAATGCCC GCTGTTTGGT
AACACCTGTA ATCCTCAAAC CGCGTTTGGT GCGCTGATGG TTTCCTCCGA AGGAGCGTGC
GCCGCGTGGT ATCAGTATCG TCAGCAGGAG AGTGAAGCGT GA
 
Protein sequence
MRFVDEYRAP EQVMQLIEHL RERASHLSYT AERPLRIMEV CGGHTHAIFK FGLDQLLPEN 
VEFIHGPGCP VCVLPMGRID TCVEIASHPE VIFCTFGDAM RVPGKQGSLL QAKARGADVR
IVYSPMDALK LAQENPTRKV VFFGLGFETT MPTTAITLQQ AKARDVQNFY FFCQHITLIP
TLRSLLEQPD NGIDAFLAPG HVSMVIGTDA YNFIASDFHR PLVVAGFEPL DLLQGVVMLV
EQKIAAHSKV ENQYRRVVPD AGNLLAQQAI ADVFCVNGDS EWRGLGVIES SGVHLTPDYQ
RFDAEAHFRP APQQVCDDPR ARCGEVLTGK CKPHQCPLFG NTCNPQTAFG ALMVSSEGAC
AAWYQYRQQE SEA