Gene EcolC_0983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0983 
Symbol 
ID6067799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1070805 
End bp1071926 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content55% 
IMG OID641600391 
Producthydrogenase expression/formation protein HypD 
Protein accessionYP_001723979 
Protein GI170019025 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0409] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00075] hydrogenase expression/formation protein HypD 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.379099 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTTTTG TTGATGAATA TCGCGCGCCG GAACAGGTGA TGCAGTTAAT TGAGCATCTG 
CGCGAACGTG CTTCACATCT CTCTTACACC GCCGAACGCC CTCTGCGGAT TATGGAAGTG
TGTGGCGGTC ATACCCACGC TATCTTTAAA TTCGGCCTCG ACCAGTTACT GCCGGAAAAC
GTTGAGTTTA TCCACGGTCC GGGGTGCCCG GTGTGCGTAC TGCCGATGGG TAGAATCGAC
ACCTGCGTGG AGATTGCCAG CCATCCGGAA GTCATCTTCT GTACCTTTGG CGACGCCATG
CGCGTGCCGG GGAAACAGGG ATCGCTGTTG CAGGCAAAAG CACGCGGTGC CGATGTGCGC
ATCGTTTACT CGCCGATGGA TGCGTTGAAA CTGGCGCAGG AGAATCCAAC CCGCAAAGTG
GTGTTCTTCG GCTTAGGTTT TGAAACCACT ATGCCGACCA CCGCTATCAC TCTGCAACAG
GCGAAAGCGC GTGATGTGCA GAATTTTTAC TTCTTCTGCC AGCACATTAC GCTTATCCCG
ACGTTGCGCA GTTTGCTGGA ACAGCCGGAT AACGGTATCG ATGCGTTCCT CGCGCCGGGT
CACGTCAGTA TGGTTATCGG CACCGACGCC TATAATTTTA TCGCCAGCGA TTTTCATCGT
CCGCTGGTGG TTGCTGGATT CGAACCCCTT GATCTACTAC AAGGCGTGGT CATGCTGGTG
CAGCAGAAAA TAGCGGCCCA CAGCAAGGTA GAGAATCAGT ATCGTCGAGT GGTACCGGAT
GCCGGTAACC TGCTGGCGCA ACAGGCGATT GCCGATGTGT TCTGTGTCAA CGGCGACAGC
GAATGGCGCG GCTTAGGCGT GATTGAATCT TCTGGCGTGC ACCTGACGCC GGATTATCAA
CGATTCGATG CCGAAGCACA TTTCCGCCCG GCACCGCAGC AGGTCTGCGA TGACCCGCGC
GCGCGTTGTG GTGAGGTATT AACGGGCAAA TGTAAGCCGC ATCAATGCCC GCTGTTTGGT
AACACCTGTA ATCCTCAAAC CGCGTTTGGT GCGCTGATGG TTTCCTCCGA AGGAGCGTGC
GCCGCGTGGT ATCAGTATCG TCAGCAGGAG AGTGAAGCGT GA
 
Protein sequence
MRFVDEYRAP EQVMQLIEHL RERASHLSYT AERPLRIMEV CGGHTHAIFK FGLDQLLPEN 
VEFIHGPGCP VCVLPMGRID TCVEIASHPE VIFCTFGDAM RVPGKQGSLL QAKARGADVR
IVYSPMDALK LAQENPTRKV VFFGLGFETT MPTTAITLQQ AKARDVQNFY FFCQHITLIP
TLRSLLEQPD NGIDAFLAPG HVSMVIGTDA YNFIASDFHR PLVVAGFEPL DLLQGVVMLV
QQKIAAHSKV ENQYRRVVPD AGNLLAQQAI ADVFCVNGDS EWRGLGVIES SGVHLTPDYQ
RFDAEAHFRP APQQVCDDPR ARCGEVLTGK CKPHQCPLFG NTCNPQTAFG ALMVSSEGAC
AAWYQYRQQE SEA