Gene EcolC_2050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2050 
Symbol 
ID6067750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2262577 
End bp2263596 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content45% 
IMG OID641601462 
Productputative dehydrogenase 
Protein accessionYP_001725021 
Protein GI170020067 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0249357 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGCA TATTAATTGA AAAACCGAAT CAACTGGCGA TTGTCGAACG TGAAATACCC 
ACCCCGTCAG CGGGTGAAGT ACGAGTAAAA GTGAAACTTG CCGGAATTTG TGGTTCAGAT
AGCCATATTT ATCGTGGGCA TAATCCTTTT GCGAAATATC CGTGCGTCAT TGGTCATGAA
TTCTTTGGCG TCATTGATGC AGTGGGTGAA GGCGTGGAAA GCGCCAGAGT CGGTGAACGT
GTTGCTGTCG ATCCGGTGGT CAGCTGTGGG CATTGCTATC CGTGCTCTAT AGGTAAACCG
AACGTTTGTA CGACACTGGC TGTATTAGGT GTGCACGCTG ACGGTGGTTT CAGTGAATAT
GCCGTGGTTC CGGCAAAAAA TGCGTGGAAA ATTCCTGAAG CAGTGGCCGA TCAATATGCG
GTAATGATCG AACCTTTTAC CATTGCGGCT AACGTAACCG GACATGGTCA ACCGACTGAA
AATGATACCG TTCTGGTTTA TGGTGCCGGT CCAATCGGCC TGACGATCGT TCAGGTATTA
AAAGGCGTCT ATAACGTTAA AAATGTGATT GTTGCCGATC GCATTGATGA ACGACTGGAA
AAAGCGAAAG AGAGCGGGGC TGACTGGGCG ATTAATAACA GCCAGACACC GCTTGGCGAG
ATTTTCACTG AAAAAGGCAT CAAGCCGACA TTAATTATCG ATGCGGCTTG TCATCCTTCT
ATCCTGAAAG AGGCCGTAAC GCTGGCTTCT CCAGCGGCAC GTATTGTATT GATGGGGTTC
TCCAGTGAAC CGTCTGAAGT GATTCAGCAA GGAATTACCG GAAAAGAACT CTCTATTTTC
TCTTCACGCT TAAATGCAAA TAAATTCCCG ATCGTTATCG ACTGGTTAAG TAAAGGGTTA
ATTAAACCAG AAAAATTAAT TACCCATACG TTTGATTTCC AGCATGTTGC TGATGCCATT
AGTTTATTTG AACAGGATCA AAAACATTGC TGCAAAGTCT TACTCACTTT TTCTGAATAA
 
Protein sequence
MKSILIEKPN QLAIVEREIP TPSAGEVRVK VKLAGICGSD SHIYRGHNPF AKYPCVIGHE 
FFGVIDAVGE GVESARVGER VAVDPVVSCG HCYPCSIGKP NVCTTLAVLG VHADGGFSEY
AVVPAKNAWK IPEAVADQYA VMIEPFTIAA NVTGHGQPTE NDTVLVYGAG PIGLTIVQVL
KGVYNVKNVI VADRIDERLE KAKESGADWA INNSQTPLGE IFTEKGIKPT LIIDAACHPS
ILKEAVTLAS PAARIVLMGF SSEPSEVIQQ GITGKELSIF SSRLNANKFP IVIDWLSKGL
IKPEKLITHT FDFQHVADAI SLFEQDQKHC CKVLLTFSE