Gene EcSMS35_1989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1989 
Symbolicd 
ID6146030 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2010137 
End bp2011387 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content50% 
IMG OID641616865 
Productisocitrate dehydrogenase 
Protein accessionYP_001744041 
Protein GI170679954 
COG category[C] Energy production and conversion 
COG ID[COG0538] Isocitrate dehydrogenases 
TIGRFAM ID[TIGR00183] isocitrate dehydrogenase, NADP-dependent, prokaryotic type 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.903484 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAGTA AAGTAGTTGT TCCGGCACAA GGCAAGAAGA TCACCCTGCA AAACGGCAAA 
CTCAACGTTC CTGAAAATCC GATTATCCCT TACATTGAAG GTGATGGAAT CGGTGTAGAT
GTAACCCCAG CCATGCTGAA AGTGGTCGAC GCTGCAGTCG AGAAAGCCTA TAAAGGCGAG
CGTAAAATCT CCTGGATGGA AATTTACACC GGTGAAAAAT CCACACAGGT TTATGGTCAG
GATGTCTGGC TACCTGCTGA AACCCTTGAT CTGATTCGTG AATATCGCGT TGCCATCAAA
GGCCCGCTGA CCACTCCGGT TGGTGGCGGT ATTCGCTCTC TCAACGTTGC TCTGCGCCAG
GAACTGGATC TCTACATCTG CCTGCGTCCG GTACGTTACT ATCAGGGCAC TCCAAGCCCG
GTTAAACACC CTGAACTGAC CGATATGGTT ATCTTCCGTG AAAACTCGGA AGACATTTAT
GCGGGTATCG AATGGAAAGC AGACTCTGCC GACGCTGAGA AAGTGATTAA ATTCCTGCGT
GAAGAGATGG GCGTGAAGAA AATTCGCTTC CCGGAACATT GCGGTATCGG TATTAAGCCG
TGTTCTGAAG AAGGCACCAA ACGTCTGGTT CGTGCAGCGA TCGAATACGC AATTGCTAAC
GATCGTGACT CTGTGACCCT GGTGCACAAA GGCAACATCA TGAAGTTCAC CGAAGGCGCG
TTTAAAGACT GGGGCTACCA GCTGGCGCGT GAAGAGTTTG GCGGTGAACT GATCGACGGC
GGCCCGTGGC TGAAAGTTAA AAACCCGAAC ACCGGCAAAG AGATCGTCAT TAAAGACGTG
ATTGCTGATG CATTCCTGCA ACAGATCCTG CTGCGTCCGG CTGAATATGA TGTTATCGCC
TGTATGAACC TGAACGGTGA CTACATTTCT GACGCCCTGG CAGCGCAGGT TGGCGGTATC
GGTATCGCCC CTGGCGCAAA CATCGGTGAC GAATGCGCCC TGTTTGAAGC CACCCACGGT
ACTGCGCCGA AATATGCCGG TCAGGACAAA GTAAACCCAG GCTCTATTAT TCTCTCCGCT
GAGATGATGT TACGCCATAT GGGTTGGACT GAAGCCGCAG ACCTGATTGT TAAAGGTATG
GAAGGCGCGA TCAACGCGAA GACTGTAACC TATGACTTCG AACGTCTGAT GGAAGGCGCT
AAACTGCTGA AATGTTCAGA GTTTGGTGAT GCGATCATCA AGAACATGTA A
 
Protein sequence
MESKVVVPAQ GKKITLQNGK LNVPENPIIP YIEGDGIGVD VTPAMLKVVD AAVEKAYKGE 
RKISWMEIYT GEKSTQVYGQ DVWLPAETLD LIREYRVAIK GPLTTPVGGG IRSLNVALRQ
ELDLYICLRP VRYYQGTPSP VKHPELTDMV IFRENSEDIY AGIEWKADSA DAEKVIKFLR
EEMGVKKIRF PEHCGIGIKP CSEEGTKRLV RAAIEYAIAN DRDSVTLVHK GNIMKFTEGA
FKDWGYQLAR EEFGGELIDG GPWLKVKNPN TGKEIVIKDV IADAFLQQIL LRPAEYDVIA
CMNLNGDYIS DALAAQVGGI GIAPGANIGD ECALFEATHG TAPKYAGQDK VNPGSIILSA
EMMLRHMGWT EAADLIVKGM EGAINAKTVT YDFERLMEGA KLLKCSEFGD AIIKNM