Gene EcolC_3651 details

Gene Information

Locus tag: EcolC_3651
Symbol:
ID: 6065834
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3997281
End bp: 3998567
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 53%
IMG OID: 641603066
Product: threonine synthase
Protein accession: YP_001726589
Protein GI: 170021635
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0498] Threonine synthase
TIGRFAM ID: [TIGR00260] threonine synthase
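
The length fields in this record can be cross-checked against the coordinates. The sketch below is only an illustration and rests on two assumptions that the record itself does not state: that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single untranslated stop codon. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 3997281
end_bp = 3998567
reported_gene_length = 1287      # bp
reported_protein_length = 428    # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, gene_length == reported_gene_length)
print(protein_length, protein_length == reported_protein_length)
```

Under those assumptions both computed values agree with the reported gene and protein lengths.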


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTCT ACAATCTGAA AGATCACAAT GAGCAGGTCA GCTTTGCGCA AGCCGTAACC 
CAGGGGTTGG GCAAAAATCA GGGGCTGTTT TTTCCGCACG ACCTGCCGGA ATTCAGCCTG
ACTGAAATTG ATGAGATGCT GAAGCTGGAT TTTGTCACCC GCAGTGCGAA GATCCTCTCG
GCGTTTATTG GTGATGAAAT CCCGCAGGAA ATCCTGGAAG AGCGCGTGCG CGCGGCGTTT
GCCTTCCCGG CTCCGGTCGC CAATGTTGAA AGCGATGTCG GTTGTCTGGA ATTGTTCCAC
GGGCCAACGC TGGCATTTAA AGATTTCGGC GGTCGCTTTA TGGCACAAAT GCTGACCCAT
ATTGCGGGCG ATAAGCCAGT GACCATTCTG ACCGCGACCT CCGGTGATAC CGGAGCGGCA
GTGGCTCATG CTTTCTACGG TTTACCGAAT GTGAAAGTGG TTATCCTCTA TCCACGAGGC
AAAATCAGTC CACTGCAAGA AAAACTGTTC TGTACATTGG GCGGCAATAT CGAAACTGTT
GCCATCGACG GCGATTTCGA TGCCTGTCAG GCGCTGGTGA AGCAGGCGTT TGATGATGAA
GAGCTGAAAG TGGCGCTGGG GTTAAACTCA GCTAACTCGA TTAACATCAG CCGTTTGCTG
GCGCAGATTT GCTACTACTT TGAAGCAGTT GCGCAACTGC CGCAGGAAGC GCGCAACCAG
CTGGTTGTCT CGGTGCCAAG CGGAAACTTC GGCGATTTGA CGGCGGGTCT GCTGGCGAAG
TCACTCGGTC TGCCGGTGAA ACGTTTTATT GCTGCGACCA ACGTGAACGA TACCGTGCCA
CGTTTCCTGC ACGACGGTCA GTGGTCACCC AAAGCGACTC AGGCGACGTT ATCCAACGCG
ATGGACGTGA GTCAGCCGAA CAACTGGCCG CGTGTGGAAG AGTTGTTCCG CCGCAAAATC
TGGCAACTGA AAGAGCTGGG TTATGCAGCC GTGGATGATG AAACCACGCA ACAGACAATG
CGTGAGTTAA AAGAACTGGG CTACACCTCG GAGCCGCACG CTGCCGTAGC GTATCGTGCG
CTGCGTGACC AGTTGAATCC AGGCGAATAT GGCTTGTTCC TCGGCACCGC GCATCCGGCG
AAATTTAAAG AGAGCGTGGA AGCGATTCTC GGTGAAACGT TGGATCTGCC AAAAGAGCTG
GCAGAACGTG CTGATTTACC CTTGCTTTCA CATAATCTGC CCGCCGATTT TGCTGCGTTG
CGTAAATTGA TGATGAATCA TCAGTAA
 
Protein sequence
MKLYNLKDHN EQVSFAQAVT QGLGKNQGLF FPHDLPEFSL TEIDEMLKLD FVTRSAKILS 
AFIGDEIPQE ILEERVRAAF AFPAPVANVE SDVGCLELFH GPTLAFKDFG GRFMAQMLTH
IAGDKPVTIL TATSGDTGAA VAHAFYGLPN VKVVILYPRG KISPLQEKLF CTLGGNIETV
AIDGDFDACQ ALVKQAFDDE ELKVALGLNS ANSINISRLL AQICYYFEAV AQLPQEARNQ
LVVSVPSGNF GDLTAGLLAK SLGLPVKRFI AATNVNDTVP RFLHDGQWSP KATQATLSNA
MDVSQPNNWP RVEELFRRKI WQLKELGYAA VDDETTQQTM RELKELGYTS EPHAAVAYRA
LRDQLNPGEY GLFLGTAHPA KFKESVEAIL GETLDLPKEL AERADLPLLS HNLPADFAAL
RKLMMNHQ