Gene EcHS_A0005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0005 
SymbolthrC 
ID5591094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3733 
End bp5019 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content53% 
IMG OID640919194 
Productthreonine synthase 
Protein accessionYP_001456789 
Protein GI157159471 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value0.817747 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTCT ACAATCTGAA AGATCACAAC GAGCAGGTCA GCTTTGCGCA AGCCGTAACC 
CAGGGGTTGG GCAAAAATCA GGGGCTGTTT TTTCCGCATG ACCTGCCGGA ATTCAGCCTG
ACTGAAATTG ATGAGATGCT GAAGCTGGAT TTTGTCACCC GCAGTGCGAA GATCCTCTCG
GCGTTTATTG GTGATGAAAT CCCGCAGGAA ATCCTGGAAG AGCGCGTGCG CGCGGCGTTT
GCCTTCCCGG CTCCGGTCGC CAATGTTGAA AGCGATGTCG GTTGTCTGGA ATTGTTCCAC
GGGCCAACGC TGGCATTTAA AGATTTCGGC GGTCGCTTTA TGGCACAAAT GCTGACCCAT
ATTGCGGGCG ATAAGCCAGT GACCATTCTG ACCGCGACCT CCGGTGATAC CGGAGCGGCA
GTGGCTCATG CTTTCTACGG TTTACCGAAT GTGAAAGTGG TTATCCTCTA TCCACGAGGC
AAAATCAGTC CACTGCAAGA AAAACTGTTC TGTACGTTGG GCGGCAATAT CGAAACTGTT
GCCATCGACG GCGATTTCGA TGCCTGTCAG GCGCTGGTGA AGCAGGCGTT TGATGATGAA
GAACTGAAAG TGGCGCTGGG GTTAAACTCA GCTAACTCGA TTAACATCAG CCGTTTGCTG
GCGCAGATTT GCTACTACTT TGAAGCTGTT GCGCAGCTGC CGCAGGAAGC GCGCAACCAG
CTGGTTGTCT CGGTGCCAAG CGGAAACTTC GGCGATTTGA CGGCGGGTCT GCTGGCGAAG
TCACTCGGTC TGCCGGTGAA ACGTTTTATT GCTGCGACCA ACGTGAACGA TACCGTGCCA
CGTTTCCTGC ACGACGGTCA GTGGTCACCC AAAGCGACTC AGGCGACGTT ATCCAACGCG
ATGGACGTGA GTCAGCCGAA CAACTGGCCG CGTGTGGAAG AGTTGTTCCG CCGCAAAATC
TGGCAACTGA AAGAGCTGGG TTATGCAGCC GTTGATGATG AAACCACGCA ACAGACAATG
CGTGAGTTAA AAGAACTGGG CTACACCTCG GAGCCGCACG CTGCCGTAGC GTATCGTGCG
CTGCGTGACC AGTTGAATCC AGGCGAATAT GGCTTGTTCC TCGGCACCGC GCATCCGGCG
AAATTTAAAG AGAGCGTGGA AGCGATTCTC GGTGAAACGT TGGATCTGCC AAAAGAGCTG
GCAGAACGTG CTGATTTACC CTTGCTTTCA CATAATCTGC CCGCCGATTT TACTGCGTTG
CGTAAATTGA TGATGAATCA TCAGTAA
 
Protein sequence
MKLYNLKDHN EQVSFAQAVT QGLGKNQGLF FPHDLPEFSL TEIDEMLKLD FVTRSAKILS 
AFIGDEIPQE ILEERVRAAF AFPAPVANVE SDVGCLELFH GPTLAFKDFG GRFMAQMLTH
IAGDKPVTIL TATSGDTGAA VAHAFYGLPN VKVVILYPRG KISPLQEKLF CTLGGNIETV
AIDGDFDACQ ALVKQAFDDE ELKVALGLNS ANSINISRLL AQICYYFEAV AQLPQEARNQ
LVVSVPSGNF GDLTAGLLAK SLGLPVKRFI AATNVNDTVP RFLHDGQWSP KATQATLSNA
MDVSQPNNWP RVEELFRRKI WQLKELGYAA VDDETTQQTM RELKELGYTS EPHAAVAYRA
LRDQLNPGEY GLFLGTAHPA KFKESVEAIL GETLDLPKEL AERADLPLLS HNLPADFTAL
RKLMMNHQ