Gene EcSMS35_3037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3037 
SymbolgcvT 
ID6144606 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3127351 
End bp3128445 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content55% 
IMG OID641617906 
Productglycine cleavage system aminomethyltransferase T 
Protein accessionYP_001745057 
Protein GI170683423 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0404] Glycine cleavage system T protein (aminomethyltransferase) 
TIGRFAM ID[TIGR00528] glycine cleavage system T protein 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.318867 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACAAC AGACTCCTTT GTACGAACAA CACACGCTTT GCGGCGCTCG CATGGTGGAT 
TTCCACGGCT GGATGATGCC GCTGCATTAC GGTTCGCAAA TCGACGAACA TCATGCGGTA
CGTACCGATG CCGGAATGTT TGATGTGTCA CATATGACCA TCGTCGATCT TCGCGGCAGC
CGCACCCGGG AGTTTTTGCG TTATCTGCTG GCGAACGATG TGGCGAAGCT CACCAAAAGC
GGTAAAGCCC TTTACTCGGG GATGTTGAAT GCCTCTGGCG GTGTGATAGA TGACCTCATC
GTCTACTACT TTACTGAAGA TTTCTTCCGC CTCGTTGTTA ACTCCGCCAC CCGCGAAAAA
GACCTCTCCT GGATTACCCA ACACGCTGAA CCTTTCGGCA TCGAAATTAC CGTTCGTGAT
GACCTTTCCA TGATCGCCGT ACAAGGGCCG AATGCGCAGG CAAAAGCTGC CACACTGTTT
AATGACGCCC AGCGTCAGGC GGTGGAAGGG ATGAAACCGT TCTTTGGCGT GCAGGCGGGC
GATCTGTTTA TTGCCACCAC TGGTTATACC GGTGAAGCGG GCTATGAAAT TGCGCTGCCC
AATGAAAAAG CGGCCGATTT CTGGCGTGCG CTGGTGGAAG CGGGCGTTAA GCCATGCGGC
CTGGGCGCGC GTGACACGCT GCGTCTGGAA GCGGGTATGA ATCTTTATGG TCAGGAGATG
GACGAAACCA TTTCTCCGTT AGCCGCCAAC ATGGGCTGGA CTATCGCCTG GGAACCGGCA
GATCGTGACT TTATCGGTCG TGAAGCTCTG GAAGCGCAGC GTGAACATGG TACAGAAAAA
CTGGTTGGTC TGGTGATGAC CGAAAAAGGC GTGCTGCGTA ATGAACTGCC GGTACGTTTT
ACCGATGCGC AGGGCAACCA GCATGAAGGC ATTATCACCA GCGGGACTTT CTCCCCGACG
CTGGGTTACA GCATTGCGCT GGCGCGCGTG CCGGAAGGTA TTGGCGAAAC GGCGATTGTG
CAAATTCGCA ACCGTGAAAT GCCGGTTAAA GTGACGAAAC CTGTTTTTGT GCGTAACGGC
AAAGCCGTCG CGTGA
 
Protein sequence
MAQQTPLYEQ HTLCGARMVD FHGWMMPLHY GSQIDEHHAV RTDAGMFDVS HMTIVDLRGS 
RTREFLRYLL ANDVAKLTKS GKALYSGMLN ASGGVIDDLI VYYFTEDFFR LVVNSATREK
DLSWITQHAE PFGIEITVRD DLSMIAVQGP NAQAKAATLF NDAQRQAVEG MKPFFGVQAG
DLFIATTGYT GEAGYEIALP NEKAADFWRA LVEAGVKPCG LGARDTLRLE AGMNLYGQEM
DETISPLAAN MGWTIAWEPA DRDFIGREAL EAQREHGTEK LVGLVMTEKG VLRNELPVRF
TDAQGNQHEG IITSGTFSPT LGYSIALARV PEGIGETAIV QIRNREMPVK VTKPVFVRNG
KAVA