Gene ECH74115_1287 details


Gene Information

Locus tag: ECH74115_1287
Symbol:
ID: 6971925
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1299777
End bp: 1300937
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 49%
IMG OID: 643385275
Product: putative aminomethyltransferase
Protein accession: YP_002269770
Protein GI: 209395879
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0404] Glycine cleavage system T protein (aminomethyltransferase)
TIGRFAM ID:
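
The coordinate and length fields in this record can be cross-checked with a short sketch. It assumes 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Start bp and End bp fields of the record.
start_bp = 1299777
end_bp = 1300937

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1161 0 386
```

Under these assumptions the result agrees with the Gene Length (1161 bp) and Protein Length (386 aa) fields.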


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value: 0.159154
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATCTC TTGCTGAATT TCACTTGAAA AATAATGCGG TAATGGGGGT ATATAACAAC 
CGTACCCTAC CTTCTTCTTA TCACGATGCT ATGACCGAAT ATAAAGCCGT TCGTGAAAAC
GCGCTGCTGG TGGACTATTC TCACCTCTCT ATTGTTAGCG TCATGGGCGA TGATGCCTGG
GCGCTGATCA ACCAGCTGGT TTCCGCAGAT GTCTCCATCA TTCGTGATGA GCAGGCAATC
TACTCACTGG TGTTGAATGA AGAGGGCACC ATTCGCGGAG ACGTCTATGT GCTGTGCAGC
ATTGACGGTT ATTACCTGCT TTCAGAAGAT ATTTCGGCTG CCGAATTGAT CGCCAGCATG
AATACCATTC TCGAAAAAGC GGAAGAACTG GATATTCAAT CGATGCCTGA AATTCAGGAT
ATGCGGGAAA ACAACTGGGG GGCAATCCTG CTCGAAGGCC CATACTCCTG GGAGATCATG
TCTGAAATTC ATGGCTTTGA CGTGATTGGC CTGCCTTACT ACGAATACAT GAATACCGAG
GAGGATCTGC TGCTCTTCCG CTGTGGTAAA CACGGTGAGT ATGCGTACAT GACCATCGGT
GAGCAGGCGA AACTGGCGGA GCAGTGGGAA AAACTGTTAA CCGTTGGCGA GAAATACCTG
ATGCAGACCG GCGGCCTGGA TTATCAAAAA ATCGTACGCC TCGAAAATCC GTGCTGGGAT
GCCAGCCTCT GGGAAGGTCA GGCGGTGAAT CCGGTTCAGT TGCAGATGCA GTGGGCGGTG
CAGTACGACA AAGATGATTT TATTGGCAAG GACGCGGTGA CAGAGCTATC CCAGGAGTAT
ACCGGTAATA AACTCATCGG CATGATCGCA CAGGAAGAAT GCGAAGGCAT TGAGGCTGGC
GATCGCGTGC TGGTGGAAGG TCAAGACGTG GGCTATGTCG TGAAGGCACT CTTTTCCCCA
GCGTTGCAGC GTTTCATTGC CCTGACCCTG CTGGAAAAGG ATTACGCCTG GTCTGACATC
AGCGGCTACG AAATTCAAAC TGCTCACGGA ATTATTCCGG CGCAATCAAA ATGTATGCCG
TTTATTTATA ACCTGAGCAT GTTGGTTAGC CCAACTGAGC ACAGTTATAT CGACGCGTCA
AAAAATAAAA GCGCAGCGTA A
 
Protein sequence
MKSLAEFHLK NNAVMGVYNN RTLPSSYHDA MTEYKAVREN ALLVDYSHLS IVSVMGDDAW 
ALINQLVSAD VSIIRDEQAI YSLVLNEEGT IRGDVYVLCS IDGYYLLSED ISAAELIASM
NTILEKAEEL DIQSMPEIQD MRENNWGAIL LEGPYSWEIM SEIHGFDVIG LPYYEYMNTE
EDLLLFRCGK HGEYAYMTIG EQAKLAEQWE KLLTVGEKYL MQTGGLDYQK IVRLENPCWD
ASLWEGQAVN PVQLQMQWAV QYDKDDFIGK DAVTELSQEY TGNKLIGMIA QEECEGIEAG
DRVLVEGQDV GYVVKALFSP ALQRFIALTL LEKDYAWSDI SGYEIQTAHG IIPAQSKCMP
FIYNLSMLVS PTEHSYIDAS KNKSAA
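
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch. It assumes the standard codon assignments (translation table 11 shares its amino acid assignments with the standard table and differs in which start codons are permitted). The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and only the first 30 bases of the gene sequence are used.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first, second, and third base each cycle T, C, A, G,
# with the third base varying fastest. "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; assumes a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAAATCTC TTGCTGAATT TCACTTGAAA"
print(translate(first_blocks))  # prints: MKSLAEFHLK
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` should give a value comparable to the GC content field, subject to however the database rounds it.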