Gene EcSMS35_3031 details


Gene Information

Locus tag: EcSMS35_3031
Symbol: ygfZ
ID: 6144047
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3120113
End bp: 3121093
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 54%
IMG OID: 641617900
Product: putative global regulator
Protein accession: YP_001745051
Protein GI: 170683930
COG category: [R] General function prediction only
COG ID: [COG0354] Predicted aminomethyltransferase related to GcvT
TIGRFAM ID: [TIGR03317] folate-binding protein YgfZ
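
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3120113
end_bp = 3121093

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon
# is a stop codon, which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 981 bp
print(protein_length, "aa")  # 326 aa
```

Under these assumptions the computed values agree with the reported Gene Length (981 bp) and Protein Length (326 aa).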


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 60
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTTTA CACCTTTTCC TCCCCGTCAG CCTACGGCTT CTGCCCGTTT GCCACTGACG 
CTGATGACGC TTGATGACTG GGCGCTTGCC ACCATTACTG GCGCGGACAG CGAAAAATAT
ATGCAGGGTC AAGTGACAGC AGATGTCAGC CAGATGACAG AAGATCAGCA CCTGCTCGCC
GCCCATTGCG ACGCCAAAGG CAAAATGTGG AGCAACTTAC GTCTGTTCCG CGACGGCGAT
GGCTTTGCAT GGATTGAACG GCGCAGCGTG CGTGAACCGC AGCTGACTGA ACTGAAAAAA
TATGCGGTGT TCTCTAAAGT GACAATCGCG CCAGACGACG AGCGTGTGCT GCTTGGTGTT
GCCGGTTTTC AGGCGCGCGC CGCGCTGGCA AATCTCTTTA GTGAACTGCC TTCGAAAGAA
AAACAGGTAG TTAAAGAAGG CGCGACCACT TTGCTATGGT TTGAACACCC GGCAGAACGT
TTCCTGATCG TAACCGATGA AGCCACTGCC AATATGCTGA CCGATAAACT GCGCGGTGAA
GCGGAACTGA ACAATAGCCA ACAGTGGCTG GCATTAAACA TTGAAGCGGG TTTCCCGGTG
ATTGATGCCG CCAACAGCGG GCAGTTTATC CCACAGGCGA CCAACCTCCA GGCGCTGGGC
GGCATTAGCT TTAAAAAAGG CTGCTATACC GGACAAGAGA TGGTGGCGCG AGCTAAATTC
CGTGGTGCCA ACAAACGCGC GCTCTGGTTG CTGACAGGTA GTGCCAGCCG ACTGCCGGAA
GCTGGTGAAG ACTTAGAGCT GAAAATGGGC GAGAACTGGC GTCGTACCGG TACGGTGCTG
GCTGCGGTCA AACTGGAAGA TGGTCAGGTC GTGGTACAGG TCGTCATGAA TAACGATATG
GAACCGGACA GCATCTTCCG CGTGCGTGAC GATGCGAATA CATTGCGTAT CGAGCCGCTG
CCGTATTCGC TCGAAGAGTA A
 
Protein sequence
MAFTPFPPRQ PTASARLPLT LMTLDDWALA TITGADSEKY MQGQVTADVS QMTEDQHLLA 
AHCDAKGKMW SNLRLFRDGD GFAWIERRSV REPQLTELKK YAVFSKVTIA PDDERVLLGV
AGFQARAALA NLFSELPSKE KQVVKEGATT LLWFEHPAER FLIVTDEATA NMLTDKLRGE
AELNNSQQWL ALNIEAGFPV IDAANSGQFI PQATNLQALG GISFKKGCYT GQEMVARAKF
RGANKRALWL LTGSASRLPE AGEDLELKMG ENWRRTGTVL AAVKLEDGQV VVQVVMNNDM
EPDSIFRVRD DANTLRIEPL PYSLEE
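
The gene sequence is printed in space-separated blocks of ten bases, so any check against the reported GC content has to ignore that whitespace. The helper below is a minimal sketch of such a calculation; the function name `gc_percent` is hypothetical, and it assumes only the unambiguous bases A, C, G and T should be counted.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T bases in seq.

    Whitespace and any other characters (such as the block separators
    used in the listing above) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Toy input only; paste the full gene sequence to compare with the record.
print(gc_percent("ATGC"))
```

Passing the full gene sequence from the listing to this function gives a value that can be compared with the GC content field in the Gene Information section.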