Gene EcHS_A3057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3057 
SymbolygfZ 
ID5594485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3068826 
End bp3069806 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content54% 
IMG OID640922174 
Productputative global regulator 
Protein accessionYP_001459676 
Protein GI157162358 
COG category[R] General function prediction only 
COG ID[COG0354] Predicted aminomethyltransferase related to GcvT 
TIGRFAM ID[TIGR03317] folate-binding protein YgfZ 


Plasmid Coverage information

Num covering plasmid clones83 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTTTA CACCTTTTCC TCCCCGTCAG CCTACGGCTT CTGCCCGTTT ACCGCTGACG 
CTGATGACGC TTGATGACTG GGCGCTTGCC ACCATTACTG GCGCGGACAG CGAAAAATAT
ATGCAGGGTC AGGTGACAGC AGATGTCAGC CAGATGGCAG AAGATCAGCA CCTGCTCGCC
GCCCATTGCG ACGCCAAAGG TAAAATGTGG AGCAATTTAC GTCTGTTCCG CGACGGCGAT
GGCTTTGCAT GGATTGAACG GCGCAGCGTG CGTGAACCGC AGCTGACTGA ACTGAAAAAA
TATGCGGTAT TCTCTAAAGT GACCATCGCG CCAGACGACG AGCGTGTGCT GCTTGGTGTT
GCCGGTTTTC AGGCGCGCGC CGCGCTGGCA AATCTCTTTA GCGAACTGCC TTCGAAAGAA
AAACAGGTAG TCAAAGAAGG CGCGACCACT TTGCTATGGT TTGAACACCC GGCAGAACGT
TTCCTGATCG TAACCGATGA AGCTACTGCT AATATGCTGA CCGATAAACT GCGCGGTGAA
GCGGAACTGA ACAATAGCCA ACAGTGGCTG GCATTAAACA TTGAAGCGGG TTTCCCGGTG
ATTGATGCCG CCAACAGCGG GCAGTTTATC CCACAGGCGA CCAATCTCCA GGCGCTGGGC
GGTATCAGCT TTAAGAAAGG CTGTTATACC GGACAAGAGA TGGTGGCGCG AGCAAAATTC
CGTGGTGCCA ATAAACGTGC GCTCTGGTTG CTGGCAGGTA GCGCCAGCCG ACTGCCGGAA
GCTGGTGAAG ACTTAGAGCT GAAAATGGGC GAGAACTGGC GTCGTACCGG TACGGTGCTG
GCTGCGGTAA AACTGGAAGA TGGTCAGGTC GTGGTACAGG TCGTCATGAA TAACGATATG
GAACCGGATA GCATCTTCCG CGTACGCGAC GATGCGAATA CATTGCATAT CGAGCCGCTG
CCGTATTCGC TCGAAGAGTA A
 
Protein sequence
MAFTPFPPRQ PTASARLPLT LMTLDDWALA TITGADSEKY MQGQVTADVS QMAEDQHLLA 
AHCDAKGKMW SNLRLFRDGD GFAWIERRSV REPQLTELKK YAVFSKVTIA PDDERVLLGV
AGFQARAALA NLFSELPSKE KQVVKEGATT LLWFEHPAER FLIVTDEATA NMLTDKLRGE
AELNNSQQWL ALNIEAGFPV IDAANSGQFI PQATNLQALG GISFKKGCYT GQEMVARAKF
RGANKRALWL LAGSASRLPE AGEDLELKMG ENWRRTGTVL AAVKLEDGQV VVQVVMNNDM
EPDSIFRVRD DANTLHIEPL PYSLEE