Gene EcolC_0811 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0811 
Symbol 
ID6066545 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp870451 
End bp871431 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content54% 
IMG OID641600216 
Productputative global regulator 
Protein accessionYP_001723810 
Protein GI170018856 
COG category[R] General function prediction only 
COG ID[COG0354] Predicted aminomethyltransferase related to GcvT 
TIGRFAM ID[TIGR03317] folate-binding protein YgfZ 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.535495 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTTTA CACCTTTTCC GCCCCGTCAG CCTACGGCTT CTGCCCGTTT GCCACTGACA 
CTGATGACGC TTGATGACTG GGCGCTTGCC ACCATTACTG GCGCGGACAG CGAAAAATAT
ATGCAGGGTC AGGTGACAGC AGACGTTAGC CAAATGACAG AAGATCAGCA CCTGCTGGCC
GCCCATTGCG ACGCCAAAGG TAAAATGTGG AGCAATTTAC GTCTGTTCCG CGACGGCGAT
GGCTTTGCGT GGATTGAACG GCGCAGCGTG CGTGAACCGC AGCTGACTGA ACTGAAAAAA
TATGCGGTAT TCTCTAAAGT GACCATCGCG CCAGACGACG AGCGTGTGCT GCTTGGTGTT
GCCGGTTTTC AGGCGCGCGC CGCGCTGGCA AACCTCTTTA GTGTATTACC TTCGAAGGAA
AAGCAGGTTA TCAGAGAAGA TGCGACTACC CTGCTATGGT TTGAACATCC GGCAGAACGT
TTCCTGATCG TAACCGATGA AGCTACTGCC AATATGCTGA CCGATAAACT GCGCGGTGAA
GCGGAACTGA ACAATAGCCA ACAGTGGCTG GCATTAAACA TTGAAGCTGG TTTCCCGGTG
ATTGATGCCG CCAACAGCGG GCAGTTTATC CCACAGGCGA CCAACCTCCA GGCGCTGGGC
GGTATCAGCT TTAAGAAAGG CTGCTATACC GGACAAGAGA TGGTGGCGCG AGCAAAATTC
CGTGGTGCCA ACAAACGCGC GCTCTGGTTG CTGACAGGTA GTGCCAGCCG ACTGCCGGAA
GCTGGTGAAG ACTTAGAGCT GAAAATGGGC GAGAACTGGC GTCGCACCGG TACGGTGCTG
GCTGCGGTAA AACTGGAAGA TGGCCAGGTC GTGGTACAAG TCGTCATGAA TAACGATATG
GAACCGGACA GCATCTTCCG CGTACGCGAC GATGCGAATA CATTGCATAT CGAGCCGCTG
CCGTATTCGC TCGAAGAGTA A
 
Protein sequence
MAFTPFPPRQ PTASARLPLT LMTLDDWALA TITGADSEKY MQGQVTADVS QMTEDQHLLA 
AHCDAKGKMW SNLRLFRDGD GFAWIERRSV REPQLTELKK YAVFSKVTIA PDDERVLLGV
AGFQARAALA NLFSVLPSKE KQVIREDATT LLWFEHPAER FLIVTDEATA NMLTDKLRGE
AELNNSQQWL ALNIEAGFPV IDAANSGQFI PQATNLQALG GISFKKGCYT GQEMVARAKF
RGANKRALWL LTGSASRLPE AGEDLELKMG ENWRRTGTVL AAVKLEDGQV VVQVVMNNDM
EPDSIFRVRD DANTLHIEPL PYSLEE