Gene EcHS_A3266 details

Gene Information

Locus tag: EcHS_A3266
Symbol:
ID: 5592385
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3278084
End bp: 3279220
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 54%
IMG OID: 640922383
Product: methyltransferase family protein
Protein accession: YP_001459878
Protein GI: 157162560
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG2813] 16S RNA G1207 methylase RsmC
TIGRFAM ID:
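
The coordinate and length fields above are related by simple arithmetic. A minimal sketch of that check follows. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(3278084, 3279220)
    print(length)                  # 1137
    print(protein_length(length))  # 378
```

Both values agree with the Gene Length and Protein Length fields of this record.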


Plasmid Coverage information

Num covering plasmid clones: 62
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCACT TAGACAACGG TTTCCGTTCA CTGACACTGC AACGTTTTCC GGCGACGGAT 
GACGTTAACC CGCTACAGGC GTGGGAAGCG GCGGATGAAT ATTTGCTGCA ACAGTTGGAC
GACACAGAAA TCCGCGGCCC GGTGTTGATC CTGAATGATG CCTTTGGTGC GTTAAGTTGT
GCGCTGGCGG AACATAAGCC GTACAGCATT GGCGACTCAT ACATCAGTGA ACTGGCGACG
CGCGAGAATT TACGCCTAAA CGGGATTGAT GAGTCGAGCG TGAAGTTTCT CGACAGCACC
GCCGACTACC TGCAACAGCC GGGTGTGGTG CTGATCAAAG TGCCGAAAAC ACTGGCATTG
CTGGAACAGC AACTGCGTGC GCTGCGCAAA GTGGTCACGT CGGATACACG TATTATTGCC
GGTGCCAAAG CCCGTGACAT TCACACTTCC ACGCTGGAGC TGTTCGAGAA AGTGCTCGGA
CCGACCACCA CTACGCTGGC ATGGAAAAAA GCGCGCCTGA TTAATTGCAC TTTCAATGAA
CCGCCGCTGG CCGATGCGCC GCAGACCGTT AGCTGGAAGC TGGAAGGTAC TGACTGGACT
ATCCACAACC ATGCGAATGT CTTCTCCCGC ACCGGGCTGG ATATTGGCGC ACGCTTCTTT
ATGCAACATC TGCCAGAGAA TCTCGAAGGT GAGATTGTCG ATCTCGGCTG TGGTAACGGC
GTTATTGGCC TGACGCTGCT TGATAAAAAC CCGCAGGCGA AAGTGGTGTT TGTCGATGAA
TCGCCGATGG CGGTCGCTTC CAGCCGAATG AATGTTGAAA CCAACATGCC AGAGGCGTTG
GATCGCTGCG AGTTTATGAT CAACAACGCG CTCTCCGGAG TGGAGCCTTT CCGCTTTAAT
GCTGTGCTCT GCAACCCGCC GTTTCATCAG CAACATGCGC TGACCGATAA CGTCGCCTGG
GAGATGTTCC ACCACGCCCG CCGCTGCCTG AAAATCAACG GCGAACTGTA TATCGTTGCC
AACCGTCACC TGGATTACTT CCATAAACTG AAGAAGATTT TCGGCAACTG CACCACCATC
GCCACGAATA ATAAATTTGT GGTGCTGAAA ACAGTGAAGC TGGGGCGTCG TCGGTAA
 
Protein sequence
MSHLDNGFRS LTLQRFPATD DVNPLQAWEA ADEYLLQQLD DTEIRGPVLI LNDAFGALSC 
ALAEHKPYSI GDSYISELAT RENLRLNGID ESSVKFLDST ADYLQQPGVV LIKVPKTLAL
LEQQLRALRK VVTSDTRIIA GAKARDIHTS TLELFEKVLG PTTTTLAWKK ARLINCTFNE
PPLADAPQTV SWKLEGTDWT IHNHANVFSR TGLDIGARFF MQHLPENLEG EIVDLGCGNG
VIGLTLLDKN PQAKVVFVDE SPMAVASSRM NVETNMPEAL DRCEFMINNA LSGVEPFRFN
AVLCNPPFHQ QHALTDNVAW EMFHHARRCL KINGELYIVA NRHLDYFHKL KKIFGNCTTI
ATNNKFVVLK TVKLGRRR
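
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is an illustration, not the pipeline used to build this record. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), and that the sequence is given 5' to 3' on the coding strand. The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, '*' = stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First line of the gene sequence in this record.
    first_line = (
        "ATGAGCCACT TAGACAACGG TTTCCGTTCA CTGACACTGC AACGTTTTCC GGCGACGGAT"
    )
    print(translate(first_line))  # MSHLDNGFRSLTLQRFPATD
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.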