Gene EcSMS35_2151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2151 
Symbol 
ID6143058 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2157613 
End bp2158803 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content52% 
IMG OID641617027 
Producthypothetical protein 
Protein accessionYP_001744201 
Protein GI170680438 
COG category[R] General function prediction only 
COG ID[COG1092] Predicted SAM-dependent methyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.111749 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.201335 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGTAC GTTTAGTGTT AGCCAAAGGG CGCGAAAAAT CATTACTTCG TCGCCATCCG 
TGGATCTTTT CCGGGGCCGT TGCCCGCATG GAAGGTAAAG CCAGCCTCGG TGAAACCATC
GATATTGTTG ATCATCAGGG AAAATGGTTA GCACGCGGCG CTTATTCGCC AGCTTCGCAA
ATCCGGGCGC GCGTCTGGAC GTTTGACCCG TCTGAGTCTA TCGACATTGC TTTTTTTACC
CGACGTTTAC AACAAGCACA AAAATGGCGT GACTGGCTGG CGCAAAAAGA TGGCCTCAAC
AGCTATCGTT TAATCGCCGG AGAATCTGAT GGCCTGCCGG GTATTACTAT CGATCGTTTC
GGTAATTTTC TGGTGCTGCA ACTGCTGAGT GCTGGCGCAG AGTATCAGCG CGCGGCATTA
ATTAGTGCCC TGCAAACGCT GTACCCGGAA TGTGCGATTT ACGATCGCAG CGATGTTGCG
GTACGTAAAA AAGAAGGGAT GGAGCTGACC CAGGGCCCCG TCACCGGCGA GTTGCCGCCT
GCCCTGCTGC CGATTGAAGA ACACGGCATG AAGCTGCTGG TGGACATACA GCACGGACAC
AAAACGGGCT ACTACCTGGA TCAGCGTGAT AGCCGCCTGG CTACCCGCCG CTACGTTGAA
AATAAACGCG TACTGAACTG TTTCTCCTAT ACCGGTGGTT TCGCCGTATC GGCACTGATG
GGCGGTTGCA GCCAGGTTGT CAGCGTTGAT ACCTCCCAGG AAGCCCTGGA TATTGCACGG
CAGAACGTTG AGCTGAACAA ACTGGATCTG AGCAAGGCTG AGTTTGTCCG TGATGATGTC
TTTAAATTGC TGCGTACCTA TCGCGATCGC GGTGAAAAAT TTGACGTTAT CGTGATGGAC
CCGCCGAAGT TTGTTGAGAA TAAAAGCCAG TTGATGGGCG CGTGTCGTGG CTATAAAGAC
ATCAACATGC TGGCGATTCA GTTGCTGAAT GAAGGCGGAG TTCTCCTGAC TTTCTCCTGT
TCTGGCCTGA TGACCAGCGA TTTATTTCAG AAAATCATCG CAGATGCCGC AATTGATGCC
GGTCGTGATG TACAATTTAT AGAGCAGTTC CGTCAGGCAG CCGATCATCC GGTGATCGCT
ACCTATCCGG AAGGGCTATA TCTGAAAGGG TTTGCCTGTC GCGTCATGTA A
 
Protein sequence
MSVRLVLAKG REKSLLRRHP WIFSGAVARM EGKASLGETI DIVDHQGKWL ARGAYSPASQ 
IRARVWTFDP SESIDIAFFT RRLQQAQKWR DWLAQKDGLN SYRLIAGESD GLPGITIDRF
GNFLVLQLLS AGAEYQRAAL ISALQTLYPE CAIYDRSDVA VRKKEGMELT QGPVTGELPP
ALLPIEEHGM KLLVDIQHGH KTGYYLDQRD SRLATRRYVE NKRVLNCFSY TGGFAVSALM
GGCSQVVSVD TSQEALDIAR QNVELNKLDL SKAEFVRDDV FKLLRTYRDR GEKFDVIVMD
PPKFVENKSQ LMGACRGYKD INMLAIQLLN EGGVLLTFSC SGLMTSDLFQ KIIADAAIDA
GRDVQFIEQF RQAADHPVIA TYPEGLYLKG FACRVM