Gene EcSMS35_1849 details


Gene Information

Locus tag: EcSMS35_1849
Symbol:
ID: 6143885
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1871236
End bp: 1872405
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 52%
IMG OID: 641616725
Product: tetratricopeptide repeat protein
Protein accession: YP_001743903
Protein GI: 170683676
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2956] Predicted N-acetylglucosaminyl transferase
TIGRFAM ID:
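
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1871236, 1872405)
print(length, protein_length_aa(length))  # 1170 389
```

Both results agree with the Gene Length and Protein Length fields of the record.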


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000245858
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 5.61509e-21
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGCTGGAGT TGTTGTTTCT GCTTTTGCCT GTAGCCGCTG CCTATGGCTG GTATATGGGC 
CGCAGAAGTG CGCAACAAAA CAAGCAAGAT GAAGCCAACC GCTTGTCGCG TGATTACGTA
GCGGGGGTTA ACTTCCTGCT TAGTAATCAA CAGGATAAAG CGGTAGATCT GTTTCTCGAT
ATGCTTAAAG AGGATACAGG TACCGTTGAA GCCCACCTTA CGCTCGGAAA CCTGTTCCGT
TCGCGTGGCG AAGTTGATCG CGCTATTCGC ATCCATCAGA CCCTAATGGA AAGCGCCTCG
CTGACCTATG AACAGCGGCT TTTGGCGATT CAACAACTGG GGCGTGATTA CATGGCTGCC
GGGTTATACG ACCGCGCGGA AGACATGTTC AATCAGCTGA CCGATGAAAC TGACTTCCGC
ATTGGCGCGC TGCAACAGTT GCTACAAATC TACCAGGCTA CCAGCGAGTG GCAGAAAGCA
ATTGATGTTG CCGAACGCCT GGTGAAGCTG GGTAAAGATA AACAGCGCGT CGAAATTGCC
CATTTCTACT GTGAGTTAGC TCTGCAGCAT ATGGCCAGCG ACGATCTCGA TCGTGCCATG
ACTTTGCTAA AAAAAGGTGC GGCGGCAGAT AAAAACAGCG CCCGCGTATC CATCATGATG
GGACGCGTGT TTATGGCGAA AGGAGAATAC GCCAAAGCCG TCGAAAGTCT GCAACGTGTC
ATATCCCAGG ACAGAGAACT GGTCAGCGAA ACGCTGGAAA TGCTGCAAAC CTGCTACCAG
CAGTTGGGTA AAACTGCCGA ATGGGCAGAG TTCCTGCAGC GCGCGGTGGA AGAGAACACC
GGTGCCGATG CTGAACTGAT GCTTGCGGAC ATCATCGAAG CGCGCGACGG TAGTGAGGCC
GCACAGGTCT ATATTACGCG TCAGCTTCAG CGTCATCCGA CCATGCGTGT GTTCCATAAG
CTAATGGATT ACCACTTAAA TGAAGCGGAA GAAGGGCGTG CCAAAGAGAG TCTGATGGTG
CTGCGTGACA TGGTTGGCGA GAAGGTGCGG AGTAAGCCTC GTTATCGCTG CCAGAAATGT
GGGTTTACCG CATACACTCT CTACTGGCAT TGTCCGTCTT GTCGGGCCTG GTCAACCATT
AAACCGATTC GCGGTCTTGA TGGCCTGTAA
 
Protein sequence
MLELLFLLLP VAAAYGWYMG RRSAQQNKQD EANRLSRDYV AGVNFLLSNQ QDKAVDLFLD 
MLKEDTGTVE AHLTLGNLFR SRGEVDRAIR IHQTLMESAS LTYEQRLLAI QQLGRDYMAA
GLYDRAEDMF NQLTDETDFR IGALQQLLQI YQATSEWQKA IDVAERLVKL GKDKQRVEIA
HFYCELALQH MASDDLDRAM TLLKKGAAAD KNSARVSIMM GRVFMAKGEY AKAVESLQRV
ISQDRELVSE TLEMLQTCYQ QLGKTAEWAE FLQRAVEENT GADAELMLAD IIEARDGSEA
AQVYITRQLQ RHPTMRVFHK LMDYHLNEAE EGRAKESLMV LRDMVGEKVR SKPRYRCQKC
GFTAYTLYWH CPSCRAWSTI KPIRGLDGL
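
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the pipeline that produced the record: it uses the codon assignments of NCBI translation table 11 (identical to the standard code for internal codons), strips the spacing used in the listing, and stops at the first stop codon. It does not handle alternative start codons such as GTG or TTG, which does not matter here because the gene begins with ATG. The names `translate` and `gc_percent` are illustrative.

```python
# Codon table in NCBI order: first, second, third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(dna: str) -> str:
    """Remove the whitespace used for 10-base grouping and upper-case the sequence."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        # str.index raises ValueError on ambiguous bases such as N.
        idx = (16 * BASES.index(codon[0])
               + 4 * BASES.index(codon[1])
               + BASES.index(codon[2]))
        aa = AMINO_ACIDS[idx]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCTGGAGT TGTTGTTTCT GCTTTTGCCT"))  # MLELLFLLLP
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared against the protein sequence and the GC content field of the record.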