Gene EcSMS35_4885 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4885 
Symbol 
ID6146100 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp5002090 
End bp5003046 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content50% 
IMG OID641619689 
Producthypothetical protein 
Protein accessionYP_001746796 
Protein GI170679958 
COG category[S] Function unknown 
COG ID[COG5464] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01784] conserved hypothetical protein (putative transposase or invertase) 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.287711 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAACT TCACGACCAG CACGCCGCAT GACGCATTAT TTAAATCTTT TCTCACCCAC 
CCTGACACCG CGCGGGATTT TATGGAGATC CACTTACCCA AAGATTTACG TGAACTGTGC
GATCTCGACA GCTTAAAACT GGAATACGCC AGCTTTGTCG ATGAAAAATT GCGGGCGCTA
CATTCCGATA TTCTGTGGTC GGTAAAGACC CGCGAAGGAG ATGGCTATAT TTATGTGGTG
ATTGAACATC AGAGCCGCGA GGATATCCAT ATGGCCTTTC GCCTGATGCG ATATTCCATG
GCGGTGATGC AGCGCCATAT CGAGCATGAT AAACGCCGGC CGCTACCGCT GGTCATCCCG
ATGCTGTTTT ATCACAGTAG CCGTAGTCCT TACCCCTGGT CTTTGTGCTG GCTGGATGAA
TTTGCCGACC CGGCTACCGC GCGGAAGCTT TATACCGCAG CGTTCCCGCT GGTGGATGTC
ACTGTCGTGC CAGACGACGA GATTGTGCAG CATCGCAGAG TCGCCCTGCT GGAGTTGATC
CAAAAGCATA TTCGCCAGCG CGATCTAATG GGGCTTATTG ATCAACTGGC AGTATTACTG
GTTACAGGGT GCGCTAATGA CAGCCAGATA ACCGCGCTGT TAAATTACAT TTTACTGACT
GGCGATGAAG CGCGTTTTAA TGAGTTTATC AGCGAACTTA TCCGTCGAAT GCCACAACAC
AGGGAGCGAA TAATGACGAT TGCAGAGCGA ATTCATAATG ATGGATGGCT GTTGGGAAGG
GAGAGGGGGA GGAAAGAAGG GAAAGAAGAA GGGGAAAAGA GCCTCCTCCG ATTGTTGTTG
CAGAATGGGG CGGATCCTGA ATGGATACAA CGATATACCG GACTTTCGGC AGAGCAAATG
CAGGCATTAG AGCAGCCCTT GCCTGAAAGC AAGCGCGATC CATGGATCGA GTACTAA
 
Protein sequence
MTNFTTSTPH DALFKSFLTH PDTARDFMEI HLPKDLRELC DLDSLKLEYA SFVDEKLRAL 
HSDILWSVKT REGDGYIYVV IEHQSREDIH MAFRLMRYSM AVMQRHIEHD KRRPLPLVIP
MLFYHSSRSP YPWSLCWLDE FADPATARKL YTAAFPLVDV TVVPDDEIVQ HRRVALLELI
QKHIRQRDLM GLIDQLAVLL VTGCANDSQI TALLNYILLT GDEARFNEFI SELIRRMPQH
RERIMTIAER IHNDGWLLGR ERGRKEGKEE GEKSLLRLLL QNGADPEWIQ RYTGLSAEQM
QALEQPLPES KRDPWIEY