Gene EcSMS35_1801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1801 
Symbol 
ID6145443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1820406 
End bp1821803 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content55% 
IMG OID641616677 
Producthypothetical protein 
Protein accessionYP_001743855 
Protein GI170682550 
COG category[R] General function prediction only 
COG ID[COG3106] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.873662 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGAC TTAAAAATGA ACTTAATGCG CTGGTGAATC GGGGTGTCGA CAGACATCTG 
CGCCTCGCCG TAACCGGACT TAGCCGCAGC GGCAAAACGG CGTTTATCAC TGCGATGGTC
AATCAGTTGC TCAATATTCA CGCCGGAGCA AGGTTGCCGC TGTTAAGCGC GGTGCGTGAA
GAACGGCTGC TGGGCGTAAA ACGCATTCCC CAGCGTGACT TTGGCATTCC GCGCTTCACC
TATGATGAAG GGCTGGCGCA GTTATATGGC GATCCACCCG CCTGGCCAAC GCCAACGCGC
GGCGTCAGCG AAATTCGCCT GGCGCTACGT TTTAAATCGA ACGATTCGCT GCTACGCCAC
TTTAAGGATA CCTCCACGCT GTATCTGGAG ATTGTGGATT ATCCCGGCGA ATGGTTGCTC
GACCTGCCGA TGCTGGCGCA GGACTATTTA AGCTGGTCGC GCCAGATGAC GGGCTTACTT
AATGGTCAGC GCGGCGAATG GTCGGCGAAA TGGCGAATGA TGTGCGAAGG GCTGGACCCG
CTAGCGCCTG CCGACGAAAA CCGGCTGGCA GACATTGCCG CCGCCTGGAC CGAGTATCTC
CACCACTGCA AACAGCAGGG GTTGCACTTT ATTCAGCCTG GGCGCTTTGT CTTGCCGGGA
GATATGGCAG GAGCACCCGC GCTGCAATTC TTCCCGTGGC CGGATGTCGA TGCCTGGGGC
GAGTCCAAAC TGGCGCAGGC CGATAAGCAT ACCAATGCCG GAATGCTGCG CGAGCGGTTT
AATTATTACT GCGAGAAAGT GGTGAAGGGG TTCTATAAGA ATCATTTTCT GCGCTTTGAC
CGCCAGATTG TGCTGGTGGA TTGCCTGCAA CCTCTCAACA GTGGGCCACA GGCATTTAAT
GATATGCGTC TGGCGCTGAC GCAGCTGATG CAAAGTTTCC ACTACGGGCA GCGAACCCTG
TTCCGGCGTT TGTTTTCACC GGTTATCGAT AAGCTATTGT TTGCTGCCAC TAAAGCAGAC
CATGTGACCA TCGATCAGCA CGCCAATATG GTTTCATTAC TGCAACAACT GATTCAGGAT
GCCTGGCAAA ATGCGGCGTT CGAAGGGATC AGCATGGACT GCCTGGGGCT GGCGTCAGTT
CAGGCGACCA CCAGCGGCAT TATTGATGTT AACGGTGAGA AAATCCCGGC GCTGCGCGGT
AATCGACTCA GCGATGGCGC ACCGCTCACT GTTTATCCTG GCGAAGTTCC CGCTCGTTTG
CCCGGTCAGG CGTTCTGGGA TAAGCAAGGC TTCCAGTTTG AGGCGTTTCG TCCGCAGGTG
ATGGATGTTG ACAAACCGCT GCCGCATATT CGCCTTGATG CTGCGCTGGA ATTTTTAATA
GGAGATAAAT TGCGATGA
 
Protein sequence
MKRLKNELNA LVNRGVDRHL RLAVTGLSRS GKTAFITAMV NQLLNIHAGA RLPLLSAVRE 
ERLLGVKRIP QRDFGIPRFT YDEGLAQLYG DPPAWPTPTR GVSEIRLALR FKSNDSLLRH
FKDTSTLYLE IVDYPGEWLL DLPMLAQDYL SWSRQMTGLL NGQRGEWSAK WRMMCEGLDP
LAPADENRLA DIAAAWTEYL HHCKQQGLHF IQPGRFVLPG DMAGAPALQF FPWPDVDAWG
ESKLAQADKH TNAGMLRERF NYYCEKVVKG FYKNHFLRFD RQIVLVDCLQ PLNSGPQAFN
DMRLALTQLM QSFHYGQRTL FRRLFSPVID KLLFAATKAD HVTIDQHANM VSLLQQLIQD
AWQNAAFEGI SMDCLGLASV QATTSGIIDV NGEKIPALRG NRLSDGAPLT VYPGEVPARL
PGQAFWDKQG FQFEAFRPQV MDVDKPLPHI RLDAALEFLI GDKLR