Gene EcSMS35_3442 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3442 
Symbol 
ID6145975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3520156 
End bp3522192 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content54% 
IMG OID641618271 
Productputative lipoprotein 
Protein accessionYP_001745420 
Protein GI170679956 
COG category[R] General function prediction only 
COG ID[COG3107] Putative lipoprotein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTACCCT CAACATTTTC TCGTTTGAAA GCCGCGCGTT GTCTGCCTGT TGTTCTGGCA 
GCCCTGATTT TCGCCGGTTG TGGCACCCAT ACTCCCGATC AGTCCACTGC TTATATGCAG
GGCACGGCGC AGGCTGATTC TGCCTTTTAC CTGCAACAGA TGCAGCAAAG TTCTGATGAT
ACCAGGATCA ACTGGCAATT ACTCGCCATT CATGCACTGG TGAAAGAAGG TAAAACCGGA
CAGGCGGTGG AGTTGTTTAA CCAACTACCG CAAGAACTGA ACGATTCTCA GCGTCGCGAG
AAAACACTGC TGGCGGCAGA GATTAAACTG GCGCAGAAAG ATTTTGCTGG CGCGCAAAAC
TTGCTGGCGA AAATCACACC TGCCGATTTA GAACAAAACC AGCAAGCGCG TTACTGGCAG
GCAAAAATCG ATGCCAGCCA GGGGCGTCCT TCCATTGATT TACTGCGCGC GTTAATTGCT
CAGGAACCGC TGCTCGGCGC GAAAGAAAAG CAGCAGAATA TTGACGCCAC CTGGCAGGCG
CTCTCCTCCA TGACTCAGGA ACAGGCGAAT ACGCTCGTGA TCAATGCCGA CGAAAATATT
CTGCAAGGCT GGTTGGATCT GCAGCGCATC TGGTTTGATA ACCGTAACGA TCCCGACATG
ATGAAAGCCG GGATCGCCGA CTGGCAGAAA CGTTATCCGA ACAACCCGGG CGCGAAAATG
CTGCCAACGC AGTTGGTTAA CGTAAAAGCG TTTAAACCCG CATCGACCAA CAAAATCGCC
CTGCTGTTGC CACTGAATGG CCAGGCAGCG GTATTTGGTC GCACTATTCA GCAAGGCTTT
GAAGCGGCGA AAAATATCGG CACTCAGCCA GTGGCGGCCC AGGTAGCTGC CGCACCTGCC
GCAGACGTAG CAGAACAACC TCAGCCGCAA ACCGTGGATG GCGTTGCCAG CCCGGCACAA
GCCTCGGTTA GCGATCTGAC CGGTGAGCAG CCTGCGGCCC AGCCGGTGCC TGTAAGCGCC
CCGGCGACAA GCACCGCAGC GGTAAGCGCA CCCGCAAATC CATCCGCAGA GCTAAAAATC
TACGATACCT CATCACAACC ACTTAGCCAG ATCTTAAGCC AGGTTCAGCA GGATGGTGCG
AGTATTGTGG TCGGTCCGTT GCTGAAAAAT AACGTTGAAG AGTTGCTGAA GAGCAACACC
CCGCTGAACG TGCTGGCACT GAACCAGCCG GAGAATATCG AAAACCGCGT CAATATTTGT
TACTTCGCGC TTTCACCGGA AGACGAAGCG CGCGATGCAG CGCGTCATAT TCGTGACCAG
GGTAAACAAG CGCCGCTGGT GCTGATCCCA CGCAGTTCAT TGGGCGATCG CGTAGCCAAT
GCGTTTGCGC AAGAGTGGCA GAAACTGGGT GGCGGCACCG TTCTGCAACA AAAATTTGGT
TCCACCAGCG AATTACGCGC GGGTGTTAAC GGCGGTTCTG GTATTGCTTT AACGGGTAGC
CCGATTACTC CCAGAGAAAC AACCGACTCC GGCATGACGA CCAACAATCC AACGCTGCAA
ACCACGCCAA CCGATGACCA GTTCACCAAT AATGGCGGTC GTGTCGATGC GGTGTACATT
GTGGCAACGC CGGGTGAAAT CGCTTTTATC AAACCAATGA TCGCCATGCG TAACGGTAGC
CAGAGCGGTG CAACGCTGTA CGCCAGCTCC CGCAGTGCGC AAGGGACCGC TGGCCCGGAT
TTCCGCCTGG AGATGGAAGG CTTGCAGTAC AGCGAAATCC CGATGCTGGC AGGCGGTAAT
CTGCCGTTAA TGCAGCAGGC GCTCAGCGCG GTGAACAACG ATTATTCGCT GGCTCGCATG
TATGCGATGG GCGTCGATGC CTGGTCGTTG GCAAATCATT TCTCACAAAT GCGCCAGGTT
CAGGGTTTTG AAATCAACGG TAATACCGGA AGTCTGACGG CTAACCCGGA TTGCGTGATT
AACAGGAAGT TATCATGGCT ACAGTACCAA CAAGGTCAGG TAGTCCCCGC CAGTTAA
 
Protein sequence
MVPSTFSRLK AARCLPVVLA ALIFAGCGTH TPDQSTAYMQ GTAQADSAFY LQQMQQSSDD 
TRINWQLLAI HALVKEGKTG QAVELFNQLP QELNDSQRRE KTLLAAEIKL AQKDFAGAQN
LLAKITPADL EQNQQARYWQ AKIDASQGRP SIDLLRALIA QEPLLGAKEK QQNIDATWQA
LSSMTQEQAN TLVINADENI LQGWLDLQRI WFDNRNDPDM MKAGIADWQK RYPNNPGAKM
LPTQLVNVKA FKPASTNKIA LLLPLNGQAA VFGRTIQQGF EAAKNIGTQP VAAQVAAAPA
ADVAEQPQPQ TVDGVASPAQ ASVSDLTGEQ PAAQPVPVSA PATSTAAVSA PANPSAELKI
YDTSSQPLSQ ILSQVQQDGA SIVVGPLLKN NVEELLKSNT PLNVLALNQP ENIENRVNIC
YFALSPEDEA RDAARHIRDQ GKQAPLVLIP RSSLGDRVAN AFAQEWQKLG GGTVLQQKFG
STSELRAGVN GGSGIALTGS PITPRETTDS GMTTNNPTLQ TTPTDDQFTN NGGRVDAVYI
VATPGEIAFI KPMIAMRNGS QSGATLYASS RSAQGTAGPD FRLEMEGLQY SEIPMLAGGN
LPLMQQALSA VNNDYSLARM YAMGVDAWSL ANHFSQMRQV QGFEINGNTG SLTANPDCVI
NRKLSWLQYQ QGQVVPAS