Gene EcSMS35_0127 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0127 
Symbol 
ID6143926 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp139712 
End bp141469 
Gene Length1758 bp 
Protein Length585 aa 
Translation table11 
GC content53% 
IMG OID641615028 
Producthypothetical protein 
Protein accessionYP_001742244 
Protein GI170684092 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.146537 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATGA CTTTGCCGTT TAAACCCCAT GTGCTGGCGC TAATTTGCAG TGCCGGGCTT 
TGTGCCGCCT CTGCCGGGCT ATATATAAAA AGCCGCACAG TGGAAGCGCC TGTGGAATCG
CAATCGACAC AACAGACTGC GCCTGACATC TCCGCAGTTA CGCTTCCTGC AACGGTTTCC
GCGCCTCCCG TAACGCCTGC CGTCGTCAAA TCTACATTCA GCACTGCACA AATAGATCAA
TGGGTTGCGC CTGTCGCGCT GTACCCCGAC TCTCTGCTTT CACAAGTGTT AATGGCATCA
ACCTATCCGG CAAACGTTGC GCAAGCAGTG CAATGGTCGC ACGATAATCC ACTTAAACAA
GGCGATGCTG CTATTCAGGC GGTATCTGAC CAGCCCTGGG ACGCCAGCGT CAAATCACTG
GTGGCCTTTC CACAGTTGAT GGCATTGATG GGCGAAAACC CGCAATGGGT GCAAAACCTG
GGCGATGCTT TTCTGGCGCA GCCGCAGGAC GTGATGGACT CGGTACAACG ATTGCGGCAA
CTGGCGCAAC AAACCGGTTC GCTAAAGTCA TCAACTGAAC AAAAGATCAT CACGACAACG
AAAAAAGTCG TACCGGTAAA TCAGCCCGCC AACGCCCCTG TTACACAATC AAATACCGTT
TCTACGTCCA GCCCCGTCGT CGCAGAACCT GCACCGACCG TAATAACCAT TGAGCCAGCC
AATCCGGATG TGGTTTATAT TCCGAACTAC AACCCAAACG TGGTTTACGG CAGTTGGGCC
AATACCGCTT ATCCGCCAGT TTATCTGCCG CCGCCAGCCG GAGAACCGTT TGTTGACAGC
TTTGTGCGCG GATTCGGCTA TAGCATGGGC GTTGCTACCA CATACGCACT ATTCAGCAGC
ATCGACTGGG ACGACGACGA TCATGACCAT CATCATCATG ACGATGATGA TTATCATCAC
CACGATGGCG GTCATCGTGA CGGTAATGAC TGGCAACACA ACGGCGACAA CATCAATATC
GACGTCAACA ATTTCAACCG TATCACTGGT GAGCATCTTA CTGATAAGAA TATGGCATGG
CGGCACAATC CAAACTACCG TGATGGTGTG CCCTATCATG ATCAGGATAT GGCAAAGCGG
TTTCACCAAA CCGATGTCAA CGGCGGCATG AGCGCCACGC AGTTACCTGC TCTATCGCGC
GACAGTCAGC GTCAGGCAGC AGCAAGTCAG TTTCAGCAAC GAACACACAC TGCGCCAGTC
ATTACGCGAG ATACTCAACG TCAGGCTGCG GCACAGCGAT TTAATGAAGC GGAACATTAT
GGGAGCTATG ACGACTTCCG CGAGTTCAGC CGTCGCCAAC CACTGACCCA GCAACAAAAG
GACGCTGCTC GTCAGCGTTA TCAGTCGGCC TCGCCTGAGC AGCGCCAGGC AGTCCGCGAG
AAAATGCAGA CTAACCCGCA GAACCAGCAG CGAAGAGAGG TAGCGCGTGA GCGTATTCAG
TCTGCAACTC CCGAACAGCG CCAGGTGTTT AAGGAAAAAG TACAGCAGCG CCCACTGAAC
CAACAGCAAC GTGATAACGC CCGCCAGCGT ATACAATCGG CATCACCTGA ACAACGTCAG
GTTTTTCGGG AGAAAGTTCA GGAGAGCCGC CCACAACGTC TAAACGACAG TAACCATACT
GTCAGGCTGA ATAACGAGCA ACGGTCAGCA GTACGCGAAC GTCTCTCTGA GCGCGGAGCA
AGGCGACTGG AAAGGTAA
 
Protein sequence
MKMTLPFKPH VLALICSAGL CAASAGLYIK SRTVEAPVES QSTQQTAPDI SAVTLPATVS 
APPVTPAVVK STFSTAQIDQ WVAPVALYPD SLLSQVLMAS TYPANVAQAV QWSHDNPLKQ
GDAAIQAVSD QPWDASVKSL VAFPQLMALM GENPQWVQNL GDAFLAQPQD VMDSVQRLRQ
LAQQTGSLKS STEQKIITTT KKVVPVNQPA NAPVTQSNTV STSSPVVAEP APTVITIEPA
NPDVVYIPNY NPNVVYGSWA NTAYPPVYLP PPAGEPFVDS FVRGFGYSMG VATTYALFSS
IDWDDDDHDH HHHDDDDYHH HDGGHRDGND WQHNGDNINI DVNNFNRITG EHLTDKNMAW
RHNPNYRDGV PYHDQDMAKR FHQTDVNGGM SATQLPALSR DSQRQAAASQ FQQRTHTAPV
ITRDTQRQAA AQRFNEAEHY GSYDDFREFS RRQPLTQQQK DAARQRYQSA SPEQRQAVRE
KMQTNPQNQQ RREVARERIQ SATPEQRQVF KEKVQQRPLN QQQRDNARQR IQSASPEQRQ
VFREKVQESR PQRLNDSNHT VRLNNEQRSA VRERLSERGA RRLER