Gene EcSMS35_1585 details


Gene Information

Locus tag: EcSMS35_1585
Symbol:
ID: 6146897
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1571129
End bp: 1572637
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 49%
IMG OID: 641616462
Product: hypothetical protein
Protein accession: YP_001743640
Protein GI: 170682142
COG category: [S] Function unknown
COG ID: [COG5339] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
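
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1571129
end_bp = 1572637

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# which does not contribute an amino acid to the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 1509
print(gene_length % 3)  # 0, i.e. a whole number of codons
print(protein_length)   # 502
```

Under these assumptions the computed values match the listed Gene Length (1509 bp) and Protein Length (502 aa).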


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAAAT CGCTGGTAGC GGTAGGCGTC ATTGTTGCGC TAGGCGTAGT CTGGACAGGC 
GGCGCATGGT ATACAGGCAA GAAGATTGAA ACCCATCTCG AAGACATGGT CGCGCAGGCG
AACGCGCAAC TCAAACTGAC CGCTCCTGAA TCCAACCTGG AAGTGAGTTA TCAAAACTAT
CATCGCGGCG TATTCAGCAG TCAGCTGCAA CTGTTGGTGA AACCCATTGC CGGGAAAGAA
AATCCGTGGA TTAAAAGCGG TCAGAGCGTC ATCTTCAACG AATCGGTTGA TCATGGTCCC
TTCCCCCTTG CCCAGCTAAA AAAACTGAAC CTGCTCCCGT CGATGGCATC AATTCAAACC
ACGCTGGTTA ATAACGAAGT TAGCAAGCCA CTGTTTGATA TGGCAAAAGG TGAAACGCCT
TTTGAGATTA ACTCCCGCAT TGGTTACAGC GGAGATTCCA GTTCCGATAT TTCGCTCAAG
CCGCTGAATT ACGAGCAAAA GGATGAAAAA GTCGCCTTTA GCGGCGGCGA GTTCCAGTTA
AATGCGGACA GAGACGGCAA AGCTATCTCC CTTTCCGGAG AGGCGCAAAG TGGTCGGATA
GACGCAGTTA ACGAATACAA CCAGAAAGTG CAGTTGACCT TTAATAATCT GAAAACCGAC
GGTTCCAGCA CGCTGGCAAG TTTTGGTGAG CGCGTAGGAA ACCAAAAACT GTCACTGGAA
AAAATGACCA TTTCAGTGGA AGGCAAAGAA CTGGCACTGC TGGAAGGCAT GGAGATCAGC
GGTAAATCGG ATCTGGTCAA TGACGGTAAA ACGATCAATA GCCAACTGGA TTACTCGCTA
AACAGCCTGA AGGTACAGAA TCAGGATCTG GGCAGTGGCA AGCTGACTTT AAAAGTCGGC
CAGATTGATG GTGAAGCCTG GCATCAGTTT AGCCAGCAAT ATAACGCGCA AACTCAGGCG
CTGCTGGCAC AGCCAGAAAT TGCCAACAAT CCCGAACTTT ATCAGGAGAA AGTGACGGAA
GCCTTCTTTA GCGCCCTGCC GCTGATGTTG AAAGGCGATC CGGTGATTAC TATCGCGCCG
CTAAGCTGGA AAAACAGTCA GGGTGAAAGT GCGCTGAATC TGTCGCTGTT CCTGAAAGAT
CCGGCAACGA CTAAAGAAGC GCCGCAAACG CTGACGCAGG AAGTAGATCG TTCGGTTAAA
TCTCTGGATG CGAAACTGAC CATTCCGGTG GATATGGCAA CTGAGTTGAT GACTCAGGTA
GCGAAGCTGG AAGGTTATCA GGAAGATCAA GCGAAAAAAC TGGCGAAACA GCAAGTTGAA
GGTGCATCAG CAATGGGGCA GATGTTCCGT CTGACCACCT TGCAGGACAA TACCATCACC
ACCAGCCTGC AATATGCTAA CGGTCAGATA ACGTTAAACG GGCAGAAAAT GCCACTGGAA
GATTTCGTGG GTATGTTTGC GATGCCGACA TTAAACGTTC CGGCTGTACC CGCTATTCCG
CAGCAGTAA
 
Protein sequence
MNKSLVAVGV IVALGVVWTG GAWYTGKKIE THLEDMVAQA NAQLKLTAPE SNLEVSYQNY 
HRGVFSSQLQ LLVKPIAGKE NPWIKSGQSV IFNESVDHGP FPLAQLKKLN LLPSMASIQT
TLVNNEVSKP LFDMAKGETP FEINSRIGYS GDSSSDISLK PLNYEQKDEK VAFSGGEFQL
NADRDGKAIS LSGEAQSGRI DAVNEYNQKV QLTFNNLKTD GSSTLASFGE RVGNQKLSLE
KMTISVEGKE LALLEGMEIS GKSDLVNDGK TINSQLDYSL NSLKVQNQDL GSGKLTLKVG
QIDGEAWHQF SQQYNAQTQA LLAQPEIANN PELYQEKVTE AFFSALPLML KGDPVITIAP
LSWKNSQGES ALNLSLFLKD PATTKEAPQT LTQEVDRSVK SLDAKLTIPV DMATELMTQV
AKLEGYQEDQ AKKLAKQQVE GASAMGQMFR LTTLQDNTIT TSLQYANGQI TLNGQKMPLE
DFVGMFAMPT LNVPAVPAIP QQ
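
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The sketch below is one possible way to do that in plain Python, and to translate a cleaned gene fragment or compute its GC fraction. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and not part of any database tooling. The codon table is the standard one, on the assumption that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons.

```python
from itertools import product

# Standard codon assignments in the conventional T, C, A, G ordering.
# Assumption: translation table 11 uses these same codon-to-amino-acid
# assignments and differs from the standard code only in start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq_text):
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(seq_text.split()).upper()


def translate(dna):
    """Translate a cleaned DNA string in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First printed line of the gene sequence, copied from the record.
first_line = "ATGAATAAAT CGCTGGTAGC GGTAGGCGTC ATTGTTGCGC TAGGCGTAGT CTGGACAGGC"
fragment = clean(first_line)

print(len(fragment))        # 60
print(translate(fragment))  # MNKSLVAVGVIVALGVVWTG
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` should give a value comparable to the GC content field, though the exact rounding used by the database is not known.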