Gene EcSMS35_2195 details

Gene Information

Locus tag: EcSMS35_2195
Symbol:
ID: 6145177
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2204363
End bp: 2206210
Gene Length: 1848 bp
Protein Length: 615 aa
Translation table: 11
GC content: 53%
IMG OID: 641617071
Product: hypothetical protein
Protein accession: YP_001744245
Protein GI: 170683737
COG category: [S] Function unknown
COG ID: [COG2989] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
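
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2204363, 2206210)
print(length)                     # 1848
print(protein_length_aa(length))  # 615
```

Both results match the Gene Length and Protein Length fields of the record.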


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.555237
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.324858
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGCTTA ATATGATGTG TGGTCGTCGG CTGTCGGCAA TCAGTTTGTG CGTGGCCGTA 
ACATTCGCTC CACTGTTCAA TGCGCAGGCC GATGAGCCTG AAGTAATCCC TGGCGACAGC
CCGGTGGCTG TCAGTGAACA GGGCGAGGCA CTGCCGCAGG CGCAAGCCAC GGCAATTATG
GCGGGGATCC AGCCATTGCC TGAAGGTGCG GCAGAAAAAG CCCGCACGCA AATCGAATCT
CAATTACCCG CAGGTTATAA GCCGGTTTAT CTTAACCAGC TTCAACTGTT GTATGCCGCA
CGCGATATGC AACCCATGTG GGAAAACCGT GATGCTGTTA AAGCCTTCCA GCAACAGCTG
GCAGAGGTGG CGATTGCCGG TTTCCAGCCG CAGTTTAATA AATGGGTAGA GTTACTGACC
GATCCTGGTG TTAACGGGAT GGCACGCGAC GTCGTGCTCT CTGATGCGAT GATGGGCTAT
CTCCATTTCA TTGCAAATAT TCCGGTCAAA GGCACTCGCT GGCTATATAG CAGTAAACCT
TATGCGCTTT CAACGCCGCC GCTTTCGGTG ATTAACCAAT GGCAGCTGGC GCTGGATAAA
GGTCAATTGC CGACGTTTGT TGCAGGTCTG GCACCGCAGC ATCCGCAATA TGCAGCGATG
CATGAATCGT TACTGGCCTT ACTCAGTGAC ACCAAACCGT GGCCCCAACT GACCGGCAAA
GCAACGTTGC GCCCAGGGCA GTGGAGTAAC GACGTACCGG CGTTGCGCGA AATATTGCAA
CGCACAGGCA TGCTGGACGG GGGGCCGAAA ATTACTCTAC CTGGCGATGA CACGCCAACT
GACGCGGTAG TCAGCCCATC CGCTGTTACT GTTGAAACAG CAGAAACTAA GCCGATGGAC
AAGCAAACGA CGTCTCGTAG TAAACCTGCG CCTGCCGTTC GTGCCGCCTA CGATAATGAA
CTGGTGGAAG CCGTTAAACG TTTTCAGGCA TGGCAAGGAT TGGGAGCAGA TGGCGCTATT
GGCCCGGCAA CGCGTGACTG GTTAAACGTA ACGCCCGCCC AGCGTGCTGG CGTGTTGGCA
CTCAACATCC AGCGATTGCG CTTGCTGCCA ACAGAGCTTT CTACCGGGAT CATGGTTAAC
ATTCCGGCCT ATTCGCTGGT CTACTATCAG AACGGCAATC AGGTGCTGGA TTCGCGAGTC
ATTGTCGGTC GCCCCGATCG CAAAACGCCG ATGATGAGCA GTGCACTTAA CAACGTAGTG
GTAAACCCGC CGTGGAACGT ACCACCAACT CTGGCACGCA AAGATATTCT GCCGAAAGTG
CGCAACGATC CGGGATATCT CGAAAGCCAT GGCTATACGG TGATGCGCGG CTGGAACAGC
AGAGAAGCGA TTGACCCATG GCAGGTTGAC TGGTCTACAA TCACGGCCTC GAATTTACCG
TTCCGCTTCC AGCAGGCTCC AGGCCCACGG AACTCGCTGG GGCGCTATAA ATTCAATATG
CCGAGTTCAG AGGCCATTTA TTTGCATGAC ACGCCGAACC ACAATCTGTT CAAACGTGAT
ACACGCGCAT TGAGCTCAGG CTGTGTGCGA GTGAATAAAG CTTCCGATCT GGCGAATATG
CTGTTGCAGG ATGCAGGATG GAATGACAAA CGTATTTCTG ATGCGCTGAA GCAGGGCGAT
ACGCGTTACG TCAATATTCG GCAGTCGATT CCGGTGAATC TCTACTACCT GACGGCCTTT
GTTGGTGCAG ATGGTCGTAC CCAGTATCGT ACAGATATTT ACAATTATGA TCTGCCTGCG
CGATCCAGCT CGCAAATCGT ATCGAAAGCG GAACAATTAA TCAGGTAA
 
Protein sequence
MLLNMMCGRR LSAISLCVAV TFAPLFNAQA DEPEVIPGDS PVAVSEQGEA LPQAQATAIM 
AGIQPLPEGA AEKARTQIES QLPAGYKPVY LNQLQLLYAA RDMQPMWENR DAVKAFQQQL
AEVAIAGFQP QFNKWVELLT DPGVNGMARD VVLSDAMMGY LHFIANIPVK GTRWLYSSKP
YALSTPPLSV INQWQLALDK GQLPTFVAGL APQHPQYAAM HESLLALLSD TKPWPQLTGK
ATLRPGQWSN DVPALREILQ RTGMLDGGPK ITLPGDDTPT DAVVSPSAVT VETAETKPMD
KQTTSRSKPA PAVRAAYDNE LVEAVKRFQA WQGLGADGAI GPATRDWLNV TPAQRAGVLA
LNIQRLRLLP TELSTGIMVN IPAYSLVYYQ NGNQVLDSRV IVGRPDRKTP MMSSALNNVV
VNPPWNVPPT LARKDILPKV RNDPGYLESH GYTVMRGWNS REAIDPWQVD WSTITASNLP
FRFQQAPGPR NSLGRYKFNM PSSEAIYLHD TPNHNLFKRD TRALSSGCVR VNKASDLANM
LLQDAGWNDK RISDALKQGD TRYVNIRQSI PVNLYYLTAF VGADGRTQYR TDIYNYDLPA
RSSSQIVSKA EQLIR
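
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own procedure: it builds the codon table from the standard assignments, which translation table 11 shares for codon-to-amino-acid mapping (table 11 differs in its permitted alternative start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE`, `clean`, and `translate` are hypothetical. Only the first and last lines of the gene sequence are used to keep the example short.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above (both start in frame).
first_line = "ATGTTGCTTA ATATGATGTG TGGTCGTCGG CTGTCGGCAA TCAGTTTGTG CGTGGCCGTA"
last_line = "CGATCCAGCT CGCAAATCGT ATCGAAAGCG GAACAATTAA TCAGGTAA"

print(translate(first_line))  # MLLNMMCGRRLSAISLCVAV
print(translate(last_line))   # RSSSQIVSKAEQLIR
```

The two outputs correspond to the first 20 and the last 15 residues of the protein sequence, and the final codon TAA is the stop codon that is excluded from the 615 aa count.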