Gene EcSMS35_2276 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2276 
Symbol 
ID6147043 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2299911 
End bp2301476 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content43% 
IMG OID641617150 
Producthypothetical protein 
Protein accessionYP_001744323 
Protein GI170682431 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00656399 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.253648 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAAT TATCCCCCCC TGAACTTGAC ATAACTAAAC TGCCTGACAG GTGCCAGGCA 
TTACTTGATG AAATGCATGA AGAAACGGGA ATAAGCCGTG AAATACTGCT GTCTGTTATG
CTGACGGTCA AGGCTGTCTC GGTCCAGGAT ACGCATGAAG TTGAACTTTC TGGAGGGCAG
CGTACCAGTC TTCAGATATA TATGTGTCTT TCATCAGCAT CAGGCAGTGG AAAAACCTCT
GCCTGCGCAA AATTAATCGC CCCTGTTCAT GAAACAGAAG AAGAACTACA TCAGGCATAT
ATCGACGATA AAAAAAACTA TGATCGCATG ATGGAAATGT GGACAACAGA TAAAAAAATC
CTGGAGCGGA GATATAAAAA GGAAATGGAA AGATCCCCGG AAAATGCTGC TGCTGCACGA
GCAGCGCTTG AAAAGTGTAT AGCAAATAAA CCCGTACCTC CCGTACAACA GGTTCTCATC
GTGAATGACG CGACACCAGA AGGGATAGCT TTGAAACTCA GCCAGTCACC TTCTCTTCTG
TTGCTGTCTG ATGAAGGAGG AACTATTCTG GATAAGCGTT TCGAACGCAA ATCGGCGCTG
TACAACACAT TATGGAGTGG GCAACCAGTC ACCGTAGAAC GGGCATCCAG ACCCGGGTTC
CGGGTAAAGG ATTCCCGTCT CACTATGCTT ATACTGACTC AACCGGTAAT ATTTGATAAG
TTTTTTACTC TCACTGGCGA CCAGATCCGG GGCAATGGCT TTCTGGCCCG GGTGCTATTT
TGCGAACCCG GAGCAAATAA AATAATGACG ACAGAACCGC ATACTGCCAC TCCTGTTGTA
ACGCAGCAAT GTGCAACCTG TTTTCATGAG AAAAGTTTTG GCAGTCAGAT CAGAGACTCT
CTGAGAGCGT CCCGGGAAAG GCGTGCAAAA GGTGAGCAAC GCATCCGCAT GACATTATCA
CACGCAGCAT CCCACGATCT GGAAATCTTT CACGAAGAAA ATATGAGTGC TGTCAGGCAG
AACCCACGTA TGACCACTTT CGAAGACATT ATCGTCAGGA AAAGAGAACA GGCTGTTCGA
ATAGCCGCAC TTCTGGAGCT GGAGAACAAT CCACATGGTA CAGTAATAAC ACGAGAAAGT
ATCAATAGTG CTATTTATCT CGTTGATTTT TATTTTCAGC ATCTTATATC CAAACTGGAA
TCACTTCGGG AAATATCTCC TGCAGAAAAA CTGGATAAAT GGCTCAGAGA AAATATCATC
CGGGTAAAAG GATACGAATA TCAAAAAAGC CATATACTAC AATATGGTCC ATATGCTCTC
AGGAATAAAT GTGTTCTTGA TGAAGCACTT GATATACTTG CAGAACAGAA AAAAATCGTA
ATTGATTATT CAATCGGCCA AAAAATCATT TATATCGGTG ACGCGATTAC ACCTTGTGAA
TTAGCAAATG AGGCAAACAT CCCGATAATG GAGCGTGGAA TGTTTATAGT CTGTTGGGAC
CATAAGCTGA ATAAGTACAG AAATGAAGAA CAGACTAAAA TATGGAACAT AACTACAAAA
AATTAA
 
Protein sequence
MTKLSPPELD ITKLPDRCQA LLDEMHEETG ISREILLSVM LTVKAVSVQD THEVELSGGQ 
RTSLQIYMCL SSASGSGKTS ACAKLIAPVH ETEEELHQAY IDDKKNYDRM MEMWTTDKKI
LERRYKKEME RSPENAAAAR AALEKCIANK PVPPVQQVLI VNDATPEGIA LKLSQSPSLL
LLSDEGGTIL DKRFERKSAL YNTLWSGQPV TVERASRPGF RVKDSRLTML ILTQPVIFDK
FFTLTGDQIR GNGFLARVLF CEPGANKIMT TEPHTATPVV TQQCATCFHE KSFGSQIRDS
LRASRERRAK GEQRIRMTLS HAASHDLEIF HEENMSAVRQ NPRMTTFEDI IVRKREQAVR
IAALLELENN PHGTVITRES INSAIYLVDF YFQHLISKLE SLREISPAEK LDKWLRENII
RVKGYEYQKS HILQYGPYAL RNKCVLDEAL DILAEQKKIV IDYSIGQKII YIGDAITPCE
LANEANIPIM ERGMFIVCWD HKLNKYRNEE QTKIWNITTK N