Gene EcSMS35_1023 details


Gene Information

Locus tag: EcSMS35_1023
Symbol:
ID: 6144192
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1044280
End bp: 1046046
Gene Length: 1767 bp
Protein Length: 588 aa
Translation table: 11
GC content: 31%
IMG OID: 641615910
Product: hypothetical protein
Protein accession: YP_001743102
Protein GI: 170679679
COG category: [R] General function prediction only
COG ID: [COG5610] Predicted hydrolase (HAD superfamily)
TIGRFAM ID:
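
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic is given here, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function name cds_lengths is hypothetical and is not part of this record or of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the CDS
    ends with exactly one stop codon that is not part of the protein.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the terminal stop codon encodes one residue.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Coordinates taken from the record above.
gene_len, protein_len = cds_lengths(1044280, 1046046)
print(gene_len, protein_len)  # 1767 588
```

Under these assumptions the result agrees with the Gene Length (1767 bp) and Protein Length (588 aa) fields.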


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00221592
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAAAA CTGTTTATAC ATATGATGTA TGGGATACTA TTTTAAAACG TCATTGTTTA 
CCATCATATA CGTTAGAGGT TTCGTTGTAC TGGCTACTTT TGTATCTTGG AAAACCTGAG
AAACATAAGG ATGTGTATAA AAAAGTAAAA GCACTGGAGA GTGATTTTTT TAATCGAAAA
GAAGAATATT TTATTGAAGA GCTTATTGTT GGGATATTAA TTGAAAATAA TTTAATGAAT
GGATTTTGCA ATAACTCTGT GACTGCTGCT TGGGAACAAA TATATTATTT CGTAGAGAGA
AATTGTACAT ATAAAAATAA AGAGGTTATA GAGTTAATTG GGAGTGACTA TGGGGACAAA
TTATTTATTT CTGATTTTTA TACTGACTCC AGTTTCATAA AGAAATTGCT ACACCACCAC
CAAGTATCGG TTTTAGATGG TGTGACGTCA GCTGAGTATG GCGTCAGTAA GCATATGGGT
TCATTGTATG AAAAAATAGA ACTATTGAAG CAAAGGAAAT GGATCCATAC TGGAGATAAT
GAATTAGCTG ATATAAAAAT GGCAAAGCGT GCAGGTGCCC AAGTTAGACT TATAAAATCC
AAAAAGAAGA AAATAAATAA GTTAAAAATA AAAAATGACT ATGAGCGGTT AGCATTTATT
ATTTTGTCCT TTAGCCTGTT TATATTAAAA ACTTCCAGAA AGAAAAAGTG CAATAAGATC
TACTTTTTTA CAAGGGAGGG GGTTTTTTTC AAAAAAAACT TTGATCTTCT ACTAAACCAA
ATACCTAACA TATTCCCATC TATAACTACG GAAATATTGC CAATCAGTCG AGTAGCCAGC
TTAGCATTGA AGTTTAATGA GTCTGATGAA TTTTTAGGAT TTAATGATGC ATTAACCCAG
TATGGCTATG GAGTGTCTAC GTTTCTTAGT TTCTTTCATC TTGATGATTG TTACTCCGAG
CTCACAACAA AGTATCATGA TGTAAATGAT TTATTATCTC ACAGAAATGA TCCTTTGGTA
AAACAACTAA TAAAAAGTAT AGAGGATAAA AAAAAACGAA CAGAAAATTT TCTTACATCC
ATTGAGTTCG ATTCTGAGCC AAGTATCATC GTGGATATTG GTTGGCGGGG TAGCATCCAA
GATAATTTAA TGTCGCGGAA AAATAATTAT ATTCAACATG GATGTTATCT CGGATTGTTT
GATTTTTACC CAGGCCAGTC AGGTCAGAAT AAATCATCTG TAATTTTTGA TAACAACAGA
TTTAGACAAA AATGGACCAT GAAAGGTGTA GCTTTTATGG AGACGGTTTT CAATGCTTTG
GACGGAAGTG TTGTCGATTA TGTAAACAAT AAACCGAAAA GAAAAGAAAA TTACGCTGAG
ATAGAAAATT CAAAACATTT AATAGTTATG CAAGATGCTA TCATTGAGGA ATTTATGAAT
CTATCTATTA TGTTAAAAGA GGGAAAGTTA AATATCAATG ATATTGAAAG ATTAGCTATT
GAAAGTTACA AAAAAATAGT AACTAACCCA TCTAAAGAAA TAGCTGATTT TTATGTTGGC
AGTATCCAAA ATGAAAGCTT CGGTTTGAAT GATTTTATCG ACAAAGAATT TAACATTAGC
GCAGTAGATG TTTTCCGAAG TTTTTTTTCT AAAAGTAAAA GAAATGAAAT AAGAATAAAA
TTGCATAGAA ATGGATGGCG TGAGTCGATA CTTAAATCCT CTAAAGTATC GCTGAGCGCA
AAATTATGTT CACTTTTAAT TAGATAA
 
Protein sequence
MNKTVYTYDV WDTILKRHCL PSYTLEVSLY WLLLYLGKPE KHKDVYKKVK ALESDFFNRK 
EEYFIEELIV GILIENNLMN GFCNNSVTAA WEQIYYFVER NCTYKNKEVI ELIGSDYGDK
LFISDFYTDS SFIKKLLHHH QVSVLDGVTS AEYGVSKHMG SLYEKIELLK QRKWIHTGDN
ELADIKMAKR AGAQVRLIKS KKKKINKLKI KNDYERLAFI ILSFSLFILK TSRKKKCNKI
YFFTREGVFF KKNFDLLLNQ IPNIFPSITT EILPISRVAS LALKFNESDE FLGFNDALTQ
YGYGVSTFLS FFHLDDCYSE LTTKYHDVND LLSHRNDPLV KQLIKSIEDK KKRTENFLTS
IEFDSEPSII VDIGWRGSIQ DNLMSRKNNY IQHGCYLGLF DFYPGQSGQN KSSVIFDNNR
FRQKWTMKGV AFMETVFNAL DGSVVDYVNN KPKRKENYAE IENSKHLIVM QDAIIEEFMN
LSIMLKEGKL NINDIERLAI ESYKKIVTNP SKEIADFYVG SIQNESFGLN DFIDKEFNIS
AVDVFRSFFS KSKRNEIRIK LHRNGWRESI LKSSKVSLSA KLCSLLIR
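
The sequences above are printed in blocks of 10 characters, so the spaces and line breaks have to be stripped before any computation. The sketch below shows one way to do that, then compute a GC fraction and translate the gene. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (table 11 differs in which codons may act as starts, which this sketch does not model). The names clean_sequence, gc_fraction and translate are hypothetical helpers, not part of any library.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C (0.0 to 1.0)."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = clean_sequence("ATGAATAAAA CTGTTTATAC ATATGATGTA")
print(translate(first_blocks))  # MNKTVYTYDV
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence through clean_sequence and gc_fraction gives a value that can be compared against the GC content field.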