Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1023 |
Symbol | |
ID | 6144192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1044280 |
End bp | 1046046 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 641615910 |
Product | hypothetical protein |
Protein accession | YP_001743102 |
Protein GI | 170679679 |
COG category | [R] General function prediction only |
COG ID | [COG5610] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00221592 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAAA CTGTTTATAC ATATGATGTA TGGGATACTA TTTTAAAACG TCATTGTTTA CCATCATATA CGTTAGAGGT TTCGTTGTAC TGGCTACTTT TGTATCTTGG AAAACCTGAG AAACATAAGG ATGTGTATAA AAAAGTAAAA GCACTGGAGA GTGATTTTTT TAATCGAAAA GAAGAATATT TTATTGAAGA GCTTATTGTT GGGATATTAA TTGAAAATAA TTTAATGAAT GGATTTTGCA ATAACTCTGT GACTGCTGCT TGGGAACAAA TATATTATTT CGTAGAGAGA AATTGTACAT ATAAAAATAA AGAGGTTATA GAGTTAATTG GGAGTGACTA TGGGGACAAA TTATTTATTT CTGATTTTTA TACTGACTCC AGTTTCATAA AGAAATTGCT ACACCACCAC CAAGTATCGG TTTTAGATGG TGTGACGTCA GCTGAGTATG GCGTCAGTAA GCATATGGGT TCATTGTATG AAAAAATAGA ACTATTGAAG CAAAGGAAAT GGATCCATAC TGGAGATAAT GAATTAGCTG ATATAAAAAT GGCAAAGCGT GCAGGTGCCC AAGTTAGACT TATAAAATCC AAAAAGAAGA AAATAAATAA GTTAAAAATA AAAAATGACT ATGAGCGGTT AGCATTTATT ATTTTGTCCT TTAGCCTGTT TATATTAAAA ACTTCCAGAA AGAAAAAGTG CAATAAGATC TACTTTTTTA CAAGGGAGGG GGTTTTTTTC AAAAAAAACT TTGATCTTCT ACTAAACCAA ATACCTAACA TATTCCCATC TATAACTACG GAAATATTGC CAATCAGTCG AGTAGCCAGC TTAGCATTGA AGTTTAATGA GTCTGATGAA TTTTTAGGAT TTAATGATGC ATTAACCCAG TATGGCTATG GAGTGTCTAC GTTTCTTAGT TTCTTTCATC TTGATGATTG TTACTCCGAG CTCACAACAA AGTATCATGA TGTAAATGAT TTATTATCTC ACAGAAATGA TCCTTTGGTA AAACAACTAA TAAAAAGTAT AGAGGATAAA AAAAAACGAA CAGAAAATTT TCTTACATCC ATTGAGTTCG ATTCTGAGCC AAGTATCATC GTGGATATTG GTTGGCGGGG TAGCATCCAA GATAATTTAA TGTCGCGGAA AAATAATTAT ATTCAACATG GATGTTATCT CGGATTGTTT GATTTTTACC CAGGCCAGTC AGGTCAGAAT AAATCATCTG TAATTTTTGA TAACAACAGA TTTAGACAAA AATGGACCAT GAAAGGTGTA GCTTTTATGG AGACGGTTTT CAATGCTTTG GACGGAAGTG TTGTCGATTA TGTAAACAAT AAACCGAAAA GAAAAGAAAA TTACGCTGAG ATAGAAAATT CAAAACATTT AATAGTTATG CAAGATGCTA TCATTGAGGA ATTTATGAAT CTATCTATTA TGTTAAAAGA GGGAAAGTTA AATATCAATG ATATTGAAAG ATTAGCTATT GAAAGTTACA AAAAAATAGT AACTAACCCA TCTAAAGAAA TAGCTGATTT TTATGTTGGC AGTATCCAAA ATGAAAGCTT CGGTTTGAAT GATTTTATCG ACAAAGAATT TAACATTAGC GCAGTAGATG TTTTCCGAAG TTTTTTTTCT AAAAGTAAAA GAAATGAAAT AAGAATAAAA TTGCATAGAA ATGGATGGCG TGAGTCGATA CTTAAATCCT CTAAAGTATC GCTGAGCGCA AAATTATGTT CACTTTTAAT TAGATAA
|
Protein sequence | MNKTVYTYDV WDTILKRHCL PSYTLEVSLY WLLLYLGKPE KHKDVYKKVK ALESDFFNRK EEYFIEELIV GILIENNLMN GFCNNSVTAA WEQIYYFVER NCTYKNKEVI ELIGSDYGDK LFISDFYTDS SFIKKLLHHH QVSVLDGVTS AEYGVSKHMG SLYEKIELLK QRKWIHTGDN ELADIKMAKR AGAQVRLIKS KKKKINKLKI KNDYERLAFI ILSFSLFILK TSRKKKCNKI YFFTREGVFF KKNFDLLLNQ IPNIFPSITT EILPISRVAS LALKFNESDE FLGFNDALTQ YGYGVSTFLS FFHLDDCYSE LTTKYHDVND LLSHRNDPLV KQLIKSIEDK KKRTENFLTS IEFDSEPSII VDIGWRGSIQ DNLMSRKNNY IQHGCYLGLF DFYPGQSGQN KSSVIFDNNR FRQKWTMKGV AFMETVFNAL DGSVVDYVNN KPKRKENYAE IENSKHLIVM QDAIIEEFMN LSIMLKEGKL NINDIERLAI ESYKKIVTNP SKEIADFYVG SIQNESFGLN DFIDKEFNIS AVDVFRSFFS KSKRNEIRIK LHRNGWRESI LKSSKVSLSA KLCSLLIR
|
| |