Gene Information |
Locus tag | EcSMS35_4320 |
Symbol | |
ID | 6144586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4417598 |
End bp | 4418803 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641619141 |
Product | hypothetical protein |
Protein accession | YP_001746265 |
Protein GI | 170681446 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
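The length fields in this record follow from its coordinates. The sketch below is a minimal check, assuming that Start bp and End bp are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Coordinates copied from the record above.
START_BP = 4417598
END_BP = 4418803


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(n_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1206 401
```

Both values agree with the Gene Length and Protein Length fields listed above.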
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.40643 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.00116448 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCATTTTA ATGCAGAGTT ATGCCCAGCA TTTTTGTACA CTTCGATGTA TCAAATGCGC TGCAAACGAT CAAATATGGA TGTTTTATCA AGCATCCCCC AAAAGATATT TACATCATCC CATGAGGTTA AGATGGATAA CAAAATCGTA GAAATTGAGA CAAATAAGCT TGATTTTGAC CCTAAAAACC CACGTTTCTT TCGTCTCAAT GATGCCAGTA ACGCTGCAAC AGTCATTGAG GAAATGTTAG ATGACGAAAG TGTCCACGAT CTAATGCTAT CAATCGGTCA GCAAGGTTAC TTTCCTGGAG AACCTTTATT GGCAGTAAAA AGCAATGGAA ACTACATCGT GGTTGAGGGA AACAGACGCT TAGCTGCTGT AAAGTTGCTC AATGGAGATC TGCTTCCTCC AAAAAGAAAA CTTAAAGGTG TGCAAGAAAT CATTGATGAT ACTACCAATA AACCTAAGAA GCTTCCCTGC ATCATTTATG AAAACCGAGA GGATGTACTG AGATATATCG GTTATCGTCA TATAACTGGG GTCAAAGAAT GGGACTCATT ATCTAAAGCC AAATACCTTA AAGAGTTATG TGATACTTTT TATTCACATG AGCCTAAAGA GATAGTATTA AAAAATCTGG CTCGTGAGAT TGGGAGTAAA CCACATTATG TTGCAACACT TCTCACTGCA CTGAACTTAT ATGAAGTCGC GCATGACCAT GAGTTTTTTA ATTTACCCAT GAAGGCTTCT GACGTGGAAT TTTCATATAT AACCACAGCT TTGGGATATT CAAAAATCAC AAACTGGTTA GGTCTACAGG ATAAAAAGGA TTTTTTAGAC CCAAATTTAA ATGAAGAAAA CCTTAAGCGT TTATTCTCTT GGTTTTTTGT GCCTGACCAA CAAGGTAGAA CCATCATCGG TGAGTCTCGA AGAATAAAAG ATATTGCAGC AGTGGTTGAG AAACCCGAAG CAATTGAAAT TCTCATGAAA AGTTCAAACT TGGATGAAGC ATATCTATAT ACCAGCGGAG AAAGAGAAGC ATTAGATAAA GCACTAAACG CAGCTAGTGT TAAATTAAGA GTAGTTTGGG ATATGCTACT TAAAGCTAAA GAATTAACAT TAGAGCATGA AGAGGCTGCA TCTGAAATTT TTGAGATGTC AAAAAATATT AGAAATCAGA TCAGAAGCAA AAGGGAGGAT GATTGA
|
Protein sequence | MHFNAELCPA FLYTSMYQMR CKRSNMDVLS SIPQKIFTSS HEVKMDNKIV EIETNKLDFD PKNPRFFRLN DASNAATVIE EMLDDESVHD LMLSIGQQGY FPGEPLLAVK SNGNYIVVEG NRRLAAVKLL NGDLLPPKRK LKGVQEIIDD TTNKPKKLPC IIYENREDVL RYIGYRHITG VKEWDSLSKA KYLKELCDTF YSHEPKEIVL KNLAREIGSK PHYVATLLTA LNLYEVAHDH EFFNLPMKAS DVEFSYITTA LGYSKITNWL GLQDKKDFLD PNLNEENLKR LFSWFFVPDQ QGRTIIGESR RIKDIAAVVE KPEAIEILMK SSNLDEAYLY TSGEREALDK ALNAASVKLR VVWDMLLKAK ELTLEHEEAA SEIFEMSKNI RNQIRSKRED D
|
| |
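The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below is one way to do this, under these assumptions: the gene sequence is already written in the coding direction (it begins with ATG even though the gene lies on the minus strand), the spaces are only display grouping, and the amino acid assignments of table 11 match the standard code. Alternative start codons, which table 11 permits, are not handled because this gene starts with ATG. All names are illustrative.

```python
# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence, as printed in the record.
print(translate("ATGCATTTTA ATGCAGAGTT ATGCCCAGCA"))  # prints: MHFNAELCPA
# Last four codons of the gene sequence, ending in the TGA stop.
print(translate("GAGGATGATTGA"))  # prints: EDD
```

Passing the full Gene sequence field to `translate` should give the Protein sequence field once the spacing is removed from both.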