Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2307 |
Symbol | |
ID | 6145923 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2337411 |
End bp | 2338499 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641617181 |
Product | hypothetical protein |
Protein accession | YP_001744354 |
Protein GI | 170683507 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0524] Sugar kinases, ribokinase family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.945779 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.00150581 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATAACC GGGAGAAGGA GATCCTTGCA ATATTAAGGC GTAACCCGCT GATTCAGCAG AACGAAATTG CGGATATGCT GCAAATCAGC CGTTCGCGTG TTGCAGCGCA TATTATGGAT TTAATGCGCA AAGGACGGAT TAAAGGCAAA GGTTACATTC TCACCGAGCA GGAATACTGC GTAGTGGTGG GGACAATCAA TATGGATATT CGCGGGATGG CGGATATCCG TTACCCGCAA GCGGCTTCTC ATCCCGGTAC CATTCATTGC TCGGCAGGCG GCGTTGGACG CAATATCGCC CACAATCTGG CGCTGTTAGG CCGCGACGTT CATTTGCTTT CAGTGATTGG CGATGACTTT TATGGCGAAA TGCTCCTGGA AGAAACGCGC CGCGCCGGCG TGAATGTCTC CGGCTGCGTT CGTTTACATG GTCAAAGCAC ATCGACGTAT CTGGCAATTG CCAATCGAGA CGATCAAACC GTGCTGGCGA TTAACGATAC CCATCTGCTG GATCAGTTGA CACCGCAACT ACTGAACGGG TCGCGCGATT TACTTCGTCA TGCGGGCGTG GTACTGGCAG ATTGTAACCT GACAGCCGAG GCGCTGGAAT GGGTCTTTAC CCTCGCTGAT GAAATCCCGG TGTTTGTCGA TACCGTTTCA GAATTCAAAG CGGGCAAAAT CAAACACTGG CTGGCGCATA TTCACACTCT GAAACCCACT TTGTCGGAGC TGGAAATTTT ATGGGGCCAG CCGATAACCC GCGATGCTGA TCGTAATGCC GCAGTGAATG CGTTGCATCA GCAAGGCGTT CAGCAACTGT TTGTTTATTT GCCCGATGAG TCTGTTTATT GCAGCCAAAA GGATGGCGAA CAATTTTTGC TGACTGCGCC AGCGCATACG ACGGTAGACA GTTTTGGTGC TGACGATGGT TTTATGGCGG GCCTGGTGTA TAGCTTTCTG GAAGGAAGCA GTTTCCGTGA CAGCGCCCGT TTTGCGATGG CCTGCGCGGC AATTTCACGC GCCAGCGGCA GCTTAAACAA CCCTACCCTG TCTGCCGATA ACGCGCTTTC ATTAGTGCCG ATGGTGTAA
|
Protein sequence | MNNREKEILA ILRRNPLIQQ NEIADMLQIS RSRVAAHIMD LMRKGRIKGK GYILTEQEYC VVVGTINMDI RGMADIRYPQ AASHPGTIHC SAGGVGRNIA HNLALLGRDV HLLSVIGDDF YGEMLLEETR RAGVNVSGCV RLHGQSTSTY LAIANRDDQT VLAINDTHLL DQLTPQLLNG SRDLLRHAGV VLADCNLTAE ALEWVFTLAD EIPVFVDTVS EFKAGKIKHW LAHIHTLKPT LSELEILWGQ PITRDADRNA AVNALHQQGV QQLFVYLPDE SVYCSQKDGE QFLLTAPAHT TVDSFGADDG FMAGLVYSFL EGSSFRDSAR FAMACAAISR ASGSLNNPTL SADNALSLVP MV
|
| |