Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4707 |
Symbol | |
ID | 6145528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4807272 |
End bp | 4808294 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641619523 |
Product | putative sugar ABC transporter, permease protein |
Protein accession | YP_001746631 |
Protein GI | 170680997 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1172] Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCAAT CTCTCCCGGA CACTACGCCG TCGAAAAGGC GCTTTCGCTG GCCAACGGGA ATGCCGCAGC TGGCGGCACT ATTGCTGGTG CTGCTGGTCG ACAGCCTGGT AGCCCCGCAT TTCTGGCAAG TGGTGCTTCA GGACGGGCGT TTGTTCGGTA GCCCCATCGA CATTCTTAAC CGTGCGGCAC CCGTTGCGCT GTTGGCGATT GGCATGACGC TGGTGATCGC CACCGGTGGT ATCGATCTCT CCGTAGGGGC AGTGATGGCT ATCGCCGGAG CGACTACCGC TGCGATGACG GTCGCGGGAT TTAGCCTGCC GATTGTTTTG TTAAGCGCCC TGGGCACCGG CATCCTGGCG GGATTGTGGA ACGGCATTCT GGTGGCAATC CTTAAAATTC AGCCGTTTGT CGCCACCCTG ATCCTGATGG TTGCCGGGCG CGGCGTGGCG CAGTTGATCA CCGCCGGGCA GATCGTCACG TTTAACTCGC CGGATCTCTC ATGGTTCGGC AGCGGATCGC TGTTGTTCCT GCCAACGCCC GTCATTATTG CGGTGCTGAC GCTTCTCCTC TTCTGGCTGT TGACCCGCAA AACGGCGCTG GGAATGTTTA TCGAAGCCGT TGGTATCAAC ATTCGGGCGG CAAAAAATGC CGGAGTTAAC ACGCGAATCA TCGTCATGCT CACCTACGTG TTGAGCGGGC TATGTGCGGC GATTGCGGGC ATTATCGTGG CGGCGGATAT TCGCGGTGCC GATGCCAATA ACGCCGGGTT ATGGCTGGAG CTGGACGCCA TTCTCGCGGT GGTGATTGGC GGCGGATCGC TGATGGGCGG GCGCTTTAAC CTGCTGCTTT CGGTGGTGGG GGCGCTGATT ATTCAGGGAA TGAACACCGG AATTTTGCTT TCTGGCTTTC CGCCGGAGAT GAACCAGGTC GTAAAAGCGG TGGTGGTGCT TTGTGTGCTG ATTGTTCAGT CGCAACGCTT TATCAGTCTG ATTAAAGGGG TACGTAACCG TGATAAAACG TAA
|
Protein sequence | MPQSLPDTTP SKRRFRWPTG MPQLAALLLV LLVDSLVAPH FWQVVLQDGR LFGSPIDILN RAAPVALLAI GMTLVIATGG IDLSVGAVMA IAGATTAAMT VAGFSLPIVL LSALGTGILA GLWNGILVAI LKIQPFVATL ILMVAGRGVA QLITAGQIVT FNSPDLSWFG SGSLLFLPTP VIIAVLTLLL FWLLTRKTAL GMFIEAVGIN IRAAKNAGVN TRIIVMLTYV LSGLCAAIAG IIVAADIRGA DANNAGLWLE LDAILAVVIG GGSLMGGRFN LLLSVVGALI IQGMNTGILL SGFPPEMNQV VKAVVVLCVL IVQSQRFISL IKGVRNRDKT
|
| |