Gene Information |
Locus tag | EcSMS35_2800 |
Symbol | proW |
ID | 6143762 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2881497 |
End bp | 2882561 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641617669 |
Product | glycine betaine transporter membrane protein |
Protein accession | YP_001744829 |
Protein GI | 170680258 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4176] ABC-type proline/glycine betaine transport system, permease component |
TIGRFAM ID | |
| |
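The Gene Length and Protein Length rows can be cross-checked against the Start bp and End bp rows. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, though it is consistent with the values listed) and that the last codon of the CDS is a stop codon. The function names are hypothetical and are not part of the database record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(2881497, 2882561)
print(length, protein_length_aa(length))
```

With the coordinates from this record, the sketch gives 1065 bp and 354 aa, matching the table.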
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.591808 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.0101611 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGATC AAAATAATCC GTGGGATACC ACGCCAGCGG CGGACAGTGC TGCACAATCC GCAGACGCCT GGGGTACACC GGCGACTGCA CCGACTGACG GCGGTGGCGC TGACTGGCTG ACCAGTACGC CTGCGCCAAA CGTCGAGCAT TTTAATATTC TCGATCCGTT CCATAAAACG CTAATCCCGC TCGACAGTTG GGTCACTGAA GGGATCGACT GGGTCGTTAC CCATTTTCGT CCCGTCTTTC AGGGCGTGCG CCTTCCGGTT GATTACATCC TCAACGGTTT CCAGCAATTG CTGCTGGGTA TGCCCGCGCC GGTGGCGATT ATCGTTTTCG CTCTCATCGC CTGGCAGATT TCCGGGGTCG GAATGGGCGT GGCGACGCTG GTTTCGCTGA TTGCCATCGG CGCAATCGGT GCCTGGTCGC AGGCCATGGT TACCCTGGCG CTGGTGTTAA CCGCCCTGCT GTTCTGTATC GTCATAGGTT TGCCGTTGGG GATCTGGCTG GCGAGAAGTC CGCGAGCGGC GAAAATTATT CGTCCACTGC TTGATGCCAT GCAGACCACG CCCGCGTTTG TTTATCTGGT GCCAATCGTC ATGCTGTTTG GTATCGGTAA CGTGCCGGGC GTGGTGGTGA CAATCATCTT TGCGCTGCCG CCGATTATCC GTCTGACGAT TCTGGGAATT AACCAGGTTC CGGCGGATCT GATTGAAGCC TCGCGCTCAT TCGGTGCCAG CCCGCGCCAG ATGCTGTTCA AAGTTCAGTT ACCACTGGCG ATGCCAACCA TTATGGCGGG CGTTAACCAG ACGCTGATGC TGGCCCTTTC TATGGTGGTC ATCGCCTCGA TGATTGCCGT CGGCGGGCTG GGTCAGATGG TACTTCGCGG TATCGGTCGT CTGGATATGG GGCTTGCCAC CGTTGGCGGC GTCGGGATTG TGATCCTCGC CATTATCCTC GACCGCCTGA CGCAGGCCGT TGGGCGCGAC TCACGCAGTC GCGGCAACCG TCGCTGGTAC ACCACTGGCC CTGTCGGTCT GCTGACCCGC CCATTCATTA AGTAA
|
Protein sequence | MADQNNPWDT TPAADSAAQS ADAWGTPATA PTDGGGADWL TSTPAPNVEH FNILDPFHKT LIPLDSWVTE GIDWVVTHFR PVFQGVRLPV DYILNGFQQL LLGMPAPVAI IVFALIAWQI SGVGMGVATL VSLIAIGAIG AWSQAMVTLA LVLTALLFCI VIGLPLGIWL ARSPRAAKII RPLLDAMQTT PAFVYLVPIV MLFGIGNVPG VVVTIIFALP PIIRLTILGI NQVPADLIEA SRSFGASPRQ MLFKVQLPLA MPTIMAGVNQ TLMLALSMVV IASMIAVGGL GQMVLRGIGR LDMGLATVGG VGIVILAIIL DRLTQAVGRD SRSRGNRRWY TTGPVGLLTR PFIK
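The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate the gene sequence for comparison with the protein sequence. It is an illustration only: the helper names are hypothetical, and the codon table is built by hand rather than taken from any particular library. Translation table 11 assigns the same residues to codons as the standard code and differs only in the permitted start codons; because this gene starts with ATG, no special handling of the first codon is needed here.

```python
BASES = "TCAG"
# Residues for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGGCTGATC AAAATAATCC GTGGGATACC"
print(translate(first_blocks))  # MADQNNPWDT, the first protein block
```

Passing the full Gene sequence field to `translate` and `gc_fraction` in the same way allows the Protein sequence and GC content rows to be checked.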