Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1387 |
Symbol | |
ID | 6143964 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1375452 |
End bp | 1377062 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641616265 |
Product | putative transporter |
Protein accession | YP_001743445 |
Protein GI | 170681413 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1292] Choline-glycine betaine transporter |
TIGRFAM ID | [TIGR00842] choline/carnitine/betaine transport |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.216138 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAGCA ATGTTAAGAA AAAAGATGTG CCGCTGATAA GCATCAGTCT GGTGGCCATT CTTTTCATCG CAGCTGCATT AAGCCTTTTC CCACAACAAT CGGCCGACGC GGCCAACGCT ATATACACTT TTGTTACGCG TACGTTAGGT TCCGCCGTAC AGGTATTGGT TTTGCTGGCA ATGGGACTGG TGATTTATTT AGCCACCAGT AAATACGGCA ATATTCGTCT TGGCGAAGGA AAACCGGAAT ACAGCACGCT CTCCTGGCTG TTTATGTTTA TTTGTGCCGG TTTAGGTTCT TCTACGCTTT ATTGGGGGGT TGCTGAATGG GCCTATTATT ATCAAACACC TGGATTAAAT ATCGCACCGC GTTCACAACA GGCACTCGAA TTTAGCGTCC CTTACTCCTT CTTCCACTGG GGCATCAGCG CCTGGGCAAC TTATACGCTG GCCTCATTAA TCATGGCTTA TCACTTTCAT GTGCGGAAAA ACAAAGGTCT GAGTCTTTCC GGCATTATTG CCGCGATTAC CGGCGTTCGC CCGCAAGGCC CGTGGGGAAA ACTGGTCGAT TTGATGTTCC TGATCGCCAC TGTCGGCGCA CTGACCATTT CCCTTGTTGT CACCGCAGCA ACCTTTACCC GTGGGCTTTC CGCGCTGACC GGTTTACCCG ATAATTTCAC CGTGCAGGCA TTTGTGATCC TGCTTTCCGG CGGCATTTTT TGCCTAAGCT CATGGATTGG TATCAACAAC GGTTTGCAAC GTCTGAGCAA AATGGTTGGC TGGGGCGCGT TCCTGCTGCC ATTACTGGTG CTCATTGTCG GCCCAACCGA ATTTATTACC AACAGCATCA TCAATGCCAT CGGCCTGACC ACGCAAAACT TCCTGCAAAT GAGCTTATTC ACCGATCCGC TTGGCGATGG TTCATTTACC CGCAACTGGA CCGTTTTCTA CTGGCTGTGG TGGATCTCAT ACACCCCTGG CGTAGCAATG TTTGTCACCC GCGTTTCCCG CGGTCGTAAG ATTAAAGAAG TTATCTGGGG ACTGATCCTC GGCAGCACCG TCGGTTGCTG GTTCTTCTTC GGCGTAATGG AAAGCTATGC CATTCACCAG TTTATCAATG GCGTAATCAA CGTCCCACAG GTGCTGGAAA CACTGGGCGG CGAGACAGCT GTGCAGCAAG TTCTGATGTC GTTGCCAGCC GGTAAATTGT TCCTCGCCGC ATACCTGGGC GTGATGATTA TTTTCCTTGC CTCGCATATG GATGCAGTGG CCTACACCAT GGCGGCGACC AGTACGCGTA ATCTCCAGGA AGGTGACGAT CCTGACCGTG GGCTGCGTCT TTTCTGGTGC GTGGTAATTA CCCTGATCCC GCTTTCAATT CTTTTCACTG GCGCGTCGCT GGAAACGATG AAAACCACCG TCGTACTCAC AGCCCTTCCG TTCCTCGTCA TTTTACTGGT GAAAGTCGGC GGATTTATTC GCTGGCTGAA ACAGGATTAC GCCGACATTC CGGCTCATCA AGTTGAATAT TATCTCCCGC AGACACCGGT TGAAGCCCTG GAAAAAACAC CAGTGCTCCC TGCGGGAACC GTATTCAAAG GCGACAACTG A
|
Protein sequence | MMSNVKKKDV PLISISLVAI LFIAAALSLF PQQSADAANA IYTFVTRTLG SAVQVLVLLA MGLVIYLATS KYGNIRLGEG KPEYSTLSWL FMFICAGLGS STLYWGVAEW AYYYQTPGLN IAPRSQQALE FSVPYSFFHW GISAWATYTL ASLIMAYHFH VRKNKGLSLS GIIAAITGVR PQGPWGKLVD LMFLIATVGA LTISLVVTAA TFTRGLSALT GLPDNFTVQA FVILLSGGIF CLSSWIGINN GLQRLSKMVG WGAFLLPLLV LIVGPTEFIT NSIINAIGLT TQNFLQMSLF TDPLGDGSFT RNWTVFYWLW WISYTPGVAM FVTRVSRGRK IKEVIWGLIL GSTVGCWFFF GVMESYAIHQ FINGVINVPQ VLETLGGETA VQQVLMSLPA GKLFLAAYLG VMIIFLASHM DAVAYTMAAT STRNLQEGDD PDRGLRLFWC VVITLIPLSI LFTGASLETM KTTVVLTALP FLVILLVKVG GFIRWLKQDY ADIPAHQVEY YLPQTPVEAL EKTPVLPAGT VFKGDN
|
| |