Gene Information |
Locus tag | EcE24377A_2027 |
Symbol | |
ID | 5586358 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 2010621 |
End bp | 2012231 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640925698 |
Product | putative transporter |
Protein accession | YP_001463101 |
Protein GI | 157154828 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1292] Choline-glycine betaine transporter |
TIGRFAM ID | [TIGR00842] choline/carnitine/betaine transport |
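The coordinate and length fields above are mutually consistent, and the relationship can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the Protein Length excludes the stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2010621
END_BP = 2012231
GENE_LENGTH_BP = 1611
PROTEIN_LENGTH_AA = 536


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))      # 1611
    print(coding_length_aa(GENE_LENGTH_BP))   # 536
```

Under these assumptions, 2012231 - 2010621 + 1 gives the 1611 bp gene length, and 1611 / 3 = 537 codons, one of which is the stop codon, gives the 536 aa protein length.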
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.12575 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAGCA ATGTTAAGAA AAAAGATGTG CCGCTGATAA GCATCAGCCT GGTGGCCATT CTTTTCATCG CAGCTGCATT AAGCCTTTTC CCACAACAAT CGGCCGACGC GGCCAACGCA ATATACACTT TTGTTACGCG TACGTTAGGT TCCGCCGTAC AGGTATTGGT TTTGCTGGCA ATGGGACTGG TGATTTATTT AGCCACCAGT AAATACGGCA ATATTCGTCT TGGCGAAGGA AAACCGGAAT ACAGCACGCT CTCCTGGCTG TTTATGTTTA TTTGTGCCGG TTTAGGTTCT TCTACGCTTT ATTGGGGGGT TGCTGAATGG GCCTATTATT ATCAAACGCC TGGATTAAAT ATCGCACCGC GTTCACAACA GGCACTCGAA TTTAGCGTCC CTTACTCCTT CTTCCACTGG GGCATCAGCG CCTGGGCAAC TTATACGCTG GCCTCATTAA TCATGGCTTA TCACTTTCAT GTGCGTAAAA ACAAAGGTCT GAGTCTTTCC GGCATTATTG CCGCGATTAC CGGCGTTCGC CCGCAAGGCC CATGGGGAAA ACTGGTTGAT TTGATGTTCC TGATCGCCAC TGTCGGCGCA CTGACTATTT CCCTTGTTGT CACCGCAGCA ACCTTTACTC GTGGACTTTC CGCGCTGACC GGTTTACCCG ATAACTTCAC CGTGCAGGCA TTTGTGATCC TGCTTTCCGG CGGCATTTTT TGCCTAAGCT CGTGGATTGG TATCAACAAC GGTTTGCAAC GTCTGAGCAA AATGGTTGGC TGGGGCGCGT TCCTGCTGCC ATTACTGGTG CTGATTGTCG GCCCAACCGA ATTTATTACC AGCAGCATCA TCAATGCCAT CGGTCTGACC ACGCAAAACT TCCTGCAAAT GAGCTTATTC ACCGATCCGC TTGGCGATGG TTCATTTACC CGCAACTGGA CCGTTTTCTA CTGGCTGTGG TGGATCTCAT ACACCCCTGG CGTAGCAATG TTTGTCACCC GCGTTTCCCG CGGTCGTAAG ATTAAAGAAG TTATCTGGGG ACTGATCCTC GGCAGCACCG TCGGTTGCTG GTTCTTCTTC GGCGTAATGG AAAGCTATGC CATTCATCAG TTTATCAATG GCGTAATCAA CGTCCCACAG GTGCTGGAAA CACTGGGCGG CGAAACAGCT GTACAGCAAG TTCTGATGTC GTTGCCAGCC GGTAAATTGT TCCTCGCCGC ATACCTGGGC GTGATGATTA TTTTCCTTGC CTCGCATATG GATGCAGTGG CCTACACCAT GGCGGCGACC AGTACGCGTA ATCTCCAGGA AGGTGACGAT CCTGACCGTG GGCTGCGTCT TTTCTGGTGC GTGGTGATCA CTCTGATCCC GCTTTCCATC TTGTTTACCG GTGCTTCGCT GGAAACGATG AAAACCACCG TCGTGCTCAC AGCCCTTCCC TTCCTCGTCA TTTTACTGGT GAAAGTCGGC GGATTTATTC GCTGGCTGAA ACAGGATTAC GCCGACATTC CGGCTCATCA AGTTGAACAT TATCTCCCGC AGACACCGGT TGAAGCCCTG GAAAAAACAC CAGTGCTCCC TGCGGGAACC GTATTCAAAG GCGACAACTG A
|
Protein sequence | MMSNVKKKDV PLISISLVAI LFIAAALSLF PQQSADAANA IYTFVTRTLG SAVQVLVLLA MGLVIYLATS KYGNIRLGEG KPEYSTLSWL FMFICAGLGS STLYWGVAEW AYYYQTPGLN IAPRSQQALE FSVPYSFFHW GISAWATYTL ASLIMAYHFH VRKNKGLSLS GIIAAITGVR PQGPWGKLVD LMFLIATVGA LTISLVVTAA TFTRGLSALT GLPDNFTVQA FVILLSGGIF CLSSWIGINN GLQRLSKMVG WGAFLLPLLV LIVGPTEFIT SSIINAIGLT TQNFLQMSLF TDPLGDGSFT RNWTVFYWLW WISYTPGVAM FVTRVSRGRK IKEVIWGLIL GSTVGCWFFF GVMESYAIHQ FINGVINVPQ VLETLGGETA VQQVLMSLPA GKLFLAAYLG VMIIFLASHM DAVAYTMAAT STRNLQEGDD PDRGLRLFWC VVITLIPLSI LFTGASLETM KTTVVLTALP FLVILLVKVG GFIRWLKQDY ADIPAHQVEH YLPQTPVEAL EKTPVLPAGT VFKGDN
|
| |
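The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned, summarized, and translated is given below. It is an illustration under stated assumptions, not the method used to produce this record: the helper names are hypothetical, the codon assignments are those of the standard genetic code (which translation table 11 shares, differing only in permitted start codons), and only the first and last blocks of the gene sequence are used as sample input.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# First three and last three blocks of the Gene sequence field above.
FIRST_BLOCKS = "ATGATGAGCA ATGTTAAGAA AAAAGATGTG"
LAST_BLOCKS = "GTATTCAAAG GCGACAACTG A"


def clean_sequence(field: str) -> str:
    """Remove the block spacing and normalize to upper case."""
    return "".join(field.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(cds: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    print(translate(clean_sequence(FIRST_BLOCKS)))  # MMSNVKKKDV
    print(translate(clean_sequence(LAST_BLOCKS)))   # VFKGDN
```

The two translated fragments match the first and last residues of the Protein sequence field. Applying gc_percent to the full cleaned gene sequence would give a value to compare against the GC content listed in the Gene Information table; that figure is not recomputed here.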