Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1889 |
Symbol | |
ID | 5592421 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 1901225 |
End bp | 1902835 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640921031 |
Product | putative transporter |
Protein accession | YP_001458583 |
Protein GI | 157161265 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1292] Choline-glycine betaine transporter |
TIGRFAM ID | [TIGR00842] choline/carnitine/betaine transport |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 48 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAGCA ATGTTAAGAA AAAAGATGTG CCGCTGATAA GCATCAGCCT GGTGGCCATT CTTTTCATCG CAGCTGCATT AAGCCTTTTC CCACAACAAT CGGCCGACGC GGCCAACGCA ATATACACTT TTGTTACTCG TACGTTAGGT TCCGCCGTAC AGGTATTGGT TTTGCTGGCA ATGGGACTGG TGATTTATTT AGCCACCAGT AAATACGGCA ATATTCGTCT TGGCGAAGGA AAACCGGAAT ACAGCACGCT CTCCTGGCTG TTTATGTTTA TTTGTGCCGG TTTAGGTTCT TCTACGCTTT ATTGGGGGGT TGCTGAATGG GCCTATTATT ATCAAACACC TGGATTAAAT ATCGCACCGC GTTCACAACA GGCACTCGAA TTTAGCGTTC CCTACTCTTT CTTCCACTGG GGCATCAGCG CCTGGGCAAC TTATACACTG GCCTCATTAA TCATGGCTTA TCACTTTCAT GTGCGGAAAA ACAAAGGTCT GAGCCTTTCC GGCATTATTG CTGCTATTAC CGGCGTTCGC CCGCAAGGCC CATGGGGAAA ACTGGTCGAT TTGATGTTCC TGATCGCCAC TGTCGGCGCA CTGACCATTT CCCTTGTTGT TACCGCAGCA ACCTTTACCC GTGGGCTTTC CGCGCTGACC GGTTTACCCG ATAACTTCAC CGTGCAGGCA TTTGTGATCC TGCTTTCCGG CGGCATTTTT TGCCTAAGCT CGTGGATTGG TATCAACAAC GGTTTGCAAC GTCTGAGCAA AATGGTTGGC TGGGGCGCGT TCCTGCTGCC ATTACTGGTG CTGATTGTCG GCCCAACCGA ATTTATTACC AACAGCATCA TCAATGCCAT CGGCCTGACC ACGCAAAACT TCCTGCAAAT GAGCTTATTC ACCGATCCGC TTGGCGATGG TTCATTTACC CGCAACTGGA CCGTTTTCTA CTGGCTGTGG TGGATCTCAT ACACCCCTGG CGTAGCAATG TTTGTCACCC GCGTTTCCCG CGGTCGTAAG ATTAAAGAAG TTATCTGGGG ACTGATCCTC GGCAGCACCG TCGGTTGCTG GTTCTTCTTT GGCGTAATGG AAAGCTATGC CATTCATCAG TTTATCAATG GCGTAATCAA CGTCCCACAG GTGCTGGAAA CACTGGGCGG CGAAACAGCT GTACAGCAAG TTCTGATGTC GTTGCCAGCC GGTAAATTGT TCCTCGCCGC ATACCTGGGC GTGATGATTA TTTTCCTTGC CTCGCATATG GATGCAGTGG CCTACACCAT GGCGGCGACC AGTACGCGTA ATCTCCAGGA AGGTGACGAT CCTGACCGTG GGCTGCGTCT TTTCTGGTGC GTGGTGATCA CTCTGATCCC GCTTTCCATC TTGTTTACCG GTGCTTCGCT GGAAACGATG AAAACCACCG TCGTGCTCAC AGCCCTTCCC TTCCTCGTCA TTTTACTGGT GAAAGCCGGC GGGTTTATTC GCTGGCTGAA ACAGGATTAC GCCGACATTC CGGCTCATCA AGTTGAACAT TATCTCCCGC AGACACCGGT TGAAGCCCTG GAAAAAACGC CAGTGCTCCC TGCGGGAACC GTATTCAAAG GCGACAACTG A
|
Protein sequence | MMSNVKKKDV PLISISLVAI LFIAAALSLF PQQSADAANA IYTFVTRTLG SAVQVLVLLA MGLVIYLATS KYGNIRLGEG KPEYSTLSWL FMFICAGLGS STLYWGVAEW AYYYQTPGLN IAPRSQQALE FSVPYSFFHW GISAWATYTL ASLIMAYHFH VRKNKGLSLS GIIAAITGVR PQGPWGKLVD LMFLIATVGA LTISLVVTAA TFTRGLSALT GLPDNFTVQA FVILLSGGIF CLSSWIGINN GLQRLSKMVG WGAFLLPLLV LIVGPTEFIT NSIINAIGLT TQNFLQMSLF TDPLGDGSFT RNWTVFYWLW WISYTPGVAM FVTRVSRGRK IKEVIWGLIL GSTVGCWFFF GVMESYAIHQ FINGVINVPQ VLETLGGETA VQQVLMSLPA GKLFLAAYLG VMIIFLASHM DAVAYTMAAT STRNLQEGDD PDRGLRLFWC VVITLIPLSI LFTGASLETM KTTVVLTALP FLVILLVKAG GFIRWLKQDY ADIPAHQVEH YLPQTPVEAL EKTPVLPAGT VFKGDN
|
| |