Gene Information |
Locus tag | EcSMS35_4862 |
Symbol | |
ID | 6145795 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4970675 |
End bp | 4972216 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641619666 |
Product | putative carnitine transporter CglC |
Protein accession | YP_001746773 |
Protein GI | 170682254 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1292] Choline-glycine betaine transporter |
TIGRFAM ID | [TIGR00842] choline/carnitine/betaine transport |
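The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch that assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the numbers are consistent with that convention) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4970675
end_bp = 4972216
gene_length_bp = 1542
protein_length_aa = 513

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is made of whole codons; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

All three checks print `True` for this record: 4972216 - 4970675 + 1 = 1542 bp, and 1542 / 3 = 514 codons, one of which is the stop codon.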
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.905862 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAATATT TAAAAAAACG CTTTTCTTTG ATTGAACTGA ATGTGTTTAT CCCAGCAATA TTATTTATTG CGGTAATCAT TCTGTGTTTG ACGATATATC CACAAGATAC CAGCCGGTAT ATAAATAAAA TACATCATTT TTTAACCTGG GAAATGGGCG GGATTTTTCT GGTCATGACT TTTCTGGTTG TACTTTGTTG TCTGTGGCTG GCCTTCTCAC GTTATGGCGA TATCTTGTTA GGTCAGTCGG GAGAAAAGCC TGACTTCAGC TTGTTAACCT GGCTTGGACT TATTTTTACT TCTGGAACGG GGGGCAGCTT GTTATATCTG GCCTCTGTAG AGTGGATTTG GATCATTCAG CAACCGCCTT TTGGCGCGAC AGCAGGAAGC GCTCAGGCTG CTCGTTGGGC CTCTGCTTAT GGCATGTTCC ATTGGGGGCC GTCCGCATGG GCATGGTATC TGATTTGTGC CGTTCCCATT GGTTGGTTTA TGCATGTTAA GAAAACGAAC TCATTAAAGG TCAGTGATTT ATGCCGTGGG TGTCTGGGGG CACGTGCTGA TGGTTTTTGC GGGCATTGTG TGAATTTTTT CTACATGTTT GGTTTGCTCG GCGGCGCGGT AACGTCTCTG GCGCTGGGAA CGCCGATGAT TTCTGCCGTA TTTTGCCATG TGTTCCATCT GGATCCTGCC GGGCAGTTTA TCAATGTCAT GGTTATTTTT ATCTGGACGC TAGTGCCATT ATTAATTCTC TTTTTTGGAC TTAAAAAAGG TGTGGCATGG GCCAGTAACT GGAATATTCG TGCCGATATT TTTATGCTAC TGGCAATACT GATTTGTGGA CCGACAGCTT TTATACTTAA CCAATCAATT GATGGTTTCG GCCTGATGCT GCAAAATTTT GTAGCGATGA GTTTAAGTAC CGATGCTATT GGTCGTAGCG GATTTCCACA GATGTGGACC GTATTTTATT TTTCATGGTG GGTCGTGTAT GCCATCCCAT TCGGTTTATT TATCGCCCGT ATTTCAAAAG GAAGGACGAT CCGGCAATTG ATTGTATGTG GAACTCTGGC AGGTTCATTG GGATGCATGG TTTTTTACAT GGTACTGGCT AATTTCGGCT TATCGTTGCA GACAACTCAT GTCATCGATT TTGTTCCCAT ACTTAACGAA CAAGGGCGAG GCGTTGTCGT TTCTCGTTTA CTGGAGCAGC TACCCGCAAG TCAGGTGTTT TTGGTTGCTT TTGGGGCTAT AGCATTAATT TCATATATTA CCGGACACTG TACTGTGGGT TATGCCCTCG GTTTTGCGAC GCAAAAACGG GCAGATAGTG AGAGTGAACC GGCATTCTGG AACGTGGCAT TTTGGTTGAT TATGACCGGA ATCGTCGCAA TCACACTCTA TCTTCTTGAT GCGCAAAGTC TGCAACCGCT ACAAACGGTC TCTATCCTGG CCGGACTACC GCTTTGCGGC GTAGTGTTTA TTTTATTGAA GAGTTTTTTG ACACAGCTTG CGGCTGAAGA GAAAACCGCG AGAGATGAAT AA
|
Protein sequence | MQYLKKRFSL IELNVFIPAI LFIAVIILCL TIYPQDTSRY INKIHHFLTW EMGGIFLVMT FLVVLCCLWL AFSRYGDILL GQSGEKPDFS LLTWLGLIFT SGTGGSLLYL ASVEWIWIIQ QPPFGATAGS AQAARWASAY GMFHWGPSAW AWYLICAVPI GWFMHVKKTN SLKVSDLCRG CLGARADGFC GHCVNFFYMF GLLGGAVTSL ALGTPMISAV FCHVFHLDPA GQFINVMVIF IWTLVPLLIL FFGLKKGVAW ASNWNIRADI FMLLAILICG PTAFILNQSI DGFGLMLQNF VAMSLSTDAI GRSGFPQMWT VFYFSWWVVY AIPFGLFIAR ISKGRTIRQL IVCGTLAGSL GCMVFYMVLA NFGLSLQTTH VIDFVPILNE QGRGVVVSRL LEQLPASQVF LVAFGAIALI SYITGHCTVG YALGFATQKR ADSESEPAFW NVAFWLIMTG IVAITLYLLD AQSLQPLQTV SILAGLPLCG VVFILLKSFL TQLAAEEKTA RDE
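The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that spacing, compute GC content, and translate the gene sequence so the result can be compared with the listed protein sequence. It is an illustration, not the database's own method. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons; this gene starts with ATG, so the sketch does not handle alternative start codons.

```python
from itertools import product

def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()

def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)

# Codon table in the conventional NCBI ordering (bases T, C, A, G).
# Amino acid assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Assumption: the first codon is ATG. Alternative start codons allowed by
    table 11 (for example GTG or TTG) would need to be mapped to M separately.
    """
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)

# First three blocks of the gene sequence from the record.
first_blocks = "ATGCAATATT TAAAAAAACG CTTTTCTTTG"
print(translate(first_blocks))    # MQYLKKRFSL
print(gc_content("ATGCAATATT"))   # 0.2
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence gives a direct consistency check between the two fields. Calling `gc_content` on the full gene sequence can likewise be compared with the GC content field.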