Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0842 |
Symbol | |
ID | 6143800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 845326 |
End bp | 846444 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641615730 |
Product | citrate transporter family protein |
Protein accession | YP_001742922 |
Protein GI | 170683327 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0471] Di- and tricarboxylate transporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.917388 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCTGC CTTTTTTACG CACGCTGCAA GGGGATCGTT TTTTTCAGTT ATTAATTCTT GTTGGTATCG GATTAAGCTT TTTCGTGCCC TTTGCACCGA AATCCTGGCC TGCTGCTATC GACTGGCACA CCATCATCAC CTTAAGCGGC CTGATGCTGC TGACCAAAGG TGTGGAGTTA AGCGGTTATT TTGATGTGCT GGGGCGCAAA ATGGTGCGCC GCTTTGCTAC GGAGCGTCGG CTGGCGATGT TTATGGTGCT GGCGGCGGCG CTGCTTTCTA CCTTTCTGAC CAACGATGTC GCGCTGTTTA TAGTTGTTCC GCTGACTATC ACGCTAAAAA GACTGTGTGA GATCCCGGTT AATCGGCTGA TTATTTTTGA GGCGCTGGCA GTCAATGCGG GTTCGCTACT GACGCCAATT GGCAACCCGC AAAATATTCT TATCTGGGGA CGTTCTGGTC TTTCGTTTGC CGGATTTATT GCCCAAATGG CACCGCTGGC TGGCGCAATG ATGCTGACGC TCCTGCTGTT GTGCTGGTGT TGTTTCCCTG GAAAGGCACT CCAATACCAT ACGGGGGTGC AAACACCGGA GTGGAAACCG CGGCTGGTGT GGAGTTGTCT GGGGCTGTAT ATCGTCTTTC TGACGGCGCT GGAGTTAAAA CAAGAGCTGT GGGGACTGGT GATTGTGGCG GCGGGCTTTG CGCTGCTGGC ACGTCGCGTG GTGTTGAGTG TGGACTGGAC GCTGCTGCTG GTATTTATGG CGATGTTTAT CGACGTCCAT TTGCTGACCC AACTGCCAGC TTTGCAGGGC GTGTTGAGCA ATGTGAGCCA TTTATCCGAA CCCGGATTAT GGTTAACGGC AATCGGTTTA TCGCAGGTGA TCAGTAATGT GCCGAGTACC ATATTGTTGC TGAACTATGT GCCGCCGTCG TTGCTGCTGG CATGGGCGGT AAACGTGGGG GGCTTTGGTT TATTACCCGG ATCGCTGGCA AATTTGATTG CGTTACGGAT GGCGAACGAT CGCCGTATCT GGTGGCGTTT TCATCTTTAT TCGATACCGA TGCTGTTGTG GGCGGCGCTG GTGGGATACG TTTTATTCGT TATGCTCCCG GCCAACTAG
|
Protein sequence | MSLPFLRTLQ GDRFFQLLIL VGIGLSFFVP FAPKSWPAAI DWHTIITLSG LMLLTKGVEL SGYFDVLGRK MVRRFATERR LAMFMVLAAA LLSTFLTNDV ALFIVVPLTI TLKRLCEIPV NRLIIFEALA VNAGSLLTPI GNPQNILIWG RSGLSFAGFI AQMAPLAGAM MLTLLLLCWC CFPGKALQYH TGVQTPEWKP RLVWSCLGLY IVFLTALELK QELWGLVIVA AGFALLARRV VLSVDWTLLL VFMAMFIDVH LLTQLPALQG VLSNVSHLSE PGLWLTAIGL SQVISNVPST ILLLNYVPPS LLLAWAVNVG GFGLLPGSLA NLIALRMAND RRIWWRFHLY SIPMLLWAAL VGYVLFVMLP AN
|
| |