Gene Information |
Locus tag | EcSMS35_0793 |
Symbol | |
ID | 6142643 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 793898 |
End bp | 795331 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641615681 |
Product | anion transporter |
Protein accession | YP_001742873 |
Protein GI | 170683868 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0471] Di- and tricarboxylate transporters |
TIGRFAM ID | [TIGR00785] anion transporter |
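
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and are not part of the record or of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming an unspliced CDS with one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(793898, 795331)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (1434 bp) and Protein Length (477 aa).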
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.575691 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAGA AATCGTTATG GAAGCTAATT CTGATATTAG CAATCCCATG TATTATTGGC TTTATGCCAG CTCCAGCAGG ATTAAGCGAA CTGGCGTGGG TGCTTTTTGG TATTTACCTG GCGGCCATTG TGGGGCTGGT TATCAAGCCT TTCCCGGAAC CTGTCGTACT GTTAATTGCC GTCGCCGCAT CGATGGTAGT GGTGGGTAAC TTATCAGGTG GGGAATTTAA AACCACCGCT GTATTAAGCG GTTACTCTTC CGGTACCACC TGGCTGGTGT TTTCTGCGTT TACTTTAAGC GCCGCGTTTG TAACCACAGG CTTAGGTAAA CGTATTGCCT ATCTGCTGAT TGGTAAAATT GGTAGCACTA CCCTGGGTCT GGGTTACGTT ACGGTATTTC TCGATCTGGT ACTGGCTCCG GCAACACCGT CTAACACCGC GCGTGCGGGC GGCATCGTGT TACCGATCAT CAACAGCGTG GCAGTGGCTT TGGGATCAGA ACCGGAAAAA AGTCCGCGTC GTGTTGGACA TTACCTGATG ATGTCCATTT ACATGGTCAC CAAAACCACC AGCTATATGT TCTTTACCGC AATGGCGGGG AACATTCTGG CGCTGAAAAT GATCAACGAC ATTCTGCACC TGCAAATTAG CTGGGGTGGA TGGGCGCTAG CCGCCGGATT GCCTGGCATC ATTATGCTGC TGGTCACCCC GCTGGTGATT TACACCATGT ATCCGCCAGA AATTAAGAAG GTGGATAACA AAACCATCGC CAAAGCGGGC CTTGCCGAAC TGGGACCGAT GAAAATCCGC GAAAAAATGC TGCTCGGTGT CTTCGTGCTG GCGCTGCTGG GCTGGATTTT CAGTAAGTCA CTGGGGGTTG ATGAATCCAC CGTGGCAATC GTTGTTATGG CGACTATGCT GCTGCTGGGT ATCGTTACCT GGGAAGACGT GGTTAAAAAT AAAGGCGGCT GGAATACCTT AATCTGGTAC GGCGGTATTA TCGGCTTAAG CTCCTTATTA TCGAAAGTTA AATTCTTCGA ATGGTTAGCT GAAGTCTTTA AAAATAACCT GGCATTTGAT GGTCACGGTA ACGTTGCTTT CTTCGTTATT ATTTTCCTCA GCATCATCGT GCGTTATTTC TTCGCTTCCG GTAGTGCCTA TATCGTTGCC ATGTTACCGG TATTTGCCAT GCTGGCGAAC GTCTCCGGCG CGCCGTTAAT GTTAACCGCG CTGGCACTGT TGTTCTCTAA CTCCTATGGC GGCATGGTTA CTCACTATGG CGGCGCGGCA GGTCCGGTCA TCTTTGGCGT GGGTTACAAC GATATTAAAT CCTGGTGGTT GGTCGGTGCG GTACTGACGA TATTAACCTT CCTGGTGCAT ATCACCCTCG GCGTGTGGTG GTGGAATATG CTGATCGGCT GGAACATGCT GTAA
|
Protein sequence | MNKKSLWKLI LILAIPCIIG FMPAPAGLSE LAWVLFGIYL AAIVGLVIKP FPEPVVLLIA VAASMVVVGN LSGGEFKTTA VLSGYSSGTT WLVFSAFTLS AAFVTTGLGK RIAYLLIGKI GSTTLGLGYV TVFLDLVLAP ATPSNTARAG GIVLPIINSV AVALGSEPEK SPRRVGHYLM MSIYMVTKTT SYMFFTAMAG NILALKMIND ILHLQISWGG WALAAGLPGI IMLLVTPLVI YTMYPPEIKK VDNKTIAKAG LAELGPMKIR EKMLLGVFVL ALLGWIFSKS LGVDESTVAI VVMATMLLLG IVTWEDVVKN KGGWNTLIWY GGIIGLSSLL SKVKFFEWLA EVFKNNLAFD GHGNVAFFVI IFLSIIVRYF FASGSAYIVA MLPVFAMLAN VSGAPLMLTA LALLFSNSYG GMVTHYGGAA GPVIFGVGYN DIKSWWLVGA VLTILTFLVH ITLGVWWWNM LIGWNML
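
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a sketch only: it assumes the sequences are shown in space-separated blocks of ten characters, and it builds the codon table from the standard amino acid string for translation table 11, whose codon assignments match the standard genetic code. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
prefix = "ATGAATAAGA AATCGTTATG GAAGCTAATT"
print(translate(prefix))  # MNKKSLWKLI, the first block of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the listed protein sequence and GC content to be checked in the same way.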