Gene Information |
Locus tag | EcSMS35_1117 |
Symbol | |
ID | 6142907 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1133553 |
End bp | 1134887 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641615997 |
Product | anaerobic C4-dicarboxylate transporter |
Protein accession | YP_001743189 |
Protein GI | 170682600 |
COG category | [R] General function prediction only |
COG ID | [COG2704] Anaerobic C4-dicarboxylate transporter |
TIGRFAM ID | [TIGR00770] anaerobic c4-dicarboxylate membrane transporter family protein |
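The length fields above follow from the coordinates. The sketch below is a minimal check, assuming the start and end positions are 1-based and inclusive (an assumption; the record does not state its coordinate convention) and that the final codon is a stop codon, which the TGA at the end of the gene sequence supports. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 1133553
end_bp = 1134887

# Assumption: 1-based, inclusive coordinates, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is the stop
# codon and does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1335
print(protein_length)  # 444
```

Both values agree with the Gene Length and Protein Length fields of the record.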
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATATTA TCACTGTTCT TCAGATAGTG GTTCTGCTGG GGGCAATCTT TTTGGGTGTC CGTATGGGCG GGATTGGTAT TGGTTATGCC GGTGGCATTG GGGTATTGAT TCTGGGGCTT TGTCTGGATA TGAAGCCTGG TAATATCCCC TGGGATGTGA TTCTAATTAT TGCATCGGTA ATTTCTGCTA TTTCGGCCAT GCAACTGGCG GGAGGACTGG ACTATCTCGT TCAGGTCGCT GAACGAATTC TTCGTAAAAA TCCAAAATAT ATTAATTACC TGGCTCCGGT TGTCACTTAT GTGCTGACAA TCTTTGCCGG TACGGGACAT ACTGCATTTT CAATGATCCC TGTAATTGTC GAAGTTTCCA AAGAACAAAA CATTAAGCCC TCATGTCCGC TCTCAATTGC GGTTGTATCA TCTCAAATTG CAATTACAGC ATCACCTGTA TCGGCTGCCG TTATCTATAT GTCTGGCGTA CTGGAAGGCT TTGGGTGGAG TTATCCAGTA TTGCTTGGGA TATGGTTGTT TACCACATTC GTTGGTTGCA TGCTTACCGC ATTTATAATC AGTCTGATTT CTGACATGAA ATTAGACAAT GATCCGGTTT ACCGGGAGCG TCTTTCCAAA GGACTCGTAA GCGCGCCAGT AAAGAGTGTT AACAAACAGC TCAAGCCTTA TGCCAGACGA TCTGTCGCAA TTTTCCTTAT TGGCGTCATC CTGGTGGTCC TTTACGCTTC GGCCATTAGC CCGACGCTGG GTCTGATTGA TAACGTTGTT GTTAGTCGTG ATGCGGCTAT TATGAGTCTC ATGCTACTGG TTGGCGGATT CATTACCCTT TTCTGTAAAG CAGATATAAA CAAGATTGCG GACTCCTCTG TGTTCAAGTC AGGCATGGTT GCCTGTATCT GTGTACTGGG TGTGGCATGG TTGGGGGACA CTTTTGTATC TGGTCATTCA GGAGAAATTA AGGAGCTTGC CAGAACTACT GTATCCCAGT ATCCAGCTCT TCTGGCTGTG GTATTTTTCC TGGCTGCGAT GCTTCTGTAT TCACAGGCTG CAACTGCCAA AGCTATCACT CCGGCTATTG TGACTGCATT GGGTATTACT GCAGCGAATC CGGATGACAG TTACATGCTG GTAGCTTCTT TTGCTGCTGT GTCTGCTTTA TTTGTGTTAC CAACTTACCC AACCCTTCTG GGGGCGGTGC AAATGGATGA CACAGGAACG ACCCGTATTG GTAAGTATGT GTTCAACCAT GCTTTCTTTA TTCCGGGTGT ACTGGCTATT GCTTTCTCAG TGCTTCTGGG ATTTCTTGTG GTGAGTATGT TCTGA
|
Protein sequence | MDIITVLQIV VLLGAIFLGV RMGGIGIGYA GGIGVLILGL CLDMKPGNIP WDVILIIASV ISAISAMQLA GGLDYLVQVA ERILRKNPKY INYLAPVVTY VLTIFAGTGH TAFSMIPVIV EVSKEQNIKP SCPLSIAVVS SQIAITASPV SAAVIYMSGV LEGFGWSYPV LLGIWLFTTF VGCMLTAFII SLISDMKLDN DPVYRERLSK GLVSAPVKSV NKQLKPYARR SVAIFLIGVI LVVLYASAIS PTLGLIDNVV VSRDAAIMSL MLLVGGFITL FCKADINKIA DSSVFKSGMV ACICVLGVAW LGDTFVSGHS GEIKELARTT VSQYPALLAV VFFLAAMLLY SQAATAKAIT PAIVTALGIT AANPDDSYML VASFAAVSAL FVLPTYPTLL GAVQMDDTGT TRIGKYVFNH AFFIPGVLAI AFSVLLGFLV VSMF
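The relationship between the two sequences can be illustrated by translating the first 30 bases of the gene sequence. This is a minimal sketch, not the database's own pipeline: `translate`, `CODON_TABLE`, and `GENE_START` are hypothetical names, and the codon table is written out by hand in NCBI's T, C, A, G ordering. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code for elongation; it differs only in which codons may act as starts, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# These assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is removed first, because the record prints the sequence
    in space-separated blocks of ten bases.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
GENE_START = "ATGGATATTA TCACTGTTCT TCAGATAGTG"

print(translate(GENE_START))  # MDIITVLQIV
```

The output matches the first block of the protein sequence. Applying the same function to the last 15 bases, `GTGAGTATGT TCTGA`, gives `VSMF`, the final four residues, with TGA read as the stop codon.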