Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1490 |
Symbol | |
ID | 6146530 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1474498 |
End bp | 1475898 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641616368 |
Product | divalent anion:Na+ symporter (DASS) family protein |
Protein accession | YP_001743548 |
Protein GI | 170683002 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0471] Di- and tricarboxylate transporters |
TIGRFAM ID | [TIGR00785] anion transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTGCT ATTTGAAAGT CCTGATTTGT GCGATTGTCG CACTCGCTAT ATGGTTTTTC CCCGTCCCCG ACGGATTAAC ACCCTTAACC TGGCACATTC TGGCCATATT TATCACCACT GTGGTGGCGT TTATTCTGCA ACCGTTGCCG GTGGGCGCTA TCGCGTTAAT AGCCATCAGT TTTATTATGC TGACCGGTAT GATGAAAACC AGCGAAGCCT TAAAAGGTTT CAGCAGCACG ACGGTATGGC TTATTGTCGC CGCATTCCTG TACGCGAAAG GATTCATCAA AACGGGGTTG GGGCGGCGTA TCGCCTATTT GCTGATTCGC GGATTTGGCG GCAGCTCGTT GCGTCTGGGT TATACATTGG CGCTAAGCGA TATGATTATC GCGCCCGCAA CGCCATCAAA CACAGCGCGA GCAGGAGGGA TTTTGTTCCC GATTGTTCGC AGTGTCTCTA ACAGCTTTGG ATCTGAACCG GATCAAGGGC CGCGTAAAAT AGGCGCTTAT CTGATGCAGA CGGTGTTCCA TTCCAACTGT CTCTCATCCT CGATGTTTAT GACGGCCAGC GCGCCAAATG CACTCATTGT ATCTCTGGCG GCCAGCACTT TATTTGTCGA TATCTCATGG GGGATGTGGA CGCTAAGCGC ACTGGTGCCG GGCATTATCG CTTTTATCAC TATGCCGTTG GTCATTTACA AACTCTACCC GCCAGAGATC AAAAAGACGC CAGAAGCCAA AGCGTTGGCA CAGGAAGAAC TTATCAAAAT GGGGCCGGTA ACCCGTGATG AGCGCGTGAC AATTGGCATT TTCCTGCTTT CATTATTGGC CTGGAGTACA TCGAAATGGA CTGGCCTGGA TGCGACGGCT GTCGCGCTTT CTGGCGTCTG TTTAATGTTG ATGACCCGGA TTATCACCTG GCAGGATGTG CAGTCAGAGA AGGGGGCCTG GGACATTCTG GTCTGGCTGG GAGTCATGAT CTGTATGGCA GATAAGCTCA ATCAGCTTGG CCTGTTCAAA TGGTTTGCCG TCACAACCTC GGCGCTGTTC ACCGGCATTC CGTGGGAGAT AACGTTGACG GTGTTGCTGA TAGTTTACTG CTATTCCCAC TATTTCTTTG CCGGAAGTAC GCCGCATGTA GTGGCGATGT ATGCCGCTTT TGGCAGCGTC TCGGTTGCTG CTGGCGCACC GCCGATGATG GCGGCTCTGT CACTGGCGTT CGTCACTAAC CTGATGAGCG GCATTTCGCA TTATGGCAAC GGACCCGCAG TGATCTATTA CGGCGCTGGC TATGTCTCGC AACGAGAATG GTGGCGGCTG GGCTTTATCG TCATGTTGTT GAACATTGCC ATCTGGTTCG GACTGGGGGC GGTGTGGTGG AAAATTCTTG GCCTGTGGTA A
|
Protein sequence | MSCYLKVLIC AIVALAIWFF PVPDGLTPLT WHILAIFITT VVAFILQPLP VGAIALIAIS FIMLTGMMKT SEALKGFSST TVWLIVAAFL YAKGFIKTGL GRRIAYLLIR GFGGSSLRLG YTLALSDMII APATPSNTAR AGGILFPIVR SVSNSFGSEP DQGPRKIGAY LMQTVFHSNC LSSSMFMTAS APNALIVSLA ASTLFVDISW GMWTLSALVP GIIAFITMPL VIYKLYPPEI KKTPEAKALA QEELIKMGPV TRDERVTIGI FLLSLLAWST SKWTGLDATA VALSGVCLML MTRIITWQDV QSEKGAWDIL VWLGVMICMA DKLNQLGLFK WFAVTTSALF TGIPWEITLT VLLIVYCYSH YFFAGSTPHV VAMYAAFGSV SVAAGAPPMM AALSLAFVTN LMSGISHYGN GPAVIYYGAG YVSQREWWRL GFIVMLLNIA IWFGLGAVWW KILGLW
|
| |