Gene EcSMS35_1490 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1490 
Symbol 
ID6146530 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1474498 
End bp1475898 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content52% 
IMG OID641616368 
Productdivalent anion:Na+ symporter (DASS) family protein 
Protein accessionYP_001743548 
Protein GI170683002 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0471] Di- and tricarboxylate transporters 
TIGRFAM ID[TIGR00785] anion transporter 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTGCT ATTTGAAAGT CCTGATTTGT GCGATTGTCG CACTCGCTAT ATGGTTTTTC 
CCCGTCCCCG ACGGATTAAC ACCCTTAACC TGGCACATTC TGGCCATATT TATCACCACT
GTGGTGGCGT TTATTCTGCA ACCGTTGCCG GTGGGCGCTA TCGCGTTAAT AGCCATCAGT
TTTATTATGC TGACCGGTAT GATGAAAACC AGCGAAGCCT TAAAAGGTTT CAGCAGCACG
ACGGTATGGC TTATTGTCGC CGCATTCCTG TACGCGAAAG GATTCATCAA AACGGGGTTG
GGGCGGCGTA TCGCCTATTT GCTGATTCGC GGATTTGGCG GCAGCTCGTT GCGTCTGGGT
TATACATTGG CGCTAAGCGA TATGATTATC GCGCCCGCAA CGCCATCAAA CACAGCGCGA
GCAGGAGGGA TTTTGTTCCC GATTGTTCGC AGTGTCTCTA ACAGCTTTGG ATCTGAACCG
GATCAAGGGC CGCGTAAAAT AGGCGCTTAT CTGATGCAGA CGGTGTTCCA TTCCAACTGT
CTCTCATCCT CGATGTTTAT GACGGCCAGC GCGCCAAATG CACTCATTGT ATCTCTGGCG
GCCAGCACTT TATTTGTCGA TATCTCATGG GGGATGTGGA CGCTAAGCGC ACTGGTGCCG
GGCATTATCG CTTTTATCAC TATGCCGTTG GTCATTTACA AACTCTACCC GCCAGAGATC
AAAAAGACGC CAGAAGCCAA AGCGTTGGCA CAGGAAGAAC TTATCAAAAT GGGGCCGGTA
ACCCGTGATG AGCGCGTGAC AATTGGCATT TTCCTGCTTT CATTATTGGC CTGGAGTACA
TCGAAATGGA CTGGCCTGGA TGCGACGGCT GTCGCGCTTT CTGGCGTCTG TTTAATGTTG
ATGACCCGGA TTATCACCTG GCAGGATGTG CAGTCAGAGA AGGGGGCCTG GGACATTCTG
GTCTGGCTGG GAGTCATGAT CTGTATGGCA GATAAGCTCA ATCAGCTTGG CCTGTTCAAA
TGGTTTGCCG TCACAACCTC GGCGCTGTTC ACCGGCATTC CGTGGGAGAT AACGTTGACG
GTGTTGCTGA TAGTTTACTG CTATTCCCAC TATTTCTTTG CCGGAAGTAC GCCGCATGTA
GTGGCGATGT ATGCCGCTTT TGGCAGCGTC TCGGTTGCTG CTGGCGCACC GCCGATGATG
GCGGCTCTGT CACTGGCGTT CGTCACTAAC CTGATGAGCG GCATTTCGCA TTATGGCAAC
GGACCCGCAG TGATCTATTA CGGCGCTGGC TATGTCTCGC AACGAGAATG GTGGCGGCTG
GGCTTTATCG TCATGTTGTT GAACATTGCC ATCTGGTTCG GACTGGGGGC GGTGTGGTGG
AAAATTCTTG GCCTGTGGTA A
 
Protein sequence
MSCYLKVLIC AIVALAIWFF PVPDGLTPLT WHILAIFITT VVAFILQPLP VGAIALIAIS 
FIMLTGMMKT SEALKGFSST TVWLIVAAFL YAKGFIKTGL GRRIAYLLIR GFGGSSLRLG
YTLALSDMII APATPSNTAR AGGILFPIVR SVSNSFGSEP DQGPRKIGAY LMQTVFHSNC
LSSSMFMTAS APNALIVSLA ASTLFVDISW GMWTLSALVP GIIAFITMPL VIYKLYPPEI
KKTPEAKALA QEELIKMGPV TRDERVTIGI FLLSLLAWST SKWTGLDATA VALSGVCLML
MTRIITWQDV QSEKGAWDIL VWLGVMICMA DKLNQLGLFK WFAVTTSALF TGIPWEITLT
VLLIVYCYSH YFFAGSTPHV VAMYAAFGSV SVAAGAPPMM AALSLAFVTN LMSGISHYGN
GPAVIYYGAG YVSQREWWRL GFIVMLLNIA IWFGLGAVWW KILGLW