Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0746 |
Symbol | |
ID | 6144114 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 753570 |
End bp | 754850 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641615635 |
Product | dicarboxylate/amino acid:cation family protein |
Protein accession | YP_001742834 |
Protein GI | 170680220 |
COG category | [C] Energy production and conversion |
COG ID | [COG1301] Na+/H+-dicarboxylate symporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.579753 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAA TAAGTTTAAC CACGATGATT CTTTTGGCGC TGGTACTTGG AATGATTATC GGCGTAGTGC TCAATAACAC TGCTTCACCG GAAACCGCAA AACTCTATGC GCAAGAAATA TCGATATTCA CGACGATTTT CTTACGACTG ATAAAAATGA TTATCGCTCC GTTAGTGGTC TCTACCCTGG TGGTAGGTAT AGCTAAAATG GGAGATGCCA AAGCCCTTGG TCGTATTTTT TCTAAAACAC TCTTTTTATT TATTTGCGCC TCATTGCTGT CAATCGCCTT AGGCTTGATA ACAGTAAATT TCTTCATGCC AGGCACAGGA ATTAATTTTG TTGCACACGG AGCCGAAACC ACCGGAGTGG TCGCGTCAGA ACCCTTTACG CTAAAAGTAT TTATTTCGCA TGCTTTCCCC ACCAGCATTG TCGATGCCAT GGCGCACAAT GAAATTTTGC AAATCGTGGT GTTCTCAATT TTCCTCGGCT GTAGCCTGAC GGCGATTGGT GAGAAAGGCA GCGCCATCGT TCACGCCTTA GATTCGCTGG CACATGCCAT GTTAAAGCTC ACTGGCTACG TCATGCTCTT CGCTCCCCTG ACCGTATTCG CCGCTATTTC AGCATTGATT GCTGAACGAG GACTGGCAGT TATGGTGAGC GCCGGGATCT TTATGGGTGA ATTTTATTTC ACCATGTTGT TACTTTGGGT GCTGCTTATC GGTCTGGCCA TCGTTTATGT CGGCCCCTGC ATCAGACGCC TGACCCGTGC CCTTTCGGAA CCCGCCCTGC TGGCATTTAC CACATCCAGT TCTGAAGCGG CTTTTCCGGG AACGCTTGAA AAACTGGAGC AATTTGGCGT TTCCCCCAAA ATTGCCAGCT TTGTCTTACC CATTGGCTAC TCATTTAATC TCGTTGGATC AATGGCCTAC TGCTCCTTCG CCACAGTTTT CATCGCCCAG GCCTGCAATA TCCATTTATC CATCGGTGAG CAAATCACCA TGCTGTTGAT CCTGATGTTG ACCTCGAAAG GAATGGCTGG CGTACCACGC GCCTCAATGG TGGTTATCGC CGCCACGCTC AACCAGTTCA ATATTCCGGA AGCGGGGCTG ATCTTGCTGA TGGGCGTTGA TCCGTTCCTT GATATGGGGC GTTCCGCGAC AAACGTCATG AGCAACGCAA TGGGCGCTGC GATGGTGAGT CGGTGGGAAG GCGAACATTT CGGCGAGGGC TGTCGGGGTA AAGCATTAAA ACCCAATGAA TCGAACGTTG CTCTGCCCTG A
|
Protein sequence | MKKISLTTMI LLALVLGMII GVVLNNTASP ETAKLYAQEI SIFTTIFLRL IKMIIAPLVV STLVVGIAKM GDAKALGRIF SKTLFLFICA SLLSIALGLI TVNFFMPGTG INFVAHGAET TGVVASEPFT LKVFISHAFP TSIVDAMAHN EILQIVVFSI FLGCSLTAIG EKGSAIVHAL DSLAHAMLKL TGYVMLFAPL TVFAAISALI AERGLAVMVS AGIFMGEFYF TMLLLWVLLI GLAIVYVGPC IRRLTRALSE PALLAFTTSS SEAAFPGTLE KLEQFGVSPK IASFVLPIGY SFNLVGSMAY CSFATVFIAQ ACNIHLSIGE QITMLLILML TSKGMAGVPR ASMVVIAATL NQFNIPEAGL ILLMGVDPFL DMGRSATNVM SNAMGAAMVS RWEGEHFGEG CRGKALKPNE SNVALP
|
| |