Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2267 |
Symbol | |
ID | 3831378 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2373679 |
End bp | 2375229 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637830187 |
Product | Sodium/sulphate symporter |
Protein accession | YP_431097 |
Protein GI | 83591088 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0471] Di- and tricarboxylate transporters |
TIGRFAM ID | [TIGR00785] anion transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0413087 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAAGCT GTAACCTGCT CCCTGCTCCT TATATCTTAA TAATAAAATC ACCCTTTGGC AACGACAATT CAACTAATTG CAAAGGCGGG GATAATATGG CCACTGCCAC TCAAACCGGC ACACAGGCGG TAGCAACCGG CAAAAAAGCT GGTACCAGGT GGGCTTATTT TATAATCGGC CTGGCTGTCC TGGGCCTCTT CTATATCTTA CCTGTTCCGG CAGGTATGAA ACCTGCGGCC ATGCGCACCC TGGGGGTAAT GGCGACCACA GTTTTCTGGT GGCTGACGGA GACGCTGCCC ATTCCAGTTA CGGCCTTAAT GGTGCCGTTA ATGCTTCATT TTACCGGAAT CCTTAGTCTG GATAAATCTG TAGCCCAGAG TTTTGGTGAT AGTTTTGTTC CTTTTTTGAT AGGTGTTCTG GCCCTGTCGG TGGCCTTTAC CATGTCCGGC CTGGGTAAAC GGATCACTTA TCTGCTGCTG GCCCTTTCAG GTACCAAAGT TAGCCGGGTA ATCGGCATTT ACTTCCTGGT ATCCTTTGTA ATCTCCATGT TCGTCACCGA TGTGGCCGTG GTGGCGATGA TGCTGCCGAT CGTAGTAGGC TTGTTACAAT CCGTGGATGC CAAACCGGGG GAGAGTAACC TGGGCCGGGG CTTGATGATG GCCATTATGT TTGGTTCTAC CCTGGGCGGT ATTTGTACAC CTTCGGGGGT AGCTTCCAAT GTCATTACCA TGAGTTTTCT GACAAAAAAC GCCAAAATAG GAGTATCGTT TCTTGACTGG GTAGCTATTG CTACACCTAT CTTCGTAGCA GTCGGCATTA TTGCCTGGTG GCTGATTCTC AAGATCTTTC CGCCGGAAAT TAAAGAATTG CCCTACGGCA AAGATATGAT TCACAAAGAA CTCAAGGGAA TGGGCTCCTG GTCCATCGAG GAAATAACTA CCATGGTTGT TTTTCTCCTG GCGGTAGTCC TCTGGCTGAC CAGTTCATGG AATGGATTGC CGATAGCTTT TGTCTCCCTT CTCATCCTGG GACTTTTATC AATGCCGGGT GTTGGAGTTT TCAAGAAATG GAGCGATGTG GAGAAACGCC TGGAATGGGG GGCCCTGATG CTTGTAGTAG GCGGCTTCGC CCTGGGCCTG GCAGCCAGCC AGAGTGGCCT GGCCCAGTGG GTGGCCCAGC ACGCCCTGAA ACCCATGACT ATTCTTCCCC GGCCGCTGCA GCCACTGGCG GTGACCCTGT TGGTAGCAGT GGATTCCCTG GGCTTCTCCA GCTTTACAGC AGCTGCTTCG GTTAATGTAC CCTTTATCAT TGCCTACGCC CAGCAGAACG CCCTGCCAGT GCTGTCGATG GCCCTGGCGG CCGGTTTCGC CGCTTCCACC CATTTCATCC TGGTTACCGA GAGCCCGTCC TTTGTGTTAC CCTATGCTTA CGGGTACTTT AGTTTTAAGG ACCTTTTCAA GATTGGCGTG ATTCTAACCA TAGTGAGCGC CGGGGCCATT GCCATCGGCC TGGTTCTCGC CGGCATGCCG GCAGGTGTAC CGTTGCATTA A
|
Protein sequence | MVSCNLLPAP YILIIKSPFG NDNSTNCKGG DNMATATQTG TQAVATGKKA GTRWAYFIIG LAVLGLFYIL PVPAGMKPAA MRTLGVMATT VFWWLTETLP IPVTALMVPL MLHFTGILSL DKSVAQSFGD SFVPFLIGVL ALSVAFTMSG LGKRITYLLL ALSGTKVSRV IGIYFLVSFV ISMFVTDVAV VAMMLPIVVG LLQSVDAKPG ESNLGRGLMM AIMFGSTLGG ICTPSGVASN VITMSFLTKN AKIGVSFLDW VAIATPIFVA VGIIAWWLIL KIFPPEIKEL PYGKDMIHKE LKGMGSWSIE EITTMVVFLL AVVLWLTSSW NGLPIAFVSL LILGLLSMPG VGVFKKWSDV EKRLEWGALM LVVGGFALGL AASQSGLAQW VAQHALKPMT ILPRPLQPLA VTLLVAVDSL GFSSFTAAAS VNVPFIIAYA QQNALPVLSM ALAAGFAAST HFILVTESPS FVLPYAYGYF SFKDLFKIGV ILTIVSAGAI AIGLVLAGMP AGVPLH
|
| |