Gene Moth_2267 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2267 
Symbol 
ID3831378 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2373679 
End bp2375229 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content52% 
IMG OID637830187 
ProductSodium/sulphate symporter 
Protein accessionYP_431097 
Protein GI83591088 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0471] Di- and tricarboxylate transporters 
TIGRFAM ID[TIGR00785] anion transporter 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0413087 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAAGCT GTAACCTGCT CCCTGCTCCT TATATCTTAA TAATAAAATC ACCCTTTGGC 
AACGACAATT CAACTAATTG CAAAGGCGGG GATAATATGG CCACTGCCAC TCAAACCGGC
ACACAGGCGG TAGCAACCGG CAAAAAAGCT GGTACCAGGT GGGCTTATTT TATAATCGGC
CTGGCTGTCC TGGGCCTCTT CTATATCTTA CCTGTTCCGG CAGGTATGAA ACCTGCGGCC
ATGCGCACCC TGGGGGTAAT GGCGACCACA GTTTTCTGGT GGCTGACGGA GACGCTGCCC
ATTCCAGTTA CGGCCTTAAT GGTGCCGTTA ATGCTTCATT TTACCGGAAT CCTTAGTCTG
GATAAATCTG TAGCCCAGAG TTTTGGTGAT AGTTTTGTTC CTTTTTTGAT AGGTGTTCTG
GCCCTGTCGG TGGCCTTTAC CATGTCCGGC CTGGGTAAAC GGATCACTTA TCTGCTGCTG
GCCCTTTCAG GTACCAAAGT TAGCCGGGTA ATCGGCATTT ACTTCCTGGT ATCCTTTGTA
ATCTCCATGT TCGTCACCGA TGTGGCCGTG GTGGCGATGA TGCTGCCGAT CGTAGTAGGC
TTGTTACAAT CCGTGGATGC CAAACCGGGG GAGAGTAACC TGGGCCGGGG CTTGATGATG
GCCATTATGT TTGGTTCTAC CCTGGGCGGT ATTTGTACAC CTTCGGGGGT AGCTTCCAAT
GTCATTACCA TGAGTTTTCT GACAAAAAAC GCCAAAATAG GAGTATCGTT TCTTGACTGG
GTAGCTATTG CTACACCTAT CTTCGTAGCA GTCGGCATTA TTGCCTGGTG GCTGATTCTC
AAGATCTTTC CGCCGGAAAT TAAAGAATTG CCCTACGGCA AAGATATGAT TCACAAAGAA
CTCAAGGGAA TGGGCTCCTG GTCCATCGAG GAAATAACTA CCATGGTTGT TTTTCTCCTG
GCGGTAGTCC TCTGGCTGAC CAGTTCATGG AATGGATTGC CGATAGCTTT TGTCTCCCTT
CTCATCCTGG GACTTTTATC AATGCCGGGT GTTGGAGTTT TCAAGAAATG GAGCGATGTG
GAGAAACGCC TGGAATGGGG GGCCCTGATG CTTGTAGTAG GCGGCTTCGC CCTGGGCCTG
GCAGCCAGCC AGAGTGGCCT GGCCCAGTGG GTGGCCCAGC ACGCCCTGAA ACCCATGACT
ATTCTTCCCC GGCCGCTGCA GCCACTGGCG GTGACCCTGT TGGTAGCAGT GGATTCCCTG
GGCTTCTCCA GCTTTACAGC AGCTGCTTCG GTTAATGTAC CCTTTATCAT TGCCTACGCC
CAGCAGAACG CCCTGCCAGT GCTGTCGATG GCCCTGGCGG CCGGTTTCGC CGCTTCCACC
CATTTCATCC TGGTTACCGA GAGCCCGTCC TTTGTGTTAC CCTATGCTTA CGGGTACTTT
AGTTTTAAGG ACCTTTTCAA GATTGGCGTG ATTCTAACCA TAGTGAGCGC CGGGGCCATT
GCCATCGGCC TGGTTCTCGC CGGCATGCCG GCAGGTGTAC CGTTGCATTA A
 
Protein sequence
MVSCNLLPAP YILIIKSPFG NDNSTNCKGG DNMATATQTG TQAVATGKKA GTRWAYFIIG 
LAVLGLFYIL PVPAGMKPAA MRTLGVMATT VFWWLTETLP IPVTALMVPL MLHFTGILSL
DKSVAQSFGD SFVPFLIGVL ALSVAFTMSG LGKRITYLLL ALSGTKVSRV IGIYFLVSFV
ISMFVTDVAV VAMMLPIVVG LLQSVDAKPG ESNLGRGLMM AIMFGSTLGG ICTPSGVASN
VITMSFLTKN AKIGVSFLDW VAIATPIFVA VGIIAWWLIL KIFPPEIKEL PYGKDMIHKE
LKGMGSWSIE EITTMVVFLL AVVLWLTSSW NGLPIAFVSL LILGLLSMPG VGVFKKWSDV
EKRLEWGALM LVVGGFALGL AASQSGLAQW VAQHALKPMT ILPRPLQPLA VTLLVAVDSL
GFSSFTAAAS VNVPFIIAYA QQNALPVLSM ALAAGFAAST HFILVTESPS FVLPYAYGYF
SFKDLFKIGV ILTIVSAGAI AIGLVLAGMP AGVPLH