Gene Moth_1817 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1817 
Symbol 
ID3830738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1876434 
End bp1878041 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content61% 
IMG OID637829747 
ProductNa/Pi cotransporter II-related 
Protein accessionYP_430660 
Protein GI83590651 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1283] Na+/phosphate symporter 
TIGRFAM ID[TIGR00704] Na/Pi-cotransporter 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTTTTA CCGTTGCGAC CGGGCTGCTG GGAGGCCTGG CCTTCTTCCT TTATGGCATG 
AACCTCATCA GCACCGGCTT GCAAAAGGCG GCCGCCCGCC AGGTGCAGCA ACTCCTGGGG
ACGGTGACCA GGAATCGTTT CTACGGCATG CTGGCCGGGC TGGTGGTTAC CATCTTCCTG
CAAACCAGCG CCGTCACCAC CGTGCTTCTG GTAGGCTTCG TTAGCGCCGG CCTCATGAGC
CTGGGCCAGG CCCTGGGGGT CATCCTGGGT GCGGCTATCG GCTCCACCCT TACGGCCCAG
CTCATTGCCT TTCGCCTCAG CGACTATTCC CTCTGGGCGG TGGTTGCCGG TCTCATCCCA
TATCTCGCCG CCCGGCGCCT GCGCTGGCGT TACCTTGGCC AGGCCATCCT CGGGTTTGGC
CTTATTTTTT ATGGTGCCGC CCTCATGGGT ACGGCTATAG CGCCCTTTAG CACCATACCC
GGTTTTACCG CCGTCCTGCA GCGCCTGGCG GCCCATCCCT GGATGATGAT GCTGGCGGCC
ACCCTCTTTA CGGCCATTGT CCAGGGCAGC GCCGCCCCGG TGGTGGTGGC CATGACCCTG
GCGGCCCAGG GCGGCCTGGC CCTGGAACCG GCCCTGGCCC TGGTCCTGGG CGCCAACCTG
GGGACCACAG CCACGGCCTT TATCTCCAGC ATCGCCCTTT CCCGGGAGGC CAAGCGAGTA
GCCCTGGCCC ATTTCTTTTT TAAACTCGCG GGCGTTCTCC TCTTTTTACC CTTCCTGGGC
CTCTATGCCG GCCTGTCCCG CCTTACTTCC ACGGACGTTG CCCGCCAGGT GGCCAACGGC
CATACCCTGT TTAACATTAT AAACATGCTG GTCTTTATTC CCTTTACGCC CATGGTAGGC
CGCCTGATGG AAAGGCTCCT GCCCGATGCT CCCGAGGAGG AAAAGGTAGC CAAGTATCTG
GACACCTCCC TTCTGAACGT GCCGGAACTG GCCCTGGCAG GGGTGACCCG GGAACTTCTG
CGCATGGCCG GGATTATCAG GGAGGAGATG TTCCCCCGGG TCATGCGACC CCTGGCCGAA
CGGGAGGTTG ACCTGTTGGA GAAACTGCGC CGCCTGGACG GTACCCTCGA CTACCTCTAC
AAGGCCATCG CCCGGTACCT GGCCAATATG AACCATGATA ACTTGAGTGA GGAACAGATG
GTGGCCCAGA CCAGGCTGCT TTACATTGCC AATGACCTGG AGCATATCGG TGACGTGACT
GTGGAGATGG CCCGCCAGTG GCGCAAGATC GAAACCAGTG GCATAGAGTT TTCCCCTGAG
GGGCAGGCTG AACTCCAGGA GATGTTTACA AGGGTTAAAG AGAACTTCAC CTCCGCCATC
CAGGCCTTCG CCAGTGATGA CCAGGCCCTG GCCGCCCGGG TCATCCGCGG CCACCCGGAG
ATCCTGCGGC TGGAGAAGAA TCTGCGCTAC TCCCATTTCC AGCGCCTGCA GCGGGAGAAT
CGTCTCAGCC TGGAAACCAC CTCCGTCCAT ATGGAGCTCA TTAACCACCT CCTGCGCCTG
AACATCCATA ACGTCAGCAT AGCCCAGGCA GTGATGGGTA TTATTTAA
 
Protein sequence
MLFTVATGLL GGLAFFLYGM NLISTGLQKA AARQVQQLLG TVTRNRFYGM LAGLVVTIFL 
QTSAVTTVLL VGFVSAGLMS LGQALGVILG AAIGSTLTAQ LIAFRLSDYS LWAVVAGLIP
YLAARRLRWR YLGQAILGFG LIFYGAALMG TAIAPFSTIP GFTAVLQRLA AHPWMMMLAA
TLFTAIVQGS AAPVVVAMTL AAQGGLALEP ALALVLGANL GTTATAFISS IALSREAKRV
ALAHFFFKLA GVLLFLPFLG LYAGLSRLTS TDVARQVANG HTLFNIINML VFIPFTPMVG
RLMERLLPDA PEEEKVAKYL DTSLLNVPEL ALAGVTRELL RMAGIIREEM FPRVMRPLAE
REVDLLEKLR RLDGTLDYLY KAIARYLANM NHDNLSEEQM VAQTRLLYIA NDLEHIGDVT
VEMARQWRKI ETSGIEFSPE GQAELQEMFT RVKENFTSAI QAFASDDQAL AARVIRGHPE
ILRLEKNLRY SHFQRLQREN RLSLETTSVH MELINHLLRL NIHNVSIAQA VMGII