Gene Moth_0569 details


Gene Information

Locus tag: Moth_0569
Symbol:
ID: 3832482
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 591931
End bp: 593439
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 53%
IMG OID: 637828510
Product: hypothetical protein
Protein accession: YP_429442
Protein GI: 83589433
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1055] Na+/H+ antiporter NhaD and related arsenite permeases
TIGRFAM ID:
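
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below checks that arithmetic, assuming the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon. The function names are illustrative and are not part of the database.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 591931
END_BP = 593439


def gene_length(start: int, end: int) -> int:
    """Length in bp of an inclusive coordinate range."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp)                  # 1509
    print(protein_length(length_bp))  # 502
```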


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0080395
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTTTTT ATTATGGAAT TGGCCCGAAA ATTGTATTCT CAGAGAAGGG ACAGGTGGAG 
ATGAGCATTA AAAGAGTTGC GGTATTAACC TTGCTTCTTT GCGGCTCTTT TTTGCCCCTC
GGGGGGGTGG CATGGGCGGC GACAGGAAAC GAGCTGGGTA AAATGTTACC TTTATGGAGC
GTATTACCCT TTGTAGGGAT GCTCCTGTCC ATTGCCATCT GGCCGCTGGT CAACGCCTCC
TGGTGGGAGC ACAACATGGG GAAGGTAAGC TTGTTCTGGT CCCTGGTTTT CTTTGTACCG
TTCTTAATTG CCTTCGGGAG CGGTACGGCC TTTACCCAGG CAGTGGAAGT GTATCTCCTG
GATTACCTGC CCTTTATCAT CCTTCTCTTC GGTCTCTTCG TAGTGGCAGG GGGAATCATT
CTTCGTGGTA CCCTGCGGGG TAATCCGGGT GTGAACGTCC TGCTCCTGCT GGTGGGCACG
ATCCTCTCAA GCTGGATTGG GACCACCGGA GCCAGCATGC TCATGATCCG GCCGGTTATC
CGGGCCAACG AGTGGCGGCG TTACAAGGCC CACATAATCA TCTTTTTTAT CTTCCTCATT
TCCAATATCG GCGGGGCCCT CACCCCCGTA GGCGACCCGC CCCTGTTCCT GGGTTATTTA
CGGGGGGTGC CCTTCTTCTG GACCATGAGG TTAATCCTGC CCATGGGATT TAATGTACTA
ATCCTCCTCA CCCTTTACTA CTTCCTGGAT AGTTACTTTT ACCGTAAAGA AAAGGTGCCC
CGGGCAAAGG GCGGGCAGGA CCCCTTGCGG GTAGAGGGTT TACAGAATCT TATTTACCTG
GGTATCATTG TCGGCGCTGT TATTTTAAGT GGTATCCTGG CCAAGAACCC GGCCTTTGCC
GACCAGCAAA CCGGGAACCT TTACGGCATT ACTATCTTCC GCCACGGCGA AGAAGCAGTG
GTGCTGCCCT ACACCAACAT AATCCGCGAC CTGGCCATTC TCCTGGCGGC TTTCCTGTCA
TGGAAGACTA CTTCTATGGA TATTCGTAAA GATAATCGCT TTACCTGGGG TCCCATCAAG
GAAGTAGCCA TCCTTTTTGC CGGTATCTTT ATGACCATGA TTCCGGCCCT GGCCATCCTC
CACGCCCGCG GCGCGGAACT GGGCTTAACT CACCCGGCCC AATTTTTCTG GGCCACGGGA
GCCCTTTCCA GCTTCCTGGA TAACGCCCCG ACGTACCTGG TATTCCTGAC CACCGCTACC
AGCCTGGGGG CAACCACCGG GGTGCCTACC ACCCTGGGTG TCGTGGCTCC CAAGATGCTC
CTGGCCATAT CCTGCGGCGC CGTGTTTATG GGTGCCAATA CTTATATCGG TAATGCCCCC
AACTTTATGG TGCGGTCAAT AGCGGAAGAG AATAATATTC GCATGCCCAG CTTTTTCGGC
TACATGGGCT GGTCGATAGG GATTTTAATT CCTTTATTTA TCCTGGACAC CTTGATTTTC
TTCCGTTAA
 
Protein sequence
MFFYYGIGPK IVFSEKGQVE MSIKRVAVLT LLLCGSFLPL GGVAWAATGN ELGKMLPLWS 
VLPFVGMLLS IAIWPLVNAS WWEHNMGKVS LFWSLVFFVP FLIAFGSGTA FTQAVEVYLL
DYLPFIILLF GLFVVAGGII LRGTLRGNPG VNVLLLLVGT ILSSWIGTTG ASMLMIRPVI
RANEWRRYKA HIIIFFIFLI SNIGGALTPV GDPPLFLGYL RGVPFFWTMR LILPMGFNVL
ILLTLYYFLD SYFYRKEKVP RAKGGQDPLR VEGLQNLIYL GIIVGAVILS GILAKNPAFA
DQQTGNLYGI TIFRHGEEAV VLPYTNIIRD LAILLAAFLS WKTTSMDIRK DNRFTWGPIK
EVAILFAGIF MTMIPALAIL HARGAELGLT HPAQFFWATG ALSSFLDNAP TYLVFLTTAT
SLGATTGVPT TLGVVAPKML LAISCGAVFM GANTYIGNAP NFMVRSIAEE NNIRMPSFFG
YMGWSIGILI PLFILDTLIF FR
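
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below shows one way to reproduce that translation and to compute a GC fraction, assuming the sequence is pasted in the 10-base blocks shown above. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The function names are illustrative, and only the first and last few codons of the record are used as sample input.

```python
from itertools import product

# Codon table built in TCAG order; table 11 shares these amino-acid
# assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace from a block-formatted sequence and upper-case it."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons; a trailing stop codon is dropped."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))
    return protein[:-1] if protein.endswith("*") else protein


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three blocks and the final block of the gene sequence above.
    print(translate("ATGTTTTTTT ATTATGGAAT TGGCCCGAAA"))  # MFFYYGIGPK
    print(translate("TTCCGTTAA"))                         # FR
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the protein sequence and GC content listed in the record.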