Gene Moth_0243 details

Gene Information

Locus tag: Moth_0243
Symbol:
ID: 3833206
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 244931
End bp: 246988
Gene Length: 2058 bp
Protein Length: 685 aa
Translation table: 11
GC content: 66%
IMG OID: 637828179
Product: hypothetical protein
Protein accession: YP_429121
Protein GI: 83589112
COG category:
COG ID:
TIGRFAM ID:
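
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 244931
END_BP = 246988
GENE_LENGTH_BP = 2058
PROTEIN_LENGTH_AA = 685


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length):
    """Residue count for a CDS, assuming the stop codon is part of the CDS."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the reported gene length and protein length are consistent with the start and end coordinates.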


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACCTTTA AAAGGCTTGT GCTGGTAGTG GCGGCGGTGA TGATGCTGCT CCTGGGCGCG 
ACATCGGCCT ACGCCGCCTA CACCATCAAT TTAAACGAGC CGGTAGTCAG CACCAGCGCC
CCGCCCTACG CCACCGTGAC GGCTGGAGGC TCGGTCCAGG CTGACGGCCA GCCAGCGGTG
GGCGTGCCCG TGGCCCTACG CCTGAGCGGC GCCGACGGCC AGCTCCTGGC CGCCGACCAG
GTGGCGACCG ACAGCTCAGG CTCCTTCAGC GAGCCCCTGC CCCTGCCTGC GGGCACTAAC
AGCGGGAGCG CCACCCTGGT GGCCGCTGCG GCCGGGGCGA TAGCCCAGCA GGTTGTCGAG
ATTCCCCCCC TGCCCGCGGC TTACTACGGC AGCGTTAAAA ACTCCGGCGG CTCAAACGTC
GCCGCCGGCA GCGTCGAGGC CTATATCGAC GGCCAGAAGG TCGGCTCCCT GGACTTCCAG
AACGGCTTCT ACGGAGGTTC CGGCGGCCTC GACCCCAAGC TGGTCGCCAG CGGGACCGAT
GCTGATAAAG GTAAAACCGT TACCTTCAAA GTCGTCATCA ACGGCCAGGC TTACGACGCC
ACCCCCGCCT CCCCGGTAAC GTGGGAGCCG GGCGACGTGC GGCAGGTCGA CCTGAGCATT
ACGGCCCAGC AGCCGCCTGC GGGCGATCAA GTACCGCCGG CAGTTACCAC TACCGATCCG
GCTAACGGAG CCACGGGCGT TGCTGTAACC AAAGCTGTAG CCATCACCTT CAGCGAAAAT
ATCCAGGCCG GCGCGGCTTA CGACAGCATC GCTTTGAAAG ACGGCTCCGG CAACAGCGTG
GCCTTCACCA AGAGCATAAG CGGCAGCGTC CTTACTATCA CGCCGTCCGC CAACCTGGCC
TATAGCACCA TTTACACCGT GAGCCTCCCG GCGGCGGCGG TCCAGGACCT GGCCGGAAAC
GCCCTGGCCA GCCCCTTCAC CTTTAGCTTC ACCACCCAGG CCGCGTCCAG CGGCGGAGGC
GGCGGTGGCG GTGGTGGGGG AGGCGGCGGC GGCGGCTCGA CCACCCCTTC CACCAACCGG
GTAGAGCAGC CCATCCAGGC CGGCGCAGCT ACGGTAGCTG AACTCGCCGG CGTCGCCCGC
GTTGACGTGC CGGCCGGGGC GGTTTCCGGC ACGGGCGCGA CCATCGCCGC CGAGGTAAAA
GACGACTCCC TGGCCGCCCA GGCCGGTATG ACCCTCATCA GCAAAGTCGT CGACGTGACC
CTGAACAACG GTACCCTGAA CGGGCAGCTA ACCATCACCC TCTACTACGA TAAAAGCAAG
CTCGCCGCCG ATCAGCAGCC TGTGGCCTGC TATTACGACG CCAGCCAGGG GCAGTGGGTG
CGCCTTGAAG GTAGCGTGGA CGAAAACGCC GGCACGGTGG CGGTCACTGT AGACCACCTC
ACCACTTTTG CCGTCTTCGC CGCGGCTAAG GAAGTAACGC CGCCGGTGAC CTTCAAGGAT
ATGCAGGGCC ACTGGGCCGC CGAGACGGTC AGCCGCCTCG CGAGCATGGA CATCATTTCC
GGTTATCCCG ACGGCACCTT TAAGCCCGAC AACCAGATCA CCCGGGCGGA AGCCACCGCC
ATCCTGGCGC GGGCGCTGAA GCTGGCTCCG GGCAGCGAAC AGGACTTGAA GTTTAAAGAC
AACGCCGCCA TCCCCGAATG GGCCCGGGGT GTCGTCGCCG CGGCGGCAAG GGAAGGGCTG
ATTAAAGGTT ATCCGCAGCC TGACGGCAGC GTCACCTTCG TGCCCGGCCG CTCCATCTCC
CGGGCGGAAC TGGCCGCCAT AGTGGTCCGC ATCCTGGCCA TGAAGTCGGG CGCCCCGGCG
CCGGCCGAGC TGAAGTTCGC CGACACGGCC AGCATCCCTG ACTGGGCGCG CCAGAGCGTG
GCCCAGGCCG TGGCCAGCGG CATCGTCGCC GGCTATCCCG ACAACACCTT CCAGGCCGAG
CGGCCCGTCA CCCGGGCGGA AGCCGCGGCC ATGATCCTGC GCCTCTTGGA TAAGATTGGA
ACCCAGGGAG GCCAGTAA
 
Protein sequence
MTFKRLVLVV AAVMMLLLGA TSAYAAYTIN LNEPVVSTSA PPYATVTAGG SVQADGQPAV 
GVPVALRLSG ADGQLLAADQ VATDSSGSFS EPLPLPAGTN SGSATLVAAA AGAIAQQVVE
IPPLPAAYYG SVKNSGGSNV AAGSVEAYID GQKVGSLDFQ NGFYGGSGGL DPKLVASGTD
ADKGKTVTFK VVINGQAYDA TPASPVTWEP GDVRQVDLSI TAQQPPAGDQ VPPAVTTTDP
ANGATGVAVT KAVAITFSEN IQAGAAYDSI ALKDGSGNSV AFTKSISGSV LTITPSANLA
YSTIYTVSLP AAAVQDLAGN ALASPFTFSF TTQAASSGGG GGGGGGGGGG GGSTTPSTNR
VEQPIQAGAA TVAELAGVAR VDVPAGAVSG TGATIAAEVK DDSLAAQAGM TLISKVVDVT
LNNGTLNGQL TITLYYDKSK LAADQQPVAC YYDASQGQWV RLEGSVDENA GTVAVTVDHL
TTFAVFAAAK EVTPPVTFKD MQGHWAAETV SRLASMDIIS GYPDGTFKPD NQITRAEATA
ILARALKLAP GSEQDLKFKD NAAIPEWARG VVAAAAREGL IKGYPQPDGS VTFVPGRSIS
RAELAAIVVR ILAMKSGAPA PAELKFADTA SIPDWARQSV AQAVASGIVA GYPDNTFQAE
RPVTRAEAAA MILRLLDKIG TQGGQ
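
The sequences above are printed in blocks of ten characters separated by spaces. The minimal sketch below shows one way to strip that layout and compute a GC percentage, for example to compare against the GC content field. It is not part of the database record, the helper names are hypothetical, and the demo string holds only the first two blocks of the gene sequence rather than the full gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(text):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used only as a small demo.
    demo = "TTGACCTTTA AAAGGCTTGT"
    print(len(clean_sequence(demo)))
    print(gc_percent(demo))
```

Pasting the full gene sequence into `gc_percent` in place of the demo string gives the value for the whole gene.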