Gene Moth_0463 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0463 
Symbol 
ID3830892 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp465936 
End bp467726 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content63% 
IMG OID637828398 
Productadenine deaminase 
Protein accessionYP_429337 
Protein GI83589328 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1001] Adenine deaminase 
TIGRFAM ID[TIGR01178] adenine deaminase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0688218 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCCCAA CCAGGCGTCC CCTGGCTGAA GTCACCAGAG AACTGGTGGC TGTGGCTACC 
GGAAAACTCC CTGCCGACAC CGTTATCAAG GGCGGCAAAG TAGTAAACGT TTTTACCGGC
GAAATACTGC CCTGGGATAT CGCCATCAAA AATGGCCGCA TTGCCAGCGT CGGTGACGTA
TCCGCGGCTG TGGGGCCGGA AACGGAGGTG ATTGACGCCA GCGGCTATTA CCTTTGCCCG
GGTTTCATGG ATGGCCATGT CCATGTGGAA AGCAGTATGG TGACGGTGAC CCAGTTCGCC
CGGGCCGTCC TCCCCGGCGG CACCACGGCG ATTTTTATGG ACCCCCATGA GATTGCCAAT
GTCCTGGGGA TGGACGGGGT AAAGCTGATG GTGGACGAGG GCCGGGAACT CCCCTTAAAG
GTCTTTGCCA CCATGCCTTC CTGTGTTCCG GCTGCGCCCG GATTTGAGGA CGCCGGTGCC
AGCTTTGGGC CGGAAGAGGT GGCCGCCGCC ATGCAGTGGC CCGGTATCTG CGGCCTGGGG
GAAATGATGA ACTTCCCCGG TGTCCTGGCC GGTGATCCGG CCGTCCACGG CGAACTAAGG
GCCACCCTGG CGGCCGGTAA ACCCATCACC GGACACTTCG CCATGCCCGC CGACTTTCAG
GGCCTGGCCG GCTATACGGC CGCCGGCATC TCTTCCTGCC ACGAATCCAC CCGTACCGAA
GACGCCCTCA ACCGCCTGCG CCTCGGCATG TATGCCATGA TGCGGGAGGG ATCGGCCTGG
CACGATATCA AGGCCACCAT TAAGAGTCTG ACAGAAACCC GGGTCGACAG CCGCCGGGCC
ATGCTGGTGA GCGACGACAC CCACCCGGAA ACCCTGTTAT CCACCGGGCA CTTAAACCAT
GTGGTCCGCC GGGCCATCGA AGAGGGCCTG AACCCCATCC GGGCCATCCA GGCGGTAACC
ATTAATACGG CCGAGTGCTT TGGCGTAGCC CAGGACCTGG GGGCCATTGC GCCGGGGCGT
TATGCCGATA TCCTCTTCTT GAAGGACCTG GCCCGGGTAG CGATAGATAA GGTTATGGTC
GACGGCCGGG TGGTTGCCGC CGGGGGCAGG CTCCTGGTGG ACCTGCCGGC AGTCGCCTAC
CCCGATAGGG TGCGCCATTC GGTACACTTA AAGGAACCTC TGACCCCCTG GCATTTCCGG
ATCAACGCCC CGGCCGGCAA AAGCCGGGTT CAGGTCCGGG TAATGGAAAT TATTGAAGCC
AATGTCAACA CCCGCCACCT GACAGTCACC GTACCGGTGG TGGACGGCCA GGTCACAGCC
GGGGTCGAGG CCGACCTGGC CAAAGTGGCG GTTGTCGAGC GCCACGGCGG CAACGGCAGT
ATCGGCCTGG GCTTTGTCCG GGGTTTCGGC TTTAAAGCCG GCGCCGTGGC CTCTACGGTG
GCCCACGACA GCCACAACCT CCTCATTGTT GGTATGAATG ACGCCGATAT GGCCCTGGCG
GGCAACACCC TGGCTGGATG CGGCGGTGGT ATGGTGGCCG TCAGGGACGG CCAGGTTCTG
GCCCTCCTGC CCCTGCCCAT TGCCGGCCTG ATGTCGGACA GGCCGGTGGA AGAGGTGGCC
GCCCGGTTGG CAGCCGTCCA CCGGGCCTGG CAGGAACTGG GGTGCCGGCT GGTGTCGCCC
TTTATGACCA TGGCCCTCCT TTCCCTGCCG GTCCTGCCGG AGTTACGCCT GACCAACCGC
GGCCTGGTAG ACACCCTGCA GTTTAAAATG GTTGACTTGA TAACCGGTTG A
 
Protein sequence
MLPTRRPLAE VTRELVAVAT GKLPADTVIK GGKVVNVFTG EILPWDIAIK NGRIASVGDV 
SAAVGPETEV IDASGYYLCP GFMDGHVHVE SSMVTVTQFA RAVLPGGTTA IFMDPHEIAN
VLGMDGVKLM VDEGRELPLK VFATMPSCVP AAPGFEDAGA SFGPEEVAAA MQWPGICGLG
EMMNFPGVLA GDPAVHGELR ATLAAGKPIT GHFAMPADFQ GLAGYTAAGI SSCHESTRTE
DALNRLRLGM YAMMREGSAW HDIKATIKSL TETRVDSRRA MLVSDDTHPE TLLSTGHLNH
VVRRAIEEGL NPIRAIQAVT INTAECFGVA QDLGAIAPGR YADILFLKDL ARVAIDKVMV
DGRVVAAGGR LLVDLPAVAY PDRVRHSVHL KEPLTPWHFR INAPAGKSRV QVRVMEIIEA
NVNTRHLTVT VPVVDGQVTA GVEADLAKVA VVERHGGNGS IGLGFVRGFG FKAGAVASTV
AHDSHNLLIV GMNDADMALA GNTLAGCGGG MVAVRDGQVL ALLPLPIAGL MSDRPVEEVA
ARLAAVHRAW QELGCRLVSP FMTMALLSLP VLPELRLTNR GLVDTLQFKM VDLITG