Gene Moth_2009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2009 
Symbol 
ID3831963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2094705 
End bp2096168 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content63% 
IMG OID637829938 
Productaspartyl/glutamyl-tRNA(Asn/Gln) amidotransferase subunit A 
Protein accessionYP_430848 
Protein GI83590839 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0154] Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit and related amidases 
TIGRFAM ID[TIGR00132] glutamyl-tRNA(Gln) and/or aspartyl-tRNA(Asn) amidotransferase, A subunit 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATTAT ATTACTTAAC AGCCCATGAA TTGAGCGACC TGTTGAACCG TAAGGAAATC 
AGCTCCGAAG AAGCCACGGC GGCCATCATT GACCGGATTG AAGCCGTCGA CGGCCGGGTC
CAGGCTTATC TAACCCGGAC GGCGGAACAG GCCCTGGAAG AGGCCCGGGC TGTGGATGCC
GCCCGGGCCC GGGGTGAAAC CCTGGGACCC CTGGCCGGGG TGCCCATGGC CCTGAAGGAC
AATCTCTGTA CGGAGGGGGT GCGGACCACC TGTTCGTCCC GGATGCTGGC AGACTGGGTG
CCTCCCTATG ACGCCACCGT GGTCCGGCGT CTTAAAGAGG CCGGGGCGGT GATGCTGGGC
AAGCTCAATA TGGACGAGTT CGCCATGGGT TCTTCCACGG AGAATTCCAG TTTCTTCCCC
ACCCGGAATC CCTGGGACCT GGAGCGGGTG CCGGGAGGTT CCAGCGGCGG CGCCGTGGCG
GCGGTGGCTG CCGGCGAAGC CTATTTCGCC TTAGGGTCGG ATACGGGCGG TTCCATCCGC
CAGCCGGCTT CCTTCTGTGG CGTCGTGGGG ATGAAGCCCA CCTACGGCCG GGTATCCCGT
TACGGCCTGG TGGCCTTCGC CTCCTCCCTG GACCAGATCG GCCCCATTAC CCGCGACGTC
ACCGATTGCG CCCTGGTCCT GGAGGCCATT GCCGGGCATG ACCCCCTGGA TTCTACGTCC
GCCGATCTGC CGGTGCCCGA TTACCGGTCG GCCCTGAAGC CGGAAGTGAA AGGCCTGAAG
ATAGGCGTTC CGCGGGAGTA TTTCGGCGCT GGCATGGAAC CGGAGGTGGC GGCCATTGTC
CGGCGGGCTA TTGCCAAACT GGAGGAACTG GGAGCCGTTT GTGAGGAAAC CTCCCTGCCC
CATACCGAGT ACGCCCTGCC GGCCTATTAC CTGGTGGCCC CGGCCGAGGC CTCGTCCAAC
CTGGCCCGTT ATGACGGCGT CAGTTACGGC CTGCGGGTGC CGGGCAAGGA TATTATCGAG
ATGTACATGA ATACCCGCAG CCAGGGTTTC GGCCCCGAGG TCAAACGGCG GATTATGCTC
GGCACCTATG CCTTGAGCTC GGGCTACTAT GATGCCTATT ACCTGAAGGC CCTTAAGGTA
CGAACCCTTA TCCGCCGGGA CTTTGAAACG GCCTTCGAGA AATACGATCT CCTGGCCACG
CCTACCTCCC CGACGGTGGC CTTCCGCCTG GGTGAAAAGG CGGGAGACCC CCTGGCCATG
TACCTGTCCG ACCTGTGTAC CATCCCTATT AACATGGCCG GGCTACCGGC CCTGTCTTTG
CCCTGCGGTT TCAGCCAGGG ACTACCCGTC GGCCTGCAGC TGATTGGTCG GCCCTTTGCC
GAGGCCACCC TGTTGCAGGC AGCTTATGCC TTTGAACAGA GTACGGAATA CCACCGGCGG
CGGCCGGAAC TGGGGGTGGC GTAA
 
Protein sequence
MELYYLTAHE LSDLLNRKEI SSEEATAAII DRIEAVDGRV QAYLTRTAEQ ALEEARAVDA 
ARARGETLGP LAGVPMALKD NLCTEGVRTT CSSRMLADWV PPYDATVVRR LKEAGAVMLG
KLNMDEFAMG SSTENSSFFP TRNPWDLERV PGGSSGGAVA AVAAGEAYFA LGSDTGGSIR
QPASFCGVVG MKPTYGRVSR YGLVAFASSL DQIGPITRDV TDCALVLEAI AGHDPLDSTS
ADLPVPDYRS ALKPEVKGLK IGVPREYFGA GMEPEVAAIV RRAIAKLEEL GAVCEETSLP
HTEYALPAYY LVAPAEASSN LARYDGVSYG LRVPGKDIIE MYMNTRSQGF GPEVKRRIML
GTYALSSGYY DAYYLKALKV RTLIRRDFET AFEKYDLLAT PTSPTVAFRL GEKAGDPLAM
YLSDLCTIPI NMAGLPALSL PCGFSQGLPV GLQLIGRPFA EATLLQAAYA FEQSTEYHRR
RPELGVA