Gene Moth_2008 details


Gene Information

Locus tag: Moth_2008
Symbol: gatB
ID: 3831962
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2093275
End bp: 2094705
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 61%
IMG OID: 637829937
Product: aspartyl/glutamyl-tRNA amidotransferase subunit B
Protein accession: YP_430847
Protein GI: 83590838
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0064] Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase B subunit (PET112 homolog)
TIGRFAM ID: [TIGR00133] glutamyl-tRNA(Gln) and/or aspartyl-tRNA(Asn) amidotransferase, B subunit
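
The coordinate and length fields in this record are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS of this length, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2093275, 2094705)
print(length)                  # 1431
print(protein_length(length))  # 476
```

Under those assumptions, 1431 bp corresponds to 477 codons, one of which is the stop codon, leaving the 476 residues listed above.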


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATACG AAGCTGTAAT CGGCCTGGAA GTCCATGCGG AACTGAAGAC GGCTTCCAAG 
GCTTTCTGTA GCTGTAGTAC GGCCTTTGGC GGCGAGCCCA ATACCCACGT CTGCCCTGTT
TGCCTGGGGC TGCCCGGGGT ATTGCCGGTT ATTAACCGCC AGGTGGTTGA GTTCGGTCTG
AAAACGGCCC TGGCCCTCAA CTGCCGGGTG GCGCCCTTCT GTAAATTCGA CCGCAAGAAC
TACTACTACC CCGATCTCCC CAAGAACTAC CAGATTTCCC AGTACGACCT GCCCCTGGCT
ACCGGCGGCT ACCTGAAGAT CAACGTCGAC GGCCAGGAGC GGGTAATCGG TATCACCCGG
GTGCATATGG AAGAAGACGC CGGCAAACTC GTCCACGTGG ACGGACCCGG CGGGGGTTAC
TCCCTGGTGG ACTACAACCG CACCGGCGTA CCCCTGCTGG AAATCGTCTC CGAGCCGGAC
CTGCGCTCTC CGGCGGAGGC CCGGGCTTAT ATGGAAAAGC TCAGGGCCAT CCTCCAGTAC
CTGGACGTTT CCGACTGCAA GATGGAAGAG GGATCCTTGC GCTGCGACGC CAATGTCTCG
GTGCGGCCCC GGGGTTCTCA AACCTTCGGC ACCAAGACGG AAGTAAAGAA TATGAACTCC
TTCCGCGCCC TGCAGCGGGC CCTGGAGTAC GAGATTGAGC GCCAGATTGC CATTCTGGAG
GGCGGCGGCC GGGTCGAGCA GGCGACCATG GCGTGGGATG AAAGCCGCGG GGTAACAACC
GTTATGCGGA CCAAGGAGCA GGCCCACGAC TACCGCTACT TCCCGGAACC GGACCTGGTA
CCCCTGGAAA TCGACGCCGC CTGGATCGAA CGGGTACGCC GGGAGCTTCC GGAGCTGCCC
GATGCCCGTT GCCGGCGTTT AATGGATACC TTCGGCCTGC CGGCCTACGA TGCCGGGGTG
ATAACCAGCT CCCGGGACCT GGCCGACTAC TTCGACCGGG TAGTGGCCCG GTATCCCGAT
GCCAAGGTTG TCAGCAACTG GATCATGGGC GACTTCTTAC GGCTGTTAAA CGCCCGGAAC
CTTGAGCCCG GCCAGGCGCC GGTACCACCC GAGGAACTGG CAGACCTCCT TGAGTTGCAG
AAGGAAGGCA CCATCAGCGG TAAGATCGCC AAGCAGGTCC TGGAGGAGAT GTTTGCCAGC
GGCAAAGGCG CCCGCCAGAT TGTCCAGGAA AGGGGCCTGG TCCAGATCAG CGACACCGCC
GCCCTGGGGA AGATTGTCGA CGAGGTGCTG GCAGCCAATC CCAACGTGGT GGAGGACTAC
CGCAACGGCA AGGAAAAGGC TCTGGGCTTT CTGGTGGGCC AGGTAATGAA GGCGACAGGG
GGGAAAGCCA ACCCCGGCCT GGTGAACAAG CTCCTTAAGG AAAGGTTATA G
 
Protein sequence
MEYEAVIGLE VHAELKTASK AFCSCSTAFG GEPNTHVCPV CLGLPGVLPV INRQVVEFGL 
KTALALNCRV APFCKFDRKN YYYPDLPKNY QISQYDLPLA TGGYLKINVD GQERVIGITR
VHMEEDAGKL VHVDGPGGGY SLVDYNRTGV PLLEIVSEPD LRSPAEARAY MEKLRAILQY
LDVSDCKMEE GSLRCDANVS VRPRGSQTFG TKTEVKNMNS FRALQRALEY EIERQIAILE
GGGRVEQATM AWDESRGVTT VMRTKEQAHD YRYFPEPDLV PLEIDAAWIE RVRRELPELP
DARCRRLMDT FGLPAYDAGV ITSSRDLADY FDRVVARYPD AKVVSNWIMG DFLRLLNARN
LEPGQAPVPP EELADLLELQ KEGTISGKIA KQVLEEMFAS GKGARQIVQE RGLVQISDTA
ALGKIVDEVL AANPNVVEDY RNGKEKALGF LVGQVMKATG GKANPGLVNK LLKERL
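
Both sequences are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that, then computes GC content and translates the first line of the gene sequence. It is an illustration under stated assumptions: the helper names are hypothetical, and the codon table is the standard set of codon-to-amino-acid assignments, which translation table 11 shares (the tables differ in permitted start codons, which this sketch does not model).

```python
from itertools import product

BASES = "TCAG"
# Standard codon-to-amino-acid assignments, in TCAG order for each codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    A trailing partial codon is ignored.
    """
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence as printed in this record.
first_line = "ATGGAATACG AAGCTGTAAT CGGCCTGGAA GTCCATGCGG AACTGAAGAC GGCTTCCAAG"
dna = clean_sequence(first_line)
print(translate(dna))  # MEYEAVIGLEVHAELKTASK
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Applying `gc_percent` to the full cleaned gene sequence gives a value that can be compared against the GC content field.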