Gene Cthe_1031 details

Gene Information

Locus tag: Cthe_1031
Symbol: gatB
ID: 4811325
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1233155
End bp: 1234588
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 42%
IMG OID: 640106449
Product: aspartyl/glutamyl-tRNA amidotransferase subunit B
Protein accession: YP_001037456
Protein GI: 125973546
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0064] Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase B subunit (PET112 homolog)
TIGRFAM ID: [TIGR00133] glutamyl-tRNA(Gln) and/or aspartyl-tRNA(Asn) amidotransferase, B subunit
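
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 1233155
END_BP = 1234588
GENE_LENGTH_BP = 1434
PROTEIN_LENGTH_AA = 477


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

# Both computed values agree with the lengths listed in the record.
assert computed_gene_length == GENE_LENGTH_BP
assert computed_protein_length == PROTEIN_LENGTH_AA
```

Under these assumptions, 1234588 - 1233155 + 1 gives 1434 bp, and 1434 / 3 - 1 gives 477 aa, which matches the listed values.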


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0403261
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGTATG AAATAATTGT AGGGTTGGAG GTTCATGTTG AGCTTTCAAC CAAATCAAAA 
ATATATTGTT CATGCACCAC GGATTTCGGA GGAGAGCCAA ATACCCACAT TTGTCCTGTC
TGCACCGGTA TGCCGGGAGT GCTTCCGGTT TTGAACAAAA AAGTGGTGGA ATACGCGGTA
AAAACCGGGC TTGCCACAAA CTGCAGCATT GCAAAGTACA GCAAGCAGGA CAGGAAAAAC
TATTTTTATC CCGACCTTCC AAAAGCTTAT CAAATTTCCC AGTACGACCT TCCCCTATGC
CGGGACGGAT ATATTGACAT TGAGGTGGAA GGAAGGACAA AAAGGATTGG GATTACAAGG
ATACATATTG AGGAAGATGC GGGAAAGCTT GTACATGATC AGAATGAAAC GGGAACGTTG
ATTGATTATA ACCGGTGCGG TGTGCCGTTA ATTGAGATTG TCACCGAGCC TGACATGCGT
TCGGCGCAGG AAGCGAGGGC TTTTGTGGAA AGCCTTAGGA ACATACTACG GTACATTGAT
GTTTCCGACT GCAAAATGCA GGAAGGTTCA TTGAGAGTTG ACGTTAATCT TTCGGTAAGA
CCTAAGGGCC AAAAGGAATT TGGGACGAGG ACGGAAATGA AAAATTTAAA TTCCATAAGG
TCAATGGTAA GGGCAATTGA AAGTGAAGCC AAAAGACAGA TTGAGGTTAT TGAAAGTGGC
GGAATTATTG TTCAGGAAAC CAGGAGATGG GATGAACACA AAGGTGTAAG CTGTTCAATG
AGAACTAAAG AGGAGGCCCA CGATTATCGA TATTTTCCGG AACCGGATCT TATGCCGATA
GTGGTGGATG AAGAATGGAA GGAAGAAATA AAAAGAAGTC TTCCCGAGCT TCCTGATGCA
AGAAGAAAAA GGTATGTAAA CGAGTATGGA CTTCCCGGAC ATGATGCTTT CATTCTTACA
AGCTCAAAGG CTCTTGCAGA TTTTTTTGAG GAGGCGGCGG GAAAATGCAA TAATGCAAAA
GCCGTGAGTA ATTTTATACT GGGGGATGTT TCGAGAATCC TTAACGACAA GGGAATGGAA
GCTGAAGACA TACCTTTTCC GGCGGAATAC CTGGCAAAGT TGGTGAAATT GGTTGACCAG
GGAACAATAA GCACAACCAT TGCAAAAAAA GTATTGGAGA TAATGTTTGA ACAAAAAAAG
GATCCGCAGG AGATAGTAAG GGAAGAAGGA CTTGAAGTTG TAAGTGATGA AAAAGCTCTT
GCCGAGGTTG TTAAAAAGGT GATTTCAAAC AATACAAAAT TGGTGGAGGA TTACAAAAAA
GGCAAGGACA AAGTTTTCGG ATTCCTTGTG GGACAGGCTA TGAAAGAGAC TAAGGGGAAA
GCAAATCCCC GGCTTTTAAA CAAGATTTTG AAAGAAGAAC TTGACAAAAT ATAA
 
Protein sequence
MEYEIIVGLE VHVELSTKSK IYCSCTTDFG GEPNTHICPV CTGMPGVLPV LNKKVVEYAV 
KTGLATNCSI AKYSKQDRKN YFYPDLPKAY QISQYDLPLC RDGYIDIEVE GRTKRIGITR
IHIEEDAGKL VHDQNETGTL IDYNRCGVPL IEIVTEPDMR SAQEARAFVE SLRNILRYID
VSDCKMQEGS LRVDVNLSVR PKGQKEFGTR TEMKNLNSIR SMVRAIESEA KRQIEVIESG
GIIVQETRRW DEHKGVSCSM RTKEEAHDYR YFPEPDLMPI VVDEEWKEEI KRSLPELPDA
RRKRYVNEYG LPGHDAFILT SSKALADFFE EAAGKCNNAK AVSNFILGDV SRILNDKGME
AEDIPFPAEY LAKLVKLVDQ GTISTTIAKK VLEIMFEQKK DPQEIVREEG LEVVSDEKAL
AEVVKKVISN NTKLVEDYKK GKDKVFGFLV GQAMKETKGK ANPRLLNKIL KEELDKI
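
The sequences above are printed in space-separated blocks of ten characters. The sketch below, in plain Python with no external libraries, shows one way to strip that layout, compute a GC fraction, and translate the gene sequence. The helper names are hypothetical. The amino acid assignments are those of the standard genetic code, which translation table 11 shares; the sketch does not handle alternative start codons, which is enough here because this gene starts with ATG.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard
# amino acid assignments (also used by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence, as printed in the record.
first_blocks = "ATGGAGTATG AAATAATTGT AGGGTTGGAG"
print(translate(first_blocks))  # MEYEIIVGLE
```

The ten residues printed for the first three blocks match the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and protein sequence fields.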