Gene Cthe_1541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1541 
Symbol 
ID4810048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1868956 
End bp1870416 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content51% 
IMG OID640106960 
Productaspartyl/glutamyl-tRNA(Asn/Gln) amidotransferase subunit A 
Protein accessionYP_001037961 
Protein GI125974051 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0154] Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit and related amidases 
TIGRFAM ID[TIGR00132] glutamyl-tRNA(Gln) and/or aspartyl-tRNA(Asn) amidotransferase, A subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.859243 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAACGG GAATATCTGA ACTGGCAAAA AGGCTTCAAA GCCGCGAAAT TTCCGCGGTG 
GAGCTGACAA AAGCATATAT CGGTGCAATT GAAAAACTGA ATCCAACCAT CAATGCCTAT
GTGCATTTGA CCTTTGATAC CGCCATGAAG GCTGCGGAAA AGGCCGATCA AAGGCTTAAA
GAGGGCGGTG CGCCGCTGCT CTGCGGTATC CCCATGGCCC TGAAGGACAA TATTTGCACT
GACGGACTTA GCACTACATG CTGCTCCAAA ATCCTGAAAG GCTTTAAACC GTACTATGAC
GCTACCGTTT GGGAGAAGCT GAAAGCCCAT GGTGCAGTGC TTCTGGGCAA GACCAATATG
GATGAGTTTG CCATGGGAAG CACTTCGGAA ACCAGCTGCT ATGGGGCGCC GCTGAACCCC
AGAAACACGA ATTATGTTAC CGGCGGTTCT TCGGGCGGTT CGGCTGCGGC AGTTTGCGCC
AATCTTGCGG TATACAGCCT GGGCTCGGAT ACGGGCGGTT CCATTCGTCA GCCGGCCTCC
TTTTGCGGAG TGGTGGGACT TAAACCCACT TATGGTGCGG TATCCCGTTA TGGGCTCATC
GCTTACGGCA GCTCTCTGGA CCAGATTGGA CCTATGACAA ACAGCGTGAA AGATGCCGCC
ATCGTTTTCG ATGCTATAAA AGGCGGCGAC AGGCGCGATC AGACCAGTGT GGATTATGAT
TATGGTTCCT CTCTGGCTGA ATGTCTGGAC CGGGATGTCA AGGGAATGCG CATTGGTGTG
GCCGAGGAAT TCTTTGACGG TATCAATCCG GAGATTAAAT CAAAGATTGA GGAAGCCATA
AAGCTGTTTG AAAGAAACGG TGCTGTGATA GAAAATATCA GGCTTCCGGC CTTAAAGCTT
GCGCTTCCTG TATATTACAT TATCGCCTGT GCCGAAGCAT CCTCCAACCT GGGACGATAT
GACGGTATCC GTTATGGTTA TCGCACATCC TCATACACCA GTATTGAGGA CATGATTGTC
AAAACAAGGA GCGAAGGCTT TGGCGACGAG GTTAAACGCC GCATCATGCT GGGAACCTAT
GTGCTGTCCA GCGGTTATTA TGATGCTTAC TACAAGAAAG CCTGCCTGAT TCGTGAAGAG
ATAAACCGGG AATTTGATGC CGTATTTGAA AAATGTGACG TGCTCGTTGC TCCTACTGCA
CCCAATACAG CATTTCCGCT GAACTACAAG GGTGCAGGCC CGGTTGAGAT GTACTTGTCG
GATATCTGCA CTGTACCCGT CAATATCGCA GGCGTTCCGG CCATCTCGGT TCCCTGTGGG
GAGGATTCCA ACGGGCTTCC TGTCGGCATG CAGATCATCG GTAAGAAATT TGATGAGGCA
ACGGTTCTTC AAGCGGCTCA TTTTTATGAA CAAAGCGCCG GAGAAGTGAT TGGCGTATAC
GAAGGAGGTG CCAGGATATG A
 
Protein sequence
MRTGISELAK RLQSREISAV ELTKAYIGAI EKLNPTINAY VHLTFDTAMK AAEKADQRLK 
EGGAPLLCGI PMALKDNICT DGLSTTCCSK ILKGFKPYYD ATVWEKLKAH GAVLLGKTNM
DEFAMGSTSE TSCYGAPLNP RNTNYVTGGS SGGSAAAVCA NLAVYSLGSD TGGSIRQPAS
FCGVVGLKPT YGAVSRYGLI AYGSSLDQIG PMTNSVKDAA IVFDAIKGGD RRDQTSVDYD
YGSSLAECLD RDVKGMRIGV AEEFFDGINP EIKSKIEEAI KLFERNGAVI ENIRLPALKL
ALPVYYIIAC AEASSNLGRY DGIRYGYRTS SYTSIEDMIV KTRSEGFGDE VKRRIMLGTY
VLSSGYYDAY YKKACLIREE INREFDAVFE KCDVLVAPTA PNTAFPLNYK GAGPVEMYLS
DICTVPVNIA GVPAISVPCG EDSNGLPVGM QIIGKKFDEA TVLQAAHFYE QSAGEVIGVY
EGGARI