Gene Athe_2036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2036 
Symbol 
ID7408249 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2150808 
End bp2152493 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content37% 
IMG OID643716403 
Productarginyl-tRNA synthetase 
Protein accessionYP_002573886 
Protein GI222530004 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0018] Arginyl-tRNA synthetase 
TIGRFAM ID[TIGR00456] arginyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.432071 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAATTTAG TAAAACTTGC AAAACAGCAG ATTCAAGATG TAGTTCAAAA TGCCATAAAA 
AACTGTATTG ATAAAGGAAT ATTTGAGCTT GACAGCATTC CAGATATAAT GATTGAAAAG
CCGAAGGAGA AATCTCACGG CGATTTTGCA ACAAACATAG CAATGGAGCT TACAAGAAAA
CTTAAGAAAA ATCCAAGAGA GATTGCAAAT GGCATTGTTA ACGCAATTGA TTTATCAAAT
ACTTTCATTG AAAAGGTTGA AGTTGCGGGC CCAGGGTTTA TAAATTTCTT TTTCAAAAAA
GACTGGCTTT ACAAAGTTGT GGATGTGATT TTGTCTGAAG GTGACGACTA TGGAAAAGTA
AATATTGGAA ATGGCAAAAA AGTGATGGTT GAGTTTGTCT CGGCAAATCC GACTGGCCCT
ATGCACATGG GGAATGCCCG CGGAGGCGCG CTGGGGGACT GTCTTGCAAA CCTTTTAAAA
TGGGCTGGAT ACAATGTTAC AAAGGAGTTT TATGTCAATG ATGCTGGAAA CCAGATAGAA
AAGTTTGGAC AGAGCCTTGA GATTAGATAC AGACAGCTGA AGGGTGAAAA TGTAGATCTT
CCTGAAGATT GCTATCATGG TGAGGATATA ATCGAAAGGG TAAAAGAATA TTTAGATGAG
CACGGGGATG ATTTGGAGAA TTTGTCTTCG GATGAGAGAA GAAAAAAGCT TGTTGACTTT
GCCCTAAAGA GAAATATTTT GCTTATGAAA GAGCACTTGA GAAAATATGG CATAGAATAT
GATGTATGGT TTCATGAAAG TAGCCTTTAT GAGAGTGGAG AAGTTTTTGA GACAATTGAG
GATTTAAAAT CAAGAGGATA CACGTACGAA AAAGATGGGG CGCTGTGGTT TGCAGCATCA
AAAATAGATG AGAGCTTGAA AGATGAGGTT TTGATAAGAG CAAATGGGAT TCCAACCTAT
TTTGCAGCTG ATATTGCTTA TCACAGAAAC AAGTTTGAAA AAAGAGGCTT TGACATTGTG
ATAGACATTT GGGGTGCTGA CCATCACGGG CATGTTGCAA GAATGAAAGC TGCAATGAAA
GCGCTTGGGA TTGACCCAGA AAAGCTTATA GTTATCCTTA TGCAACTTGT AAGACTTGTA
AGAGGTAAAG AAGTTGTTAG AATGTCTAAA AGGACAGGTA AAGCAATAAC TCTAATTGAC
CTAATTGATG AAATTGGTAA AGATGCAGCA AGGTTTATGT TCAATACAAA ATCAGCAGAT
ACTCACATTG AGATAGATTT AGACCTTGTG ACACAACAGA CACTTGACAA CCCTGTATTT
TATGTCCAGT ATGCTCATGC AAGAACATGC GGAATCATTA GAGCTTTGTC AGAAGAGGGA
ATAGTGTTAG ATAAAAGTAG AATAAAAGTA GAGCTTTTGC AGCAAGAGGA AGAGCTTGAA
CTTTTGAAAA AGCTTTTAGA GCTTCCTGAA GAAATAGAAA TGGCGGCAAA GAACTTAGAC
GTTAGCAGAG TGACAAAGTA TCTTTTAGAC TTAGCATCTA TGTTCCATGC TTTTTATAAC
GCGTGCAGAG TTAAAAATGA AAATGAAGAA CTGATGTTTA CAAGGCTTTC TCTTGTTGAG
TGTGTAAGAA TTGTCATAAA TAACATGCTA AGACTGCTTG GAGTTGACGC TCCAGAGAAA
ATGTAA
 
Protein sequence
MNLVKLAKQQ IQDVVQNAIK NCIDKGIFEL DSIPDIMIEK PKEKSHGDFA TNIAMELTRK 
LKKNPREIAN GIVNAIDLSN TFIEKVEVAG PGFINFFFKK DWLYKVVDVI LSEGDDYGKV
NIGNGKKVMV EFVSANPTGP MHMGNARGGA LGDCLANLLK WAGYNVTKEF YVNDAGNQIE
KFGQSLEIRY RQLKGENVDL PEDCYHGEDI IERVKEYLDE HGDDLENLSS DERRKKLVDF
ALKRNILLMK EHLRKYGIEY DVWFHESSLY ESGEVFETIE DLKSRGYTYE KDGALWFAAS
KIDESLKDEV LIRANGIPTY FAADIAYHRN KFEKRGFDIV IDIWGADHHG HVARMKAAMK
ALGIDPEKLI VILMQLVRLV RGKEVVRMSK RTGKAITLID LIDEIGKDAA RFMFNTKSAD
THIEIDLDLV TQQTLDNPVF YVQYAHARTC GIIRALSEEG IVLDKSRIKV ELLQQEEELE
LLKKLLELPE EIEMAAKNLD VSRVTKYLLD LASMFHAFYN ACRVKNENEE LMFTRLSLVE
CVRIVINNML RLLGVDAPEK M