Gene Moth_2410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2410 
SymbolargS 
ID3830777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2530410 
End bp2532092 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content60% 
IMG OID637830329 
Productarginyl-tRNA synthetase 
Protein accessionYP_431235 
Protein GI83591226 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0018] Arginyl-tRNA synthetase 
TIGRFAM ID[TIGR00456] arginyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000917803 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATATAG TACAGGAAAC CAAAAGGCGG CTCGCAGCGG CATTGACTGA TGCCGCTGCC 
ACGGCCAGGG CGGCCGGTGA AATTAGTTAC GATGAGCTGC CTGATTTTGT CATTGAGACG
CCGCGGGATA AAACTCACGG CGACTTTGCT GCTAACCTGG CTTTATTGCT GGCCAGGCAG
GCGCGGCAGT CCCCTCGCAA CGTAGCGGCA GCCATTGTGC GGCACCTGGA AAGGCCGCAA
CCCGGCGTGG CCAGAGTTGA AGTGGCCGGA CCGGGCTTTA TTAATTTTAC CCTGGATAAC
CAATGGTTGT TACCGGTGTT GCCGGCCGTC CTGGCGGAAG ACGACCACTA TGGGTGGTCC
AATATCGGCC AGGGAGCCAA GGTCCAGGTG GAGTTCGTCA GCGCCAACCC CACGGGGCTT
TTGCATATGG GTAATGCCCG CGGTGCCGCC CTGGGGGATA GTATTGCCAA CCTCCTCACG
GCGGTAGGCT ATGACGTTAC CCGGGAATTC TATATCAACG ACGCCGGCAA CCAGATTGAG
AATTTTGGCC TCTCCCTGGA GGCTCGCTAC CTTCAGGCCC TGGGCCAGGA AGCCTCTATA
CCTGAGGACG GTTATCACGG CGAGGACCTG GTGGCTACCG TCGGCCGTTT TATCGCCAAG
TACGGGGATA AGTACCTGGA TACAGATCCG GCCCTCCGGA GGGAGATGCT GGTCCGCTTT
GCCCTGGAAG AAAAGCTGGA CGCCATCCGC CGGGCCCTGG AGGATTTCGG CGTAACCTAT
GACGTCTGGT TCAGCGAGCA GTCTCTTCAC GACTCCGGCG CCGTCGCCCG GGCCATTGCC
GACCTGGAAA AGGCCGGATA TATTTATGAA AAGGACGGGG CACTGTGGTT TAAGGCCACC
AGTTTTGGCG ATGTTAAGGA CGAGGTGGTG GTGCGCAAGA ACGGCATCCC CACTTACTTT
GCCGCCGATA TCGCCTACCA CCGCAATAAA TTCGAACGCG GCTTCGAGCG GGTAATAAAT
ATCTGGGGCG CCGACCATCA CGGGCATGTA GCCCGCCTCA AAGGTGCTCT CCAGGCCCTG
GGCTATGACC CCCGCCGCTT GGAAGTCGTC CTCATGCAAT TGGTGCGCCT CTATCAGGGC
GGCGAAATCC TGCGCATGTC CAAACGTACC GGCCAGTACG TCACCCTGGA AGAACTAATT
GAAGAGGTGG GCCGGGACGC GGCACGCTAC TTCTTTGTCA TGTTGAAGAG CGACAGCCAC
CTGGAGTTCG ACCTGGACCT GGCCCGGTCC CAGTCGGCAG ACAACCCGGT GTATTACGTC
CAGTACGCCC ATGCCCGTAT CTGCAGCATC CTGCGCCTGG CGAAGGATAG GGGCCTGGAA
GTACCGCCGG CGCGGGAAGC CCGGCTGGAA CTCTTACAGG ACCCGGCTGA GCTGGAGTTG
ATCAAGCAGA TTGCTGCCTG GCCGGACACC GTGGCCGGGG CGGCCCAGGC CCTGGAGCCC
CACCGGTTGA CGCGCTTTGC CCACGATCTG GCCAGCCTGT TTCACAGCTT TTATACCAGT
TGCCGGGTCC TGGCCGATGA CCCGGAGGTC CGCAAGGCCC GGCTGGTACT GGTGGAAGCG
ACCCGGATCA CCCTGCGCAA CGTCCTGCAC CTCCTGGGAG TCACCGCCCC GGAGAGGATG
TAG
 
Protein sequence
MNIVQETKRR LAAALTDAAA TARAAGEISY DELPDFVIET PRDKTHGDFA ANLALLLARQ 
ARQSPRNVAA AIVRHLERPQ PGVARVEVAG PGFINFTLDN QWLLPVLPAV LAEDDHYGWS
NIGQGAKVQV EFVSANPTGL LHMGNARGAA LGDSIANLLT AVGYDVTREF YINDAGNQIE
NFGLSLEARY LQALGQEASI PEDGYHGEDL VATVGRFIAK YGDKYLDTDP ALRREMLVRF
ALEEKLDAIR RALEDFGVTY DVWFSEQSLH DSGAVARAIA DLEKAGYIYE KDGALWFKAT
SFGDVKDEVV VRKNGIPTYF AADIAYHRNK FERGFERVIN IWGADHHGHV ARLKGALQAL
GYDPRRLEVV LMQLVRLYQG GEILRMSKRT GQYVTLEELI EEVGRDAARY FFVMLKSDSH
LEFDLDLARS QSADNPVYYV QYAHARICSI LRLAKDRGLE VPPAREARLE LLQDPAELEL
IKQIAAWPDT VAGAAQALEP HRLTRFAHDL ASLFHSFYTS CRVLADDPEV RKARLVLVEA
TRITLRNVLH LLGVTAPERM