Gene Acid345_2032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2032 
Symbol 
ID4073201 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2434202 
End bp2436145 
Gene Length1944 bp 
Protein Length647 aa 
Translation table11 
GC content59% 
IMG OID637984046 
Productarginyl-tRNA synthetase 
Protein accessionYP_591107 
Protein GI94969059 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0018] Arginyl-tRNA synthetase 
TIGRFAM ID[TIGR00456] arginyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTATCGGT CTCTCCAGCA GCGCCTCATC TCCGCCATCC AGGCCTTCCT CCGCCAGAAA 
TACGACGTCC ATCTTCCCAC CCTCGTGGTG GACGCTCCAC CCAAGGTCGA GATGGGTGAA
TACGCTCTTC CGTTCTCGTT CGAACTCGCC AAGCGCCTGC GCAAAGCCCC GCGCAAAATA
GCCGAAGAAG TCGCAACCGA ACTGCCGCCC ATCGAAGGCT TCGAGAAGCC TGAGGTCGCA
GGCGCCGGGT ACATTAACTT CCGCCTGAAG CGGGGGGATG CAGCCACCGC GTTAGCAAAA
GGAGAATCCA CGCCGGTCAC CCACGACGGC AAGATTCTGG TCGAGCACAC CTCGATCAAT
CCCAACAAGG CCGCGCACAT CGGCCACCTG CGCAACTCCA TCCTTGGCGA CACCTTCGTC
CGCCTTTTGC GCGCCGCCGG CCGCACCGTC GACATCCAGA ATTACATCGA CAACACTGGC
GTTCAAGTCG CCGACGTCGT CGTCGGCTTC ACCCATATCG AGAACAAGTC GAAGGCCGAA
ATCACTGCGC TCACCGAGCA GCCCAAGTTC GACTACTACT GCTGGGACCT CTACGCCCAC
ACCTCGCAGT GGTACGAGCA GGCCGCGGAG AACAAGAAGA TCCGCCTCGA AGTCCTGCAC
GCCATCGAGC ACGGGGGCAA CGAGCTCTCC GAGATCGCTG AAATCATCTC GCGCGCTGTC
CTCCGTCGCC ATCTCGAGAC CATGGACCGC CTCGGCATTG AATACGATTT CCTGCCGCGC
GAGAGTGAAA TCCTCCGCCT CAACTTCTGG GCACTCGCCT TCGAGCAGCT CAAAGAAAAA
GGTGTCCTCT ACTTCGAAAC CGAAGGTAAG AACAAGGGCT GTTGGGTCAT GACCCGGCCA
GGCCGCGAAC GGGTCGATGG CCAGCCGGAC GAGGACGCTA AGGTCATTGT CCGTTCGAAC
GGGACAGTCG GCTACGTCGG AAAGGACATC GCCTACCATC TTTGGAAGTT CGGCCTGCTC
GGCCGCGACT TCGGCTACAA AAAGTTCTAC CTCTATCCCA ACGGCCAGCA GGTCTGGATC
AGTTGTGATC CCGCCGAGGG GGAGAGCGAC CACCCCCATT TCGGTGGCGT CAGCGAAATT
TACAACGTCA TCGATACCCG CCAGTCCGAT CCTCAGGAGA CGGTCAAGGA AGCCATCCGA
CTCCTCGGCT ACAACGACAA AGCCGACCAC TACACCCACT TCTCCTACGA GATGGTCGCG
CTCACCCCGC GTTGCGCCAT CGATCTCGGC TACGACGTCT CGGAAGATGA TCGTGCCAAG
TCCTATATAG AAGTCAGTGG CCGCAAAGGT TTCGGAGTCA AAGCCGACGA CCTCATCGAC
AAGCTCATCG ACGCCGCGAC CAAAGAAGTC GATTCCCGCC ACCCGGAACT CACCGAGTCC
GAGCGCCGCG AAATTGGCAC CCAAATCGCC ATCGGCGCCC TGCGCTACTT CATGCTCAAG
TACACCAAAG CATCGGTCAT CGCCTTCGAC TTCAAGGAAG CTCTCGCCTT TGAAGGCGAA
ACCGGTCCCT ACGCGCAGTA CGCGGTGGTC CGCGCCACCA ATATTTTTCG CAAAGCCGGC
ATCGCACCCG GAGATGCACT CGCGTACAAC GTCGATTTCA CGAAGCACTT TGCCGAGACT
GCCGAGATAT GGGAAGTCTG GCTCATGGCA GGGAAGACCT CACAGATTCT CGAGCTCTGC
ATCTCGCAAT CCGAGCCCGC CTACGCCGCC AAGCACGCTT TCCAACTTGC GCAACTGTTC
AACAACTTCT ACCACCGCCA CCACATCCTC ACCGAGGAAG ACGAAGGCCG GAAGAAATTC
CTGCTCGCCA CCGCCGCCGT CATGCGCCGC GAACTAATTG CCGTCCTCGC TGCCATGGGC
ATCAGCGTTC CGCCTGTCAT GTAA
 
Protein sequence
MYRSLQQRLI SAIQAFLRQK YDVHLPTLVV DAPPKVEMGE YALPFSFELA KRLRKAPRKI 
AEEVATELPP IEGFEKPEVA GAGYINFRLK RGDAATALAK GESTPVTHDG KILVEHTSIN
PNKAAHIGHL RNSILGDTFV RLLRAAGRTV DIQNYIDNTG VQVADVVVGF THIENKSKAE
ITALTEQPKF DYYCWDLYAH TSQWYEQAAE NKKIRLEVLH AIEHGGNELS EIAEIISRAV
LRRHLETMDR LGIEYDFLPR ESEILRLNFW ALAFEQLKEK GVLYFETEGK NKGCWVMTRP
GRERVDGQPD EDAKVIVRSN GTVGYVGKDI AYHLWKFGLL GRDFGYKKFY LYPNGQQVWI
SCDPAEGESD HPHFGGVSEI YNVIDTRQSD PQETVKEAIR LLGYNDKADH YTHFSYEMVA
LTPRCAIDLG YDVSEDDRAK SYIEVSGRKG FGVKADDLID KLIDAATKEV DSRHPELTES
ERREIGTQIA IGALRYFMLK YTKASVIAFD FKEALAFEGE TGPYAQYAVV RATNIFRKAG
IAPGDALAYN VDFTKHFAET AEIWEVWLMA GKTSQILELC ISQSEPAYAA KHAFQLAQLF
NNFYHRHHIL TEEDEGRKKF LLATAAVMRR ELIAVLAAMG ISVPPVM