Gene Pars_1821 details


Gene Information

Locus tag: Pars_1821
Symbol:
ID: 5056075
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1633283
End bp: 1635040
Gene Length: 1758 bp
Protein Length: 585 aa
Translation table: 11
GC content: 58%
IMG OID: 640469367
Product: glycyl-tRNA synthetase
Protein accession: YP_001154024
Protein GI: 145592022
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0423] Glycyl-tRNA synthetase (class II)
TIGRFAM ID: [TIGR00389] glycyl-tRNA synthetase, dimeric type
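
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short Python sketch. It assumes that the coordinates are 1-based and inclusive, and that the gene ends in a single stop codon that is not counted in the protein length; neither assumption is stated in the record itself.

```python
# Values copied from the record above.
start_bp = 1633283
end_bp = 1635040

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one terminal stop codon, not counted in the protein length.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1758 585
```

Under these assumptions the computed values agree with the reported gene length (1758 bp) and protein length (585 aa).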


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.651121
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCGAG CTGAGCTACT GGAAGAAATT ATAAAACGCC GCTTGCTCTA CTGGCCCTCA 
TCTGAGATTT ACGGCGGCGT GGGCGGCTTC TACGACTATG GCCCGCTGGG GGTTCAGCTG
AGGCGGAACA TAGTGGAGAA GTGGCGCCGA ACCTTCGTCT TGCCCTTCCA AGACCTCATA
ATTGAGGTGG AGACGCCCAT AATTATGCCG GAGCCCGTCT TCAAGGCGTC GGGCCACCTT
GACCACTTCA CCGACTACGT GGTGGGGTGC ACCAAGTGCG GGAGGAAATA CAGAGCCGAC
CACCTTGTGG AGGAGGAGCT GGCCAAGAGG GGCCTCAAGA TATCCACAGA GGGTCTCTCG
GCGGCTGAGC TGGAGCGCTT AATAGTGGAG CACAGAATTG TCTGCCCCAA CTGCGGCGGC
CCTCTCGGGA GGGTTGAGTC TTTTAACCTC CTCTTCAAGA CGACGATTGG GCCCTACAGC
GAAAACGCAG GCTATCTAAG ACCCGAAACG GCTCAGGGAA TATTCGTAGC TTTTCCCCGC
CTCGCGGAGT ACGTGGGGCG ACGCCACCCC TTCGGCGTTG CGCAGATAGG GAGGGTGGCG
CGCAACGAGA TCTCGCCTCG GGGCGGACTC ATGAGGCTGA GGGAATTCAC ACAGATGGAG
ATAGAGCTCT TCTTCGACCC TCAGAACCCC AAGTGTCCCT ACTTCGCCGA GGTGGAGGGG
CTTGAAATCC CTATCGTGCC GGAGGAGTTC GTGGCTAAGG GTCAGACAGA GCCCCTTTTC
CTAACGGCGA GGGAAGTAGC GGCGAGGGGA TACGCAAACG AGTGGATGGC CTTCTTCATG
GCCCTAGCCG CCAAGTTCCT CAAAGAGCTG GGAGTTCCCC TGGAGAGGCA GAAGTTCTTG
GGTAAGCTCC CACACGAGAG GGCCCACTAC TCGGCTAAGT CCTACGACCA GATGGTGTTG
ACAGAGCGCT TCGGCTGGGT GGAGGTATCG GGCCACGCCT ACCGCACCGA CTACGACCTC
TCCGGCCATA TGAGGCACAG CGGCCGTGAG ATGTACCTAG AGAGGCGTCT TCCAGCCCCT
AAGGAGGTAG AAGTGGTGAG GATCTATCCC AACCCCAACG CCATAAGGGA GAAGTACGGG
GATAGAATAG GAGAGGTCAT AAAGGCAATA AAAGAAAACG AGGCGTATGT AGCTAAGACA
TTCGGCGAGG GGAAGCAAGA GGTGACCGTT GGCGAGTACA TCGTAACTCG TGACATGGTA
TTTATAAAGA CAGAGAAGCG CAAAACCGAC CTCGAGAAGT TTATCCCGCA TGTGGTGGAG
CCTTCTTTTG GCCTCGATAG GATTATGTAC GCGGTTTTGG AATACGCAGT GGCGGAAGAG
GGTGGGCGCG TCTACTTGAG GCTCCCGGCA GACGTCGCGC CTATCAACGT GTGTATCCTG
CCCATTGTCA AAAGGCAGGA CTACGTGGAG ATAGGCCGGT CTCTGCGGAA GGAACTGGCG
ACGGAGGGCT TTTTAGCATG GTACGACGAC GAGGGAACAA TCGGCAGTCG ATACGCGGCG
TGTGACGAAA TCGGTGTGCC GCTTGCGGTG ACGATTGACG AGAGGACCCC AACAGACGGC
ACCGTCACCA TACGCGACAG GGATACCCGC AAGCAGGTGC GGATAGGGCT GAGGGACGTG
GCGAAATTCT TGGCGGCGGT TAGGAAAGGC ACCTCTTTTG ACGAGGCGGC TAAGGCCCTC
GGCGCCACGC CTGTCTAA
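
The GC content reported for this gene can be recomputed from the sequence above. The following is a minimal Python sketch; the function name `gc_fraction` is hypothetical, and it assumes the sequence is pasted as plain text with the 10-base blocks separated by whitespace, as shown here.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring all whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Example on the first 10-base block of the gene sequence only.
print(gc_fraction("ATGCGCCGAG"))  # 0.7
```

Passing the full gene sequence to `gc_fraction` and multiplying by 100 gives a percentage that can be compared with the GC content field of the record.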
 
Protein sequence
MRRAELLEEI IKRRLLYWPS SEIYGGVGGF YDYGPLGVQL RRNIVEKWRR TFVLPFQDLI 
IEVETPIIMP EPVFKASGHL DHFTDYVVGC TKCGRKYRAD HLVEEELAKR GLKISTEGLS
AAELERLIVE HRIVCPNCGG PLGRVESFNL LFKTTIGPYS ENAGYLRPET AQGIFVAFPR
LAEYVGRRHP FGVAQIGRVA RNEISPRGGL MRLREFTQME IELFFDPQNP KCPYFAEVEG
LEIPIVPEEF VAKGQTEPLF LTAREVAARG YANEWMAFFM ALAAKFLKEL GVPLERQKFL
GKLPHERAHY SAKSYDQMVL TERFGWVEVS GHAYRTDYDL SGHMRHSGRE MYLERRLPAP
KEVEVVRIYP NPNAIREKYG DRIGEVIKAI KENEAYVAKT FGEGKQEVTV GEYIVTRDMV
FIKTEKRKTD LEKFIPHVVE PSFGLDRIMY AVLEYAVAEE GGRVYLRLPA DVAPINVCIL
PIVKRQDYVE IGRSLRKELA TEGFLAWYDD EGTIGSRYAA CDEIGVPLAV TIDERTPTDG
TVTIRDRDTR KQVRIGLRDV AKFLAAVRKG TSFDEAAKAL GATPV
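
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is one way to do this in plain Python; the names `CODON_TABLE` and `translate` are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted. Since this gene begins with ATG, no special handling of alternative start codons is included.

```python
# Codon table in the conventional T, C, A, G ordering.
# Table 11 shares these codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGCGCCGAG CTGAGCTACT GGAAGAAATT"))  # MRRAELLEEI
print(translate("GGCGCCACGC CTGTCTAA"))  # GATPV
```

The two fragments reproduce the first ten and last five residues of the protein sequence listed above; the same function can be applied to the full gene sequence for a complete comparison.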