Gene Pars_1993 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1993 
SymbolileS 
ID5054987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1783080 
End bp1786022 
Gene Length2943 bp 
Protein Length980 aa 
Translation table11 
GC content59% 
IMG OID640469540 
Productisoleucyl-tRNA synthetase 
Protein accessionYP_001154192 
Protein GI145592190 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0060] Isoleucyl-tRNA synthetase 
TIGRFAM ID[TIGR00392] isoleucyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTATG GGGATTTTAA GCTGTCCCCG AGCTACAACT CACACGCAGT AGAAAAGGCA 
GTCCAGGAGT TCTGGGACCG TAACAAGATT TTTGACAAAT GGAAGTCGTG GCGCGGGAGG
CCGGTATACG CCTTCTTGGA GGGACCGCCG ACGACGAACG GCATGCCGCA CGTCGGCCAC
ATAAGGGGGC GCACCTATAA GGACGTTGTG CTACGCTTCC ACCGGCTTCT GGGCTACGAC
GTCTGGGTTC AGGGAGGCTG GGATATGCAG GGGATGCCTG TGGAGTGGGA AGTGGAGAAG
AAGCTGAAAC TTAGGAGCAA GAAAGACGTC GAGCAGTACG GGCTTGAGAA GTTCGCGTTG
GAGTGCAACG CCCTGGTTGA GGAGTATCTC CAGTACTGGC GGGAGTGGGG CACGAGAAGG
CTGGGGCTGT GGCTTGACTT AGAGAACGCC TACGAGACGA GACGGCCCCG GTATCTAGAG
TACGCGTGGC GTGTTGTGAA GAGGGCGCAC GAGGCGGGCC TCCTTGCCGA GGACTACCGG
GTCTTGTGGT TCTGCCCCCG CTGTGAGACG TCGCTGTCCG ACCACGAGGT GGCTCTAGGC
TACGAGGAGA GGGAGGATCC CTCGATATAT GTAAAATTCA GAGTAGAGGG AAGTGATAAC
GAATACCTTG TAATCTGGAC AACGACGCCT TGGACTATCG TGGACAATGA GGCCGTAGCA
GCGCACCCGG ACTACGCCTA CGCCAAGGTG GAGGTCGTCG TGGGGGAGAG GCGGGAGTAT
TGGTGGCTTG CCGAGAGGCT AGTTCCCCAA CTGATGCAAT TATTCGGCGT TAAGGAGTGG
CGGGTTATAG AGACGAAGAT GGGAGAGGAA CTCGCCGGGC TGAGGTATGT CCACCCGCTG
GCTGAGGAGG TGCCGGAGAG GGCTTCGAGA GAGCACAGAG TGGTCACTGC AGAGTTCGTC
ACGCTGGAGC AGGGCACAGG CCTTGTCCAC ATCGCGCCTG GCCATGGCCC TGAGGACTTC
GAGCTGGCCA AGAGGTACGG CATCAGGGTT ACAAACAGCG TGGAGGTAAA CGGCGTCTAC
AACGAGCTGG GGGGCAAGTA CAGGGGCAAA CACGTCTACG ACGTGGACAA GGAGGTTGTC
AAGGATCTCA AGGCCAAGGG CCTTCTGGTC TACGAGGGGA AGATACGCCA CGAATACCCC
CACTGCTGGC GTTGCGGGTC GAAGCTGATA CTCCGCGCCG ATAGGCAGTG GTTCATAACC
ATTTCACGCA TCCGCGACAA GATGTACTCA GAGCTCCAGA AAGTCAACGT GGTGCCGCAG
AAGCTTAGGG ACAGGTTTGA CGTCTTTGTG CAAAACGCCA GGGATTGGAA TATAAGCAGG
AGCAGGGTGT GGGGCACGCC CCTCCCCGTG TGGAGATGCA AGAAGGACGG CCGCATACTG
GTAGTGGGCT CCCTCGACGA GCTGAAGAAG TTGGCCAAGG AAGTGCCCCC GGTGGACGAC
TTCAAGCTAG TCCACCGGCC GTGGATCGAC CAGGTTAAGA TCGCCACCGA GGACTGCGAC
GAGTGGGTGC GGGAGCCCTA CGTGATGGAT GTGTGGCTTG ACAGCGGCGT GGCGTGGCTC
GCCGCAGTGG ACGGGGAGAG GAACAGAGAA CTCTGGGGGA AGCTGTTCCC CTACGACTTC
GTCACAGAGG GGATAGACCA GACGAGAGGG TGGTTCTACT CCCTCCTCGC CACTTCTGTC
CTCTACACAG GAAGGGCGCC CTACAAGACC GTGCTGATCC AGGGCCTAAT CCTCGATAAG
CACGGCCAGA AGATGTCCAA GAGCAAGGGC AACGTAGTGT GGGCCAAGGA CCTCTTCGAG
AAGTACGGCG CCGACCCCGT TAGGCTGTAT ATCTTGTCTA AGGCGGCTCC TTGGGAAGAC
CTCTCCTTTG ACCCAGACGA GGTAAGGCAC GTGCTGGCCG ACCTCGGCAT TTTGTGGAAC
GTCGCGAGGT TCGCTGACAC CTATATGTCC CTCGACGGAT TTGACGCCGA GAGGTACCCG
CTGGAGGAGT GGCTGGGGAG GGGGCTCGAA GAGGATAGGT GGGTCCTCTC GGAGCTCAAC
TCCCTCGTTG CAGAGTTCGC CTCGTACCTC AAGAACTTCG AATTCCACAA AGCGGTAGCC
ATGTGGAGGG ACTTCGTGGT GGAGACCCTC AGCCACCGCT ACCTCAGGCT TTTGAGGAGG
CGCGTCTGGA GCGACGAACC CACCCCCGAC AAATACGCCG CCTACGCGGT CCTCCACCAC
GTGATAAAGA CCGTGTTGAT CCTCGCCTCC GTGTTTACGC CCTTCGTGGC GGAGTACCTA
TGGCAGGCCT ACGTGAGGAA GTACGAGAAG TCCGCGCCGG AGTCCGTCCA CCTGGCCCAG
TACCCCCAGG CGGGGCCCGT CGACAAGGAG CTCGTAGAGG CCTACCGCGA GCTGTTCGCC
GCCTTCTCTG CGCTGGCCGA GGCTAGGAAC AAGGCCGGGA TAAAGCTGAG GTGGCCCGTG
AGGGTGGCGT ATATAAGCGG CGCCAAGCAC GCGGGGCGCT ACGCCGAGCT GTTGAAATAC
CTGGGCAACG TGAAAGAGGT AAAGATAGGG CCGTGCCCCG AGGGCTACGT GAAGGCCGCA
GAGGGCCAAT TGGAGGCTTG CATACCCCCA AAGCTTGAGC CCGAGCTCTA CTACGAGGCC
CTCGCCAGGG AGATAGTGAG GAGGATCCAG GTGATGAGGA AAGAGGCCGG CCTTGAGATA
AGCGATAGCA TTAAAGTAGT GGTGGAAACA AACTCAGAAG ATGTAAAAAA CGCCGTTGAA
CATTATAGAG ATTACATCGC CAGGGAGACA CGCGCTGTTG ACCTACTCAT CGGGCAGGCA
ACGGGCGGAC GTGAGTGGGA CATCTCAGGC GAAAGGGTAC GTATTGAGAT AAAGAAGGCC
TAG
 
Protein sequence
MSYGDFKLSP SYNSHAVEKA VQEFWDRNKI FDKWKSWRGR PVYAFLEGPP TTNGMPHVGH 
IRGRTYKDVV LRFHRLLGYD VWVQGGWDMQ GMPVEWEVEK KLKLRSKKDV EQYGLEKFAL
ECNALVEEYL QYWREWGTRR LGLWLDLENA YETRRPRYLE YAWRVVKRAH EAGLLAEDYR
VLWFCPRCET SLSDHEVALG YEEREDPSIY VKFRVEGSDN EYLVIWTTTP WTIVDNEAVA
AHPDYAYAKV EVVVGERREY WWLAERLVPQ LMQLFGVKEW RVIETKMGEE LAGLRYVHPL
AEEVPERASR EHRVVTAEFV TLEQGTGLVH IAPGHGPEDF ELAKRYGIRV TNSVEVNGVY
NELGGKYRGK HVYDVDKEVV KDLKAKGLLV YEGKIRHEYP HCWRCGSKLI LRADRQWFIT
ISRIRDKMYS ELQKVNVVPQ KLRDRFDVFV QNARDWNISR SRVWGTPLPV WRCKKDGRIL
VVGSLDELKK LAKEVPPVDD FKLVHRPWID QVKIATEDCD EWVREPYVMD VWLDSGVAWL
AAVDGERNRE LWGKLFPYDF VTEGIDQTRG WFYSLLATSV LYTGRAPYKT VLIQGLILDK
HGQKMSKSKG NVVWAKDLFE KYGADPVRLY ILSKAAPWED LSFDPDEVRH VLADLGILWN
VARFADTYMS LDGFDAERYP LEEWLGRGLE EDRWVLSELN SLVAEFASYL KNFEFHKAVA
MWRDFVVETL SHRYLRLLRR RVWSDEPTPD KYAAYAVLHH VIKTVLILAS VFTPFVAEYL
WQAYVRKYEK SAPESVHLAQ YPQAGPVDKE LVEAYRELFA AFSALAEARN KAGIKLRWPV
RVAYISGAKH AGRYAELLKY LGNVKEVKIG PCPEGYVKAA EGQLEACIPP KLEPELYYEA
LAREIVRRIQ VMRKEAGLEI SDSIKVVVET NSEDVKNAVE HYRDYIARET RAVDLLIGQA
TGGREWDISG ERVRIEIKKA