Gene Pars_1379 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1379 
Symbol 
ID5054697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1240776 
End bp1242602 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content61% 
IMG OID640468924 
Productglutamyl-tRNA(Gln) amidotransferase subunit E 
Protein accessionYP_001153593 
Protein GI145591591 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2511] Archaeal Glu-tRNAGln amidotransferase subunit E (contains GAD domain) 
TIGRFAM ID[TIGR00134] glutamyl-tRNA(Gln) amidotransferase, subunit E 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00195959 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0032704 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGGACTACA AGGCGCTTGG CTTGAAAACC GGACTTGAAA TCCATATACA GCTCAACACG 
AGGCGCAAGC TCTTCTGCCA CTGCCCCCCG GTATTGAGAG ACGACGAGCC CCACTTCAGA
GTAGAGAGGA GGTTGCACAT ATCTGTCAGC GAGCTGGGGG CGGTTGACCC GGCGGTTGTG
TGGGAGGTGA GGAAGCGGAG GAAGTACATA TACGAGGGGT ACAGGGACAC CACCTGCCTC
GTGGAGCTTG ACGAGGAGCC GCCCCACATG CCGGACGAGG AGGCCTTGAC GACGGCGGTG
GCCGTGGCTA AGATGTTCAA CGCCAAGCTC TTTGACGAGA TCTACGTGAT GAGGAAGACG
GTGGTGGACG GCTCCAACGT GTCGGGCTTC CAGCGCACGA TGCTCGTGGC GTATGGCGGG
AGGGCCAAGA TCCTGGGCTA CGACATCGGG GTGGAGACCA TAGCCCTCGA GGAGGACGCG
GCGAGGAAGA TGGGAGAGGA GGGCAAAGCT GTGGTGTACC GCCTGGACAG GCTGGGGATC
CCCCTCATCG AGATCGCCAC GGAGCCCATG ACCTACGCGC CACAGCAGGT GGAGGAGGTG
GCGTGGATTA TAGGCTACAG CGTGAAGATA ACGGGGAGGG CCAAGAGGGG CGTGGGCACA
GTGAGGCAAG ACGTCAACGT CTCCATCGCG GGCGGCGCCA AGACTGAGAT AAAGGGCGTC
CCCGACTTGT CCCTAATCCC CAAGGTTATC GAGTACGAGG CGACGCGCCA GCTCAGCCTG
TTGAAAATAG CAGAGGAATT GAAGAGACGC GGCGTGGAGA AGGTGGAGCT CTCCCTCGCC
GACGTCACCC AGGCCTTTGC CAACACCAAG TCTAAGCTTG TGAGGCGGGT GCTAGACGCC
GGGGGGAAGG TGGTGGCGGT GAAGGCCCCC GGCTTCAATA AGCTCCTAGG CGCGGAGGTC
CAGCCGGGGA GGAGGTTCGG CACTGAGCTG GCGGACTATG TGAGGGCTTG GACTGAGCTG
GGGGGCCTCC TACACAGCGA CGAGCTCCCG GGTTACGGCA TTACAGCAGA CGAGGTAAGG
GACGTGGAGG CGAGGGTGGG GGTTAACAGC TTCATCTTGC TCATGGGCGT CGACGAGGGG
GAGCTGGAGG AGGCGGCGAG GGTGGTTGTG GAGAGGCTCA ACGCGGCGCC TAGGGGGGTG
CCCGAGGAGA CCCGGGCCGC CAACCCCGAC GGCACTACGA GGTTTCTCAG GCCTAGGCCC
GGCGCGGCTA GGATGTACCC CGAGACAGAC CTCCCGCCGG TAAGGATTAC TTTTGAGATC
TTGAAGAAGG CCGAGGAGGT GGCCAAAGTC ACCCTTGAGG GCAAGCTCAA GGAGCTCACG
TCGAGGGGGC TGAGCAGGGA CTTGGCGCTT CAGCTGGTGA AGTCTCCACA CCTGGAGAAG
TTTGAGGACT ACCTCCAGAG GTTTAAGGAG GTGCCGCCCC AGCAAATAGC CGCGGTTCTA
CTCAACATCT CCAAGGCCTT GGCGAGGGAG GGCGTCGAGA TCACCGACGA GAAGGTGGAG
TCTGTTCTCG ACGCTTTGAA TAGGAAAGTC ATAACCAAGG AGGCTGTGGA GGAGGTCCTC
AGGAACATGA AGCCGGGGGA GTCGGCCGAG GAAGCGGCTA AGAGGCTGGG GCTGTTGAGA
ATGTCCTACG ACGAGGTGAA GAAAATCGTG GCCGAGGTGG CGGCCCAGGT GGGGAAGGAG
AAGGCGGTGG GCGAGGTGAT GAGGCGCTAC AGGGGAAAGG TGGATGTGGA GGACGTAAGA
CGGGCGCTGG CCGAGATATA TTTATAA
 
Protein sequence
MDYKALGLKT GLEIHIQLNT RRKLFCHCPP VLRDDEPHFR VERRLHISVS ELGAVDPAVV 
WEVRKRRKYI YEGYRDTTCL VELDEEPPHM PDEEALTTAV AVAKMFNAKL FDEIYVMRKT
VVDGSNVSGF QRTMLVAYGG RAKILGYDIG VETIALEEDA ARKMGEEGKA VVYRLDRLGI
PLIEIATEPM TYAPQQVEEV AWIIGYSVKI TGRAKRGVGT VRQDVNVSIA GGAKTEIKGV
PDLSLIPKVI EYEATRQLSL LKIAEELKRR GVEKVELSLA DVTQAFANTK SKLVRRVLDA
GGKVVAVKAP GFNKLLGAEV QPGRRFGTEL ADYVRAWTEL GGLLHSDELP GYGITADEVR
DVEARVGVNS FILLMGVDEG ELEEAARVVV ERLNAAPRGV PEETRAANPD GTTRFLRPRP
GAARMYPETD LPPVRITFEI LKKAEEVAKV TLEGKLKELT SRGLSRDLAL QLVKSPHLEK
FEDYLQRFKE VPPQQIAAVL LNISKALARE GVEITDEKVE SVLDALNRKV ITKEAVEEVL
RNMKPGESAE EAAKRLGLLR MSYDEVKKIV AEVAAQVGKE KAVGEVMRRY RGKVDVEDVR
RALAEIYL