Gene Pars_1740 details

Gene Information

Locus tag: Pars_1740
Symbol:
ID: 5054957
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1566150
End bp: 1567406
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 50%
IMG OID: 640469283
Product: tRNA CCA-pyrophosphorylase
Protein accession: YP_001153943
Protein GI: 145591941
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1746] tRNA nucleotidyltransferase (CCA-adding enzyme)
TIGRFAM ID: [TIGR03671] CCA-adding enzyme
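
The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record; 1-based inclusive coordinates are an assumption.
start_bp = 1566150
end_bp = 1567406

# Inclusive interval length in base pairs.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
remainder = gene_length % 3
codons = gene_length // 3

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values match the listed Gene Length (1257 bp) and Protein Length (418 aa).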


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.753345
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.000389887
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
GTGACTCTGG AGGAAGTTCT TAAAGAGGCG GCTAGGTTAG TAACACCAAG CGAGGAGGAG 
GAGAGGAATG TGAAAGAGAT TTCGCAAAGC GTAAAAGAAC TGGTGTCTCA GATAGTTCGC
GAAGAGGGTG TCAACTCGGA GGTTGAGGTG TATGGATCCA GCGCAAGGGG CACGTGGCTA
CCGGGGCAAC GCGATATCGA TGTGTTCGTC GTGATGTTGG ATAGGGAGAG GAGTCCTGAG
GATGTGGTTA GGCTACTTAC TAGGCGTTTC TCAGAGCTTG GCTTGAACTG GACGTTGAGA
TACGCGCAAC ACCCATACGT CACGCTACAG GTAAGAAGCT ACGAGGTCGA TATTGTTCCT
TGTTATAAGA TCCAGCCGGG TGAGAGACCG GTGACCGCCG CAGATAGGTC GCCGCTTCAC
CACAAGTTTC TTGTAGAAAG GTTGAAGCCA GAACAGGCAC TTGAGGTCAG ACTCCTTAAG
CGGTTTCTGC AGACTATAGG GGTGTACGGC GCCGAGGTCA AGGTGGAAGG TTTTTCGGGC
TACCTTTCTG AACTTCTCGT GGCGTACTAC GGCTCTTTTA TAAACGTTCT TAAAGCCGCT
ACTAGTTGGA GACCGTATCG CACCTACATT TCTTTCTACG AGAGTAATGC AAAGTTCAAA
GCGCCTCTTA TCGTGCTGGA TCCGGTTGAT CCGAACAGAA ACGCCGCAGC GGCGGTGTCG
CTGACTTCTA TGTCTATATT TATTTTGGCG GCAAGGAGAT TTTTGAAGAG GCCATCGTTG
TCATATTTCC GCCAAGAGCA AGGCGACTTG GTGGAGGGCG TTAATAGAGT TGAGGTGGTA
TATCCCTATC CCAATGAGCC GCCGGATGTG GTTTGGGGCA GGTTTAAGAG ACTAGGCCGG
GCTTTGGCCT CGTGGCTTAG GGAATGTGGG TTCAGAGTTA TGAGGTGGGG GGTAGAAAGC
GATGAGCGGA CTTACGTCTC CCTCATATAC GTCGTAGAGC AGACCCAACT TCCGCCTTAC
GTCATACACA GGGGGCCGCC TGTATACGAC GAAGCTGTCG ATAAATTTAT TGAGAAATAC
CTCGGTAGTG ACGTAGTAGG CCCCTTCGTA CAAGGAACAA GAGTCTACGT AATTAAGAGG
AGACGGTATA CGGAGATTAC GGAGTGTATA TCCGCGAGGT TAGGCAAGGG AGGTTACAAT
ATAAGGGTTA ACCTGTACAG CGGCGAGCTA ATAAGAAAAA ATCCTTGGAT TACATAA
 
Protein sequence
MTLEEVLKEA ARLVTPSEEE ERNVKEISQS VKELVSQIVR EEGVNSEVEV YGSSARGTWL 
PGQRDIDVFV VMLDRERSPE DVVRLLTRRF SELGLNWTLR YAQHPYVTLQ VRSYEVDIVP
CYKIQPGERP VTAADRSPLH HKFLVERLKP EQALEVRLLK RFLQTIGVYG AEVKVEGFSG
YLSELLVAYY GSFINVLKAA TSWRPYRTYI SFYESNAKFK APLIVLDPVD PNRNAAAAVS
LTSMSIFILA ARRFLKRPSL SYFRQEQGDL VEGVNRVEVV YPYPNEPPDV VWGRFKRLGR
ALASWLRECG FRVMRWGVES DERTYVSLIY VVEQTQLPPY VIHRGPPVYD EAVDKFIEKY
LGSDVVGPFV QGTRVYVIKR RRYTEITECI SARLGKGGYN IRVNLYSGEL IRKNPWIT
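
The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG can act as an initiator codon. The following is a minimal sketch of such a translation, not the database's own method. The function name `translate_cds` is hypothetical, the codon table is built from the standard NCBI ordering (table 11 shares the standard amino-acid assignments), and the start-codon set is the one NCBI lists for table 11. The sketch assumes the sequence is given in the coding orientation.

```python
# Standard codon table in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Initiator codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    If the first codon is a table 11 initiator, it is translated as M.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 18 bases of the gene sequence, with the record's spacing kept.
print(translate_cds("GTGACTCTGG AGGAAGTT"))
```

Passing the full gene sequence from the record to the same function allows a direct comparison with the listed protein sequence.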