Gene Pars_2210 details

Gene Information

Locus tag: Pars_2210
Symbol:
ID: 5054387
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1982493
End bp: 1983539
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 59%
IMG OID: 640469763
Product: RNA 3'-terminal-phosphate cyclase
Protein accession: YP_001154408
Protein GI: 145592406
COG category: [A] RNA processing and modification
COG ID: [COG0430] RNA 3'-terminal phosphate cyclase
TIGRFAM ID: [TIGR03399] RNA 3'-phosphate cyclase
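
The coordinates and lengths in this record can be cross-checked against one another. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information record.
START_BP = 1982493
END_BP = 1983539
GENE_LENGTH_BP = 1047
PROTEIN_LENGTH_AA = 348


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)               # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions, 1047 bp corresponds to 349 codons, one of which is the stop codon, which matches the listed protein length of 348 aa.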


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.098436
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTTGTCC GGATCGACGG CTCCTACGGG GAGGGCGGAG GCCAAATTTT GCGGACGTCA 
ATTGCCTTAT CCGCTCTCTT GGGCAAGCCT GTGGAGATAA TAAACATACG GGCTAAGAGG
GCGAACCCGG GGCTCCAGCC GCAGCACCTC ACCGGCGTGA GGGCCGCCGC CTTGCTTACA
GACGCCGAAG TGGAGGGAGC CGTTAAGGGC TCCACGAGGC TGTTCTTCAA GCCCAGGGAC
ATTAAGTGTG GGTCCTTCGA CATAGACATA GGCACTGCTG GTAGCATCTC CCTAGTTGTC
CAGACCCTCG CCCCGGTTTT GCTGTTCGCC CCGTGCCCCA CCCGAATAGC CATCTCCGGC
GGCACGGACG TGTCGTGGTC GCCTCCCATC GACTATATGC GGTTTGTATT CGCAAAGGTC
TTATCCCTAT TCGGCGCAAG GGTGGAGATA GAGCTTATAA GGCGTGGCCA CTACCCCAAA
GGAGGGGGGA GGGCGGTGCT GAGAGTAGAG CCGGTGAAAA AGCTCTCACC CGTGAGTTTA
GAAGAGTTCG GGAAGGTACT GGAAATACGC GGGATATCCC ATGCCGTAAA TCTCCCGAGC
CACGTGGCGG AGAGGCAGGC TAGGGCGGCC GCCGAAGTCT TGGCAAAGCT GGGGTACAGA
GCCGAGATAT CAACGGAGGT GCGGGCCGAC GGCCTTGGCC CCGGCAGCGG TGTTGTCCTC
TGGGCCTACT CGGAAAGCGG AAGCACCGTA GGCGGGGACT CATTAGGAGA GAAGGGAAAG
CCCGCCGAGG TAGTTGGCCG CGAAGCCGCC GAGAAGCTTG CCGCCGTGCT TAAAACCGGC
GCCACGTTGG ACCCCCACAT GGCCGACATG GCAGTTGTGT ACATGGCACT GGCCGACGGG
AGGAGCAGGC TGAGCACATC AGAAGAGACT ATGCACCTCA AGACAAACAT CTACATCGTG
GAGCAGTTCT TGCCAGTGAA GTTCAAGGTG GAAAAACAGG CGGCAAGATA TGTACTAGAA
GTAGACGGAG TAGGCTACAG CAGATAG
 
Protein sequence
MVVRIDGSYG EGGGQILRTS IALSALLGKP VEIINIRAKR ANPGLQPQHL TGVRAAALLT 
DAEVEGAVKG STRLFFKPRD IKCGSFDIDI GTAGSISLVV QTLAPVLLFA PCPTRIAISG
GTDVSWSPPI DYMRFVFAKV LSLFGARVEI ELIRRGHYPK GGGRAVLRVE PVKKLSPVSL
EEFGKVLEIR GISHAVNLPS HVAERQARAA AEVLAKLGYR AEISTEVRAD GLGPGSGVVL
WAYSESGSTV GGDSLGEKGK PAEVVGREAA EKLAAVLKTG ATLDPHMADM AVVYMALADG
RSRLSTSEET MHLKTNIYIV EQFLPVKFKV EKQAARYVLE VDGVGYSR
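
The protein sequence can be derived from the gene sequence with translation table 11, under which the leading GTG is read as methionine when it acts as the start codon. The sketch below is illustrative only: `translate` and `START_CODONS` are hypothetical names, the start-codon set is a subset chosen for this example, and the codon assignments are those of the standard code, which table 11 shares for elongation.

```python
# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: a subset of the initiator codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna, first_is_start=True):
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        # An initiator codon in first position is read as methionine.
        if pos == 0 and first_is_start and codon in START_CODONS:
            residue = "M"
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First seven codons of the gene sequence.
print(translate("GTGGTTGTCC GGATCGACGG C"))  # MVVRIDG
```

Passing the full gene sequence to `translate` in the same way should give a string that can be compared against the protein sequence listed in this record.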