Gene Pars_1080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1080 
Symbol 
ID5055321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp966947 
End bp968281 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content64% 
IMG OID640468636 
Producthypothetical protein 
Protein accessionYP_001153310 
Protein GI145591308 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1530] Ribonucleases G and E 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.108156 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTACAGGG CAAGGATTAG GGGGATTTAC GCCACTGCCT TGACTAAGCT GGCGCTGGAC 
TGGGGCTTCA AGGTGGTGCA ACCGACAGAG AAGATTGCCC GCCGCTTCGG CCTAGAGCCC
GACTTCTCGC CGCCCGACAT CACCGTGAAG GACCACGAGT CTAAGACGGG GATTGTGGCG
ATGGGCCTAT GCGAGGCGGT TGAGGCCTTT CTCTCAAAGC TTACGGAGTA CGCCGACCCC
ATCGTGGCGA GGGCCAGGGC CCGGCTTAAG GAGGTTTTCG TGGGCAGGGC AGTGGGGGAG
GCGACTGTGG AGGGGCCAGG CGGGGAGGTC TTTGACGTGC CCCGCCGCTA TGTTTTGACC
CCCGGCGCAA CGGGCATCTA CACAGTGGTG AGGCCGCCCA TCGGCCCCCT CAAGGGCGTG
GCGGCCCCCG AGATCGTGGT GGAGGGACAG TACGTCGAGC TCAACACCAC GGGCCGCGTC
TCCTACAGCG AGCACATACC CGCCGAGGAG GCGGTCCGGC TTAGGATCCT CGCCGAGACG
AGGCTCAGGC AGTACGCCTC GATAGGCCTT AGGTTTAAGT CCTCCGCCCG CTATGCTCCG
GATGACGCCA TCGCCGCGGA GGCCGAGGCG CTTTATAAGG AGATGCTTGA AATCTCAAAG
GGCGGCTCCC CGGGTCAGGT GCTTAGGCGG GGGAAGTGCT TTGCGGTAGT CCTCTTCGAC
TCTGCGTCGA AGGCTAGGCT CGACGAGGCG AGGGCCGCCG TTGTGCCCAC CGTGAGGGGC
CACCACGCAC TTAGGGCGCA GGGCCTTGGG AAGTGCCTAG ACCTCCTCGA CCACGTCGGC
GGCGACGTCT ACGAGAAAGC CGCCGAGTTT TTGGCGGGAG AGGCGGCGGC GGTGTACCAC
GTAAAGCCGT GGGGCGAGGT GGTGAAGATG CGGGCTGAGC CCGTCGGGGT TAGGGGCGGC
GTCTTGGTGC TGAGGAGGCG GCTTAGGCCA GGCGGCGTGT TGGACGGCAT CGGCGTCAAG
ATAGAGAGGG GGTTCTACGC CTTGACGTGC GTCCCACGGG GCAAGGGCTA CGTCGTACAC
ACCTACTACA CAGCAGAGGG GAAAGCCGTG GGGACGTACG TAAACGCCAA CACGGTGCCC
GAGTGGGGCC GCCGCGTTAT CTACATCGAC CTATTGGTGG ACAAGGCCTT CGACGGGGGA
GGAGAGAGGG TGCTTGACCT GGATGAGTAC GAAAAATACG CCGAGATGTT CCCACAGAGG
CTGAGGGACC CCCTCAGCAG ACTGCCCAAG ACGCCCATAT GGTGCACCGA GGAGGGCATA
AAGACGGTCG CCTAG
 
Protein sequence
MYRARIRGIY ATALTKLALD WGFKVVQPTE KIARRFGLEP DFSPPDITVK DHESKTGIVA 
MGLCEAVEAF LSKLTEYADP IVARARARLK EVFVGRAVGE ATVEGPGGEV FDVPRRYVLT
PGATGIYTVV RPPIGPLKGV AAPEIVVEGQ YVELNTTGRV SYSEHIPAEE AVRLRILAET
RLRQYASIGL RFKSSARYAP DDAIAAEAEA LYKEMLEISK GGSPGQVLRR GKCFAVVLFD
SASKARLDEA RAAVVPTVRG HHALRAQGLG KCLDLLDHVG GDVYEKAAEF LAGEAAAVYH
VKPWGEVVKM RAEPVGVRGG VLVLRRRLRP GGVLDGIGVK IERGFYALTC VPRGKGYVVH
TYYTAEGKAV GTYVNANTVP EWGRRVIYID LLVDKAFDGG GERVLDLDEY EKYAEMFPQR
LRDPLSRLPK TPIWCTEEGI KTVA