Gene Pars_0578 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0578 
Symbol 
ID5056043 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp516565 
End bp517659 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content61% 
IMG OID640468139 
Productparallel beta-helix repeat-containing protein 
Protein accessionYP_001152824 
Protein GI145590822 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3420] Nitrous oxidase accessory protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.56022 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTTATCA ACACGCCGGG TTATTACAGA GGCGTCGTGG CCGATAGGCT TGTAATCAAC 
GCCAGCGGCG TCTGGCTGGA GGACGTGACG GTGAGGGGGG GCTATAGGGA GGTGCCGCTC
CCCCAGACTA TGACGGCGTA TAGGGTTAAG CCTGCGGTGG GCTGTCTAGT GGTGGCGGGA
CGCAACGTGA CCATCACAAA CCTGAGGCTG GACTGCGCGG CGGGTCTGCT AATCTTTAAC
TCATCCGGCG TCGTCGTCCG CGGATTCGAG GCGCGGGGGC CTGTGGAGAT GGCCGTGTAC
AAAAGGGGAC TAGGCATCTA CGTATACAAC TCCACAGACG TGAGAGTAGA GGGTGGCAGT
CTGTACGGCT TCCACGACTG CGTCTACGTG GAGTATTCCC GCAACGTCTT TCTCGACGGT
TTGAGAGCCC AGGGCTGTAG GTACGGCGTA CATGTCATGT TCAGCGAGGG CGTCTCCATC
AAGAACTCTA TGGTCTCTGA CAGCTATGTG GGCTTCGCGG TTATGTACAC GAAAAACGCC
TCTGTGGTCA ACGCCTCGGC GGTTGGCAAC AGGGCGTGGG CTGAGGGCTA CGGCATCTTG
CTGGCGGAGC TCAGCGGCGT TGTGAGAGGT TGTAAGGCTG TGGACAACGT CCACGGCATC
TACGTCTTGT ACTGGGGGGG CACCAGGGTG TTGGTGGAGG GCTGCGTCAT ATCCGGCAAC
TACTTCGGCA TCACGCTGAG GGGGAGGAAC GCCACCGGCG TGGAGTTCGT AGGTAATGTG
ATTCGAGGCA ACGTGGTTGA GGTGGATCAC ATGGGGGTGG GGGAGGAGGC CCCCGCCGCC
TTGTTTAGGG GCAATCTCTG GGGCGGCCAC GCCTCGCCGT CCCCCTACTA CTACGCCAGC
GCCTTCTCTG ACTTGATGAC CGCGACGGAG GGGGCGCTTG CATATTTAGC CGCATCCCCG
GCCCGCTTCG TGATCGACGC CGCCATGGGG AGGCCAATTG CCTACGACCC GGCGCCTAGG
CCGGATGAGA GGGCCCCGCC GTATTTGTTG CTTTTGGCCC TCCTCCTGGT GCCGCTGGTA
TGGAAGTCGA GGTGA
 
Protein sequence
MVINTPGYYR GVVADRLVIN ASGVWLEDVT VRGGYREVPL PQTMTAYRVK PAVGCLVVAG 
RNVTITNLRL DCAAGLLIFN SSGVVVRGFE ARGPVEMAVY KRGLGIYVYN STDVRVEGGS
LYGFHDCVYV EYSRNVFLDG LRAQGCRYGV HVMFSEGVSI KNSMVSDSYV GFAVMYTKNA
SVVNASAVGN RAWAEGYGIL LAELSGVVRG CKAVDNVHGI YVLYWGGTRV LVEGCVISGN
YFGITLRGRN ATGVEFVGNV IRGNVVEVDH MGVGEEAPAA LFRGNLWGGH ASPSPYYYAS
AFSDLMTATE GALAYLAASP ARFVIDAAMG RPIAYDPAPR PDERAPPYLL LLALLLVPLV
WKSR