Gene Pars_1167 details


Gene Information

Locus tag: Pars_1167
Symbol:
ID: 5054906
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1054060
End bp: 1055379
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 59%
IMG OID: 640468717
Product: hypothetical protein
Protein accession: YP_001153390
Protein GI: 145591388
COG category:
COG ID:
TIGRFAM ID:
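
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the record states neither convention explicitly, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1054060
END_BP = 1055379

def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1

def coding_codons(length_bp):
    """Number of amino-acid codons in a CDS, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1

length = gene_length(START_BP, END_BP)
print(length)                 # 1320, matches "Gene Length"
print(coding_codons(length))  # 439, matches "Protein Length"
```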


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.018342
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGGTCC CAATATCTAC AATTATATTG TTGGCGGCTC TTGTAGCCGC CCAACAAATA 
TATGTAAATG TCTTGGCAGA CCTCTCCCAC GGCGAGGCGG AGAAGGGCCT CGACTTCTGG
GTTAACTCCA CGACAAACCC CCTAGCGATA TCGGACTTCG CTAAGCTGTA CATCTTGTTG
CCCCCCGGCG CCAAGCCGGG CCCAGTCGTC GCCAAACTAA ACGCCACGAA GTCGGCAGTC
CTCCTCACAG GCGACCTATC CACAGTAGAC CTGAACCAGT TTAAGGTAAT TGTGCTGGGC
CAGCCGACGA AGCCTCTGTC AGACGCCGAG TTGGCTGCTG TTAAGAAGTG GCTCGACGGA
GGCGGGAGGG TGCTGTGGTG CGCCGCCGAC TCCGACTACC CGGCTCAAGG CTCCGAGGAG
TCGCAAGTGG CGTGCAACGA CATCGCCGAA TACCTCGGCG CCCACATCCG CGTCGACTAC
GTGTCTGTAG AAGACTCCCA ACACAACGCC GGGGCTGGCT ACAGAGTTGT CGGCGTGGTC
GACCCGCCGC CTCAGCTCTC CTTCCTCGGC TTCATGGCCC AGAGAGTGCT CTTCCATGGC
CCAGGCGCTG TTGCTGTCGT CCTCCCTGAC GGGAAGTGGA TCCCCGCCAC GAGCCCCGAG
GTGCAGAAGT ACTACAATAA CGTATTCGTA ATTGTACATA CAACCCCCAC CGGCACCATT
GTGGAGAGCA GAACCTCAGC CGACGGAAAG GGGAGAGATG GAAAGGCCCA CACGGCGGGA
GACAAGGGCG TGTTTGCCCT TATGGCTCTT GAGCTCCTGC CCAGCGGTAG CGTCTTGATA
CTATCCGGTG AAACTCCATA CGGTGGCTAC GAGCCCATGC TGGCCCCTGT GTACTACAGA
GTGCCCCTAG ACGGGCCGAG GTTTTTCAGA AACTTGATGC TTTGGGCGAC TGGGAACTAC
CGCGAGCTCA CCACCATGAT CCAGCAAACG CAACTCTTAT CTCAGTTGCA ACAGGGAATT
GAAGCCGTTA GAGGCGACGT GTCTGCCCTA AAAGGCGGCG TCTCGGCCGT GCAGAACGAC
TTGGCTTCCC TCAAGAACTC GGTGTCCCAG CTGGGCAACC AGATCAGCGC TGTTTCTCAG
AACGTCGCAC CGGTAAAAGA CCAGGTAGCC GCCCTCAACC AAAAGGTGGA CCAGCTGACG
CAACAGCTAA ACGCCGCTAT GGCCGAGGCC AACAACGCCA GGACGGTGGC CTTTGTCGGG
ACGGCGTTGG CCCTTATATT CGCCATAGCC GCCGCCTTCC TTGCTGTGAG GAGGAAATGA
 
Protein sequence
MRVPISTIIL LAALVAAQQI YVNVLADLSH GEAEKGLDFW VNSTTNPLAI SDFAKLYILL 
PPGAKPGPVV AKLNATKSAV LLTGDLSTVD LNQFKVIVLG QPTKPLSDAE LAAVKKWLDG
GGRVLWCAAD SDYPAQGSEE SQVACNDIAE YLGAHIRVDY VSVEDSQHNA GAGYRVVGVV
DPPPQLSFLG FMAQRVLFHG PGAVAVVLPD GKWIPATSPE VQKYYNNVFV IVHTTPTGTI
VESRTSADGK GRDGKAHTAG DKGVFALMAL ELLPSGSVLI LSGETPYGGY EPMLAPVYYR
VPLDGPRFFR NLMLWATGNY RELTTMIQQT QLLSQLQQGI EAVRGDVSAL KGGVSAVQND
LASLKNSVSQ LGNQISAVSQ NVAPVKDQVA ALNQKVDQLT QQLNAAMAEA NNARTVAFVG
TALALIFAIA AAFLAVRRK
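
The gene and protein sequences above are printed in space-separated blocks of ten. A minimal sketch for joining those blocks, computing a GC fraction, and translating the DNA is given below. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (it differs in which codons may act as starts), and all helper names are hypothetical rather than part of any database tool.

```python
from itertools import product

# Codon table in T, C, A, G order; amino-acid assignments of the standard code,
# assumed here to apply to translation table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean_sequence(text):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()

def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)

def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First line of the gene sequence, copied from the record.
first_line = "ATGCGGGTCC CAATATCTAC AATTATATTG TTGGCGGCTC TTGTAGCCGC CCAACAAATA"
dna = clean_sequence(first_line)
print(translate(dna))  # MRVPISTIILLAALVAAQQI
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` and `translate` gives values that can be compared against the GC content and protein sequence listed in the record.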