Gene Pars_1394 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1394 
Symbol 
ID5055666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1256194 
End bp1258227 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content58% 
IMG OID640468937 
Producthypothetical protein 
Protein accessionYP_001153606 
Protein GI145591604 
COG category[R] General function prediction only 
COG ID[COG4880] Secreted protein containing C-terminal beta-propeller domain distantly related to WD-40 repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.46646 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.639448 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAAG TTCAGTTAGC GGCGCTTGCA TTTACTATCA CGGCGCTTAT ATTCCTACTA 
TACTCGTTGT CTGTTCCCTC TGTCCCTCAG TCCCCGCCGC CGGCGGGTGC CCCGCCCGCC
GTCTCTGGGG GGATAAAGTC TCTCAGAAAC TTCTCCTCGT ACGAGGAGCT GGTGAGGTTT
GTCTCGCTGG TGGCCGAAGC TGAGGTTGCA TACGGCTATG TCCCCTTCGC AGTCGCCCCG
GCTGTTGCGT GGACTGCCGT CCCAGTAGCC GCACAAGTTG CCGGAGATTC GGCGAGGCGC
GTGTTTTCCA CTACAAACGT CCAGGTGGCC GGCGTCGACG AGCTGGACAT TGTGAAAACT
GACGGGAGGA TAATCGCAGT GGCTTCTGGA GAGAGAGTTT ACCTAATTGA CGCATCCGCT
AGGAGGCTCA TCTCGGCTAT TAAGCCTAAC GGCACGGCGG AGGGCCTATT CTTGTGGGGA
GGCGTCCTGG CTGTGGTCAC GGTGAGGTAT CCCATCGGGG GCGGCGCAGT GGCGTCTACA
GCCTTAGAGA TTTACGATAT TTCAGACCCA CACGCCCCGG CGGCTAGGGG GGCAGTTGTG
TACAGCGGAA CGCTGGTTGG GGCAAGGATG ATTAACGGCA CGGTGTACCT AGTGCTTTCC
ACGCCGGCGA GGCCCGAGGG CGTGCCCAGG GTAAACGGGG CGCCGATTAG GCCGGAGAGG
GTGTACCTGG TTGATCCTCT CCCAACTTCC TTTACCACAG TCGCCGCTGT GGACCTGAAC
AGCGGAAAGA TGGGCGAGGT CTCCCTCCTG ATATCCCAAG CTACGTGGAT TTACATGAGC
GCAACCCGGC TGTACGTGGC GTCGTCGGGA TCGCCTTATG CTGAGGCCCT TGCCAAGGCG
CTGGGTATAC TGGCAGATGT GGCCCCCGGC GAAGTCGGGG CTAGTATTAA GTCTGAGCTA
CAGCGGGGTA ACCTCACAGG TGCTTTGAAA ATAGCCAGGA GCTACATCGC GTCTTTACCC
GCCGCCGAGG CGGAGGAGGT TTTAAGAAGA GTTTCCACAG CGCTGTCGGC GTACGTCTTC
AACGAGACCA CGAGGATACA CGTCTTCTCG GTGAGGGGCG TCGACGTGGC CTACCAGGGC
TACGTGGATC TGCAAGGCCG CGTCTTGGAC CAGTTCTCGC TTGAGGAGTA CAAGGGCTAC
TTAATTGTGG CCACCACTGC CTCGAACTAC ACAGCGTCGC TGGTTGAGCC AAAACCGATA
CTGCCGCCCA TACCCACGAC CGACGTGGGT ATCAACGTGT TGGAGTGTAC GGGGTCGCAG
TGCAGTGAGC GGCGGGTGGT CATTAGGCCG CCTCCTTCGC CCCCCTACTG GCCGCCTGTT
TTCGACGTGG TTATAACACC CAAGGGGGAG AGCCTTAACA ACGTCTTTGT CGTATCCCCA
GCTGAGCTTA GGGTTGTTGG CCGCCTCGGC GGTTTGGCAA AGGGGGAGAG GATATACGCG
GCTAGGCTAA TCGGCGACTT GTTCTTTTTG GTGACCTTCC GCCAGGTGGA TCCCCTCTAC
GCCGTAGACG TGAGCAACCC GGAGAAGCCA GCTGTGCTCG GCTTTCTCAA AATACCGGGC
TACAGCGAGT ACCTACACCC CCTAGGCGGC GGCATGTTGC TAGGCGTCGG GGTCGAAGAC
GGGTCGCTTA AGGTATCGCT GTTCGACGCT TCGAACCCCA GAGACATAAA GGAGGTGGCA
TATGTAAAGA TGGAGGGGTC CAGGAGCCCA GCCCTCTTTG ACCATCACGC CTTTACCATA
CGCCCTGACA AGAGGCTGGT GATGATTCCA GTAACTGCTG GGCACTACGG CATCCCGGCT
GGGATCGCGG CGATAAGCTT CTCAGAGGGG CTGAAGCTTC TGTACCTAAT GGAGCACTGG
GGCGCGGTCA GATCGCTATA TGTGAACAAC ACAGTATTTA CGATAGCGAC AGACAGCGTG
AAGATGTTCG ACATTGACAC ATTTAAGGAG CTGGCAGAAA TCCCGCTGAG GTAG
 
Protein sequence
MAKVQLAALA FTITALIFLL YSLSVPSVPQ SPPPAGAPPA VSGGIKSLRN FSSYEELVRF 
VSLVAEAEVA YGYVPFAVAP AVAWTAVPVA AQVAGDSARR VFSTTNVQVA GVDELDIVKT
DGRIIAVASG ERVYLIDASA RRLISAIKPN GTAEGLFLWG GVLAVVTVRY PIGGGAVAST
ALEIYDISDP HAPAARGAVV YSGTLVGARM INGTVYLVLS TPARPEGVPR VNGAPIRPER
VYLVDPLPTS FTTVAAVDLN SGKMGEVSLL ISQATWIYMS ATRLYVASSG SPYAEALAKA
LGILADVAPG EVGASIKSEL QRGNLTGALK IARSYIASLP AAEAEEVLRR VSTALSAYVF
NETTRIHVFS VRGVDVAYQG YVDLQGRVLD QFSLEEYKGY LIVATTASNY TASLVEPKPI
LPPIPTTDVG INVLECTGSQ CSERRVVIRP PPSPPYWPPV FDVVITPKGE SLNNVFVVSP
AELRVVGRLG GLAKGERIYA ARLIGDLFFL VTFRQVDPLY AVDVSNPEKP AVLGFLKIPG
YSEYLHPLGG GMLLGVGVED GSLKVSLFDA SNPRDIKEVA YVKMEGSRSP ALFDHHAFTI
RPDKRLVMIP VTAGHYGIPA GIAAISFSEG LKLLYLMEHW GAVRSLYVNN TVFTIATDSV
KMFDIDTFKE LAEIPLR