Gene Pars_0207 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0207 
Symbol 
ID5054131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp187527 
End bp188648 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content56% 
IMG OID640467786 
Productuncharacterized linocin/CFP29-like protein 
Protein accessionYP_001152474 
Protein GI145590472 
COG category[S] Function unknown 
COG ID[COG1659] Uncharacterized protein, linocin/CFP29 homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.502019 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTATTCT CGAAGAACCC GGTAGACATT ACTAGGGACA GAAAGTTGTC CTCTGGAGAG 
ATCGCCGACT CGCTTAGGCT TGCCATTATG GCCGAGCTAG ATGCCATAAG CCTATATCTC
CAATTGGCCA GGCTAATCGA CGACGAGAGA GTAAGGAAGG TCTTTGAGGA CATAGCTAAG
GAGGAGAAGA CGCACTTCGG CGAATTTCTC GCACTGCTTA AACACATCGA CCCCGAGCAG
GTGGAGCAGT TGAAGGCAGG CTTGATAGAA GTCGGCGAGC TGACAGGCAT AAAGGCGCCG
ATGAATGATC CCGACAATAA GCGGGTTGGC GAGTCTAACG CGCGCAGTGA TCCGCCGCCT
GACGTGGTCT CTTCGTCGGG CCTAAGCCCC GAAGAGCTCA GGTACTTGCA GAATAGGGTG
AGGGAGGTCT CCGGCAAGGT GAGGAGGTTT AGGAGGTATC TTGCGACATA TGAGGCGGGC
CCCGGGGTCG ACGCAGTGCC GCTGGAGGAG GCGGCCCCCG GCCCCACAAT AGCCGCTAAT
AGGTCGGTGG TGCCGCTTAA GGAGCTGAGC GTGAAGTTCT CCATATCGCA GAGGCAGATC
GAGTACGCAA GAGCTAGGGG CGAGGCGGTC TACTCGACGT CGGCGGATAG GGCAGCGGTT
AGGCTAGCGT ATGAGGAAGA CGCAACAATA CTCGGCGATA TTCTGGGCAA CCCGAAGGTC
AAGACAATGG GCATCACGTC GTGGGACGCG CCGGGGTCTG CCGTAGCTGA GGTCTCAAAC
GCCGTAAATC TGCTTTACAG CAACTACGTG CCCGAGCCCT ATGTGTTATT TGTAAGCCCC
GGCAGATTCA CGAAACTCTT GACAGTTGTC GAGAAGACTG GCGTCATGGA GCTGACGAGA
GTTAAGTCCC TAGTTCAAGA CGTCGTCGTG GTGCCCCAAC TGAGAGACGA CACGGCACTG
TTGCTGTCAA CCCACCAATC AATCATCGAC GTAGCCGTAG GCGTAGACAC GGCGCTGGTA
TACCTCGGCC CCGAAGACGG TACACACGGG TTCAACTTGT GGGAAACCTT GGCGGTGAGG
ATCAAGGATC CCAACGGCGT AGTAGTACTA AAACAAACGT AG
 
Protein sequence
MVFSKNPVDI TRDRKLSSGE IADSLRLAIM AELDAISLYL QLARLIDDER VRKVFEDIAK 
EEKTHFGEFL ALLKHIDPEQ VEQLKAGLIE VGELTGIKAP MNDPDNKRVG ESNARSDPPP
DVVSSSGLSP EELRYLQNRV REVSGKVRRF RRYLATYEAG PGVDAVPLEE AAPGPTIAAN
RSVVPLKELS VKFSISQRQI EYARARGEAV YSTSADRAAV RLAYEEDATI LGDILGNPKV
KTMGITSWDA PGSAVAEVSN AVNLLYSNYV PEPYVLFVSP GRFTKLLTVV EKTGVMELTR
VKSLVQDVVV VPQLRDDTAL LLSTHQSIID VAVGVDTALV YLGPEDGTHG FNLWETLAVR
IKDPNGVVVL KQT