Gene Pars_1176 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1176 
Symbol 
ID5056042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1065061 
End bp1066353 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content58% 
IMG OID640468726 
Productbasic membrane lipoprotein 
Protein accessionYP_001153399 
Protein GI145591397 
COG category[R] General function prediction only 
COG ID[COG1744] Uncharacterized ABC-type transport system, periplasmic component/surface lipoprotein 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.0623927 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACAA CCAGGAGGGA CTGGCTTAAA GTCGTGGCTT CCTTCGCCAC CGGCGCGGTG 
CTGGGAGCCG CCGGCGGCTA CTACGCAGGC CTACAAAGCA TAAAAAAAGC AGTTGAGAAG
ACAACGACAT CAACCACAGT AGCCGAGGCT CCGAAAACAC AGATAGGCTT ACCGAAAATG
AAGATACACT TCATATACGT GGGGCCTGTC GGCGACTACG GCTGGACCCA CGCCCACGAC
CAGGGCAGGA AATTCCTGGA AAGGACCCTC TGTCCAGATC CCAATAAGTG GTGCATCGTG
GAGACTTCCT ACACCGAGAG CGTCCCAGAG GAGCAGGCGT ACAGCTATGT GAAAAACGCC
GTCTCCGGCG GCGCCCACAT GGTGATTACC ACCTCTTACG GCTTTATGGA CGGGACAAAG
AAAGCAGCGG CGGAGTACCC CGACCGATTC TTCGCCCACT GCTCAGGCTA CTTCAAAGAC
CAACAAGAGT GGAGCCGCCT ACAGAACTTT GCCGAGTACT TCATTGACCT CTACGAGGCC
TACTACCTCA ACGGCATTGT TGCCGGGAAG ATGACGAAGA CAAATAGGCT GGGTTACGTC
GCCGCGTTCC CCAAGTTGCC CGAGATCATC CGGCACATGA ACGCCTTTCT GATCGGCGCC
CGCGAGGTCA ACCCAAATGT GCAGATGGAC GTGGTGGGCC TCGGCGCGTG GTACGCCCCC
GAAAACGCCA CAAGAGCCGC CCAGGCGCTC GTCGACACCA ACGGCGTTGA CGTGCTCGCG
TTTACAGAGG ACTCCCCCGC AGTCCTCCAA GCTGCCGAGA ACTACCAGAA GCAGGGAAAG
AAGGTGTGGT CCTTCTCCCA CTACAGCGAC ATGTCCCAGT ACGGCCCCAA CGCCAACCTC
ACTGGGCAGA TAGTAAACTG GGGGCCGCTA TACGTGGAGA TGGCGATTAG GGCATACCTG
GCGTGGATAT CCGGCGTGCT GTCAATCTGG TCTGAATGGC CACCTGAAAG GCCCAGAGAC
TACTGGTGGA GCATGAAGAA CTCCTACGCC TACTACGACT ACAAGAAAAA CCCAGCAGAC
ATACTCCCCC TAAACCCAGC TGTGCCGGAC GAGGTGAGGA AATACGTCGA GGAGAGGAGG
AGGCAGATCA TAGAAGGCGT CTGGGACCCG TTCACGGGCC CCGTAAGAGA CATGCAGGGC
AAAGTGAGGG TGCCGGACAG GGCGAGGCTG AGCAAAGACG AGCTCTACAA CATGGATTGG
TACGTGGAGG GCTACCGCCA GCTACCCTCG TAG
 
Protein sequence
MSTTRRDWLK VVASFATGAV LGAAGGYYAG LQSIKKAVEK TTTSTTVAEA PKTQIGLPKM 
KIHFIYVGPV GDYGWTHAHD QGRKFLERTL CPDPNKWCIV ETSYTESVPE EQAYSYVKNA
VSGGAHMVIT TSYGFMDGTK KAAAEYPDRF FAHCSGYFKD QQEWSRLQNF AEYFIDLYEA
YYLNGIVAGK MTKTNRLGYV AAFPKLPEII RHMNAFLIGA REVNPNVQMD VVGLGAWYAP
ENATRAAQAL VDTNGVDVLA FTEDSPAVLQ AAENYQKQGK KVWSFSHYSD MSQYGPNANL
TGQIVNWGPL YVEMAIRAYL AWISGVLSIW SEWPPERPRD YWWSMKNSYA YYDYKKNPAD
ILPLNPAVPD EVRKYVEERR RQIIEGVWDP FTGPVRDMQG KVRVPDRARL SKDELYNMDW
YVEGYRQLPS