Gene Pars_0904 details

Gene Information

Locus tag: Pars_0904
Symbol:
ID: 5054634
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 800795
End bp: 802231
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 49%
IMG OID: 640468461
Product: hypothetical protein
Protein accession: YP_001153137
Protein GI: 145591135
COG category: [S] Function unknown
COG ID: [COG2855] Predicted membrane protein
TIGRFAM ID:
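
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function name `expected_lengths` is hypothetical and used only for illustration.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Derive gene length (bp) and protein length (aa) from CDS coordinates.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that does not contribute an amino acid.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record above.
print(expected_lengths(800795, 802231))  # (1437, 478)
```

Both results agree with the listed Gene Length (1437 bp) and Protein Length (478 aa).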


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAACAT ATGCCATGGC TCAGCAACGA AAGATTGATT GGAGTTCTTT GTGGAAGAAA 
GAAGATTGGT GGGCCCTGTG GCTTGGGCTT TTTGTTTTCT TACTGGCGTG GTTCCTCCTA
TTGGGCTGGG TGCCTAAGAC CAGCGTGTGG ATTGACCCAT CAAAAAGTAT ATCAACAGCG
AGCAAGGAAT TTGCATACCT CGGCGGCTGG AGCCTCATAT TGCTCTACTT CTTCACCCTA
GTGGTGTTGT CCATAGCGGC TGCGCTCATG AAATACGACG TGAAGGCGTT TGCAGTGGGC
TATACCGTTA TTTTCTGGCT TTCCTACCTC ATGTGGTGGT TTAGCAACTA TGCTTACATC
GCCGCCACTC CTGACGTATG GCCTAGATAT GGAATTAACT GGAGCCTCAG CCTAACTGGC
GAGGCTGGCT GCATATTTGC ATTAGTTCTC GGCCTTATAA TAGGCAACAC CGTGAGGAAG
CTACCCAAGC CGCTTGAAGT AGCAGCCAGG CCTGAGTGGT ATATCAAGAC TGCCATAGTC
CTACTAGGCG CAGTGGTTGG CGCAAAGGCG CTTCAAAATA TGACTGTCGC CGCTGAGGTT
TTAACTAGAA GCCTCATAGC AATTGTAGCT GCGTATTTAA TTTACTGGCC AATTTCATAC
TTAATATCAA GAAAAATTGG CTTAGATAAG CAGTGGGCTG CGACGCTGGC TTCCGGAGTC
AGCATCTGCG GAGTCTCTGC GGCCATAGCC ACTGCGGCGG CAATTGGGGC CCCCGCCGTG
ATTCCAGGCA CAATTGCATC TATCATAGTC ATATTTGCAG TAATCGAGTT GATAATTCTC
CCCTGGGTGG CGGCTCAGAT TTTGACATGG GCCCCTTTGG CCGCGGGGGC GTGGATGGGT
CTTGCCGTTA AGACGGACGG AGCTGCGGCA GCCTCAGGCG CTGTTACAGA TGCATTAATA
AAAGTTAAAG TGCCTGAAGC CGCAGGATGG GTCACTGCAA CGGCGGTGAC TGTTAAAGTA
TTTATAGACA TTTGGATTGG ACTGTGGGCA TTTGTACTAG CCCTGTGGTG GGTTACGAGA
GTAGAGAGAA AGCCAGGAGA GAAGGTTCAA GCCGTAGTCA TTTGGTATAG ATTTCCAAAA
TTTGTAATTG GCTATTTCGT TACAATGTTT GCAATATTAG CCCTGGCCTC TTTCATACCT
ATTAAAGACG CCATTTCTCT TGCCAGTGCC GTAACGGGAC AGTCGGACGT GTTAAGACAG
TTCTTCTTCC TCATAACTTT CACCTCCATA GGGCTAACTA CTAACTTTAG GAAGTTTAAA
GAAATCGGCG CCGGCAAGGC CGTGGTGGCT TACTTTATAT CCTTGCTGGT TATAATATTC
ATAGCCTTAG GTCTCGCCGT GGCTTTCTTT GCAGGATTGC CTCTGCCCAA GTCCTAA
 
Protein sequence
MLTYAMAQQR KIDWSSLWKK EDWWALWLGL FVFLLAWFLL LGWVPKTSVW IDPSKSISTA 
SKEFAYLGGW SLILLYFFTL VVLSIAAALM KYDVKAFAVG YTVIFWLSYL MWWFSNYAYI
AATPDVWPRY GINWSLSLTG EAGCIFALVL GLIIGNTVRK LPKPLEVAAR PEWYIKTAIV
LLGAVVGAKA LQNMTVAAEV LTRSLIAIVA AYLIYWPISY LISRKIGLDK QWAATLASGV
SICGVSAAIA TAAAIGAPAV IPGTIASIIV IFAVIELIIL PWVAAQILTW APLAAGAWMG
LAVKTDGAAA ASGAVTDALI KVKVPEAAGW VTATAVTVKV FIDIWIGLWA FVLALWWVTR
VERKPGEKVQ AVVIWYRFPK FVIGYFVTMF AILALASFIP IKDAISLASA VTGQSDVLRQ
FFFLITFTSI GLTTNFRKFK EIGAGKAVVA YFISLLVIIF IALGLAVAFF AGLPLPKS
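
The sequences above are printed in space-separated blocks of ten residues. The following is a minimal sketch of how such a block-formatted gene sequence could be cleaned, checked for GC content, and translated. It assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TAA) and relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code (the two differ in which start codons are permitted). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses the same assignments for elongation.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0 until the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = ("ATGTTAACAT ATGCCATGGC TCAGCAACGA "
              "AAGATTGATT GGAGTTCTTT GTGGAAGAAA")
print(translate(clean_sequence(first_line)))  # MLTYAMAQQRKIDWSSLWKK
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence through `clean_sequence` and `gc_fraction` gives a value that can be compared with the GC content field of the record.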