Gene Pars_1236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1236 
Symbol 
ID5055439 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1118647 
End bp1119693 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content56% 
IMG OID640468782 
Producthypothetical protein 
Protein accessionYP_001153455 
Protein GI145591453 
COG category[S] Function unknown 
COG ID[COG2855] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGACTA AGTGGACTGA TCCAGCAAAG GCCTTCGCGG TGCCGAGCCC AAAGCTTGTG 
TTCGGGAGCC CGTGGGTAAA TATCATAGTT GCGTGGGTTG TCTTAATGCT CCTTCTGATG
GGCCCCGCAA AGCTAATCGG CGTCAAGCCG AGCGACTGGG CCAAGGGCTT TTCTGTAATC
TGGTGGCTGT GGTGGCTTTC CGCATTTATT GTCGGGTATA AGCCCATAGC TGACGTTGTG
ACCACAGAAT TTGCCTTCAC CCTAGCCCTC TTCCTTGGGA TGCTCATAGG CAATTTGCCT
AAGGTGCATC AGTGGTTGCT CGGATCGGCG AGGGGGGAGT GGTTTATAAA AACGGCAATT
GTGCTCCTAG GCGCCAAGAT CCTCTTCACA GACTGGATTA GATACGGAGG CTCTGTTCTC
GTCATGGTGC TTATGTCCTT CCCAGTGTTT ATGCTCTTGG CATTCCCCGT GTTCAGGCTC
TTCACCAAGA ATACCGATCT AAGCATCGTG GCTTCCGTAG GCATAGGCGT GTGCGGCGTG
TCGGCGTCTA TCACAGCGGC CGGCGCCATT GGGGTCCCCG CTATTTACCC CACAGTGGTG
TCAGCCGCAA TCCTGATATA CGCGGCGGTT GAGCTCATCA TCTTGCCGTA CGTGGCGCAG
TGGCTGGTTA AGGCAGGGAT AATGAGCCCT GCTACCGCAG GGGCGTGGAT GGGCCTCTCT
GTTAAGACCG ACGGGGCCGC CGCGGCGTCT GCTGAAATAG TCACCCGCTA CGTGGGGGTT
GATGAGCCTC TACGCGTCGG CGTAATGGCC AAGGTCTTAA TTGATATCTG GATGGGGGTG
ATCGCCTTTG TCCTCGCCTT GATATGGGTG TTCGTTGTAG AGGTTAGGCG CGGAGTCGCG
AGCGGCCGCA GGCCCTCGCC GATGGAGCTC TGGTATAGAT TCCCCAAGTT TGTCCTAGGC
TACTTCTTCA CATCGCTGGT AATTTCAGCA CTTATTATGA GCTTAGCCGG CTCTGTATAC
GCCACTGCCC CGAACCCCGT CGACTAG
 
Protein sequence
MWTKWTDPAK AFAVPSPKLV FGSPWVNIIV AWVVLMLLLM GPAKLIGVKP SDWAKGFSVI 
WWLWWLSAFI VGYKPIADVV TTEFAFTLAL FLGMLIGNLP KVHQWLLGSA RGEWFIKTAI
VLLGAKILFT DWIRYGGSVL VMVLMSFPVF MLLAFPVFRL FTKNTDLSIV ASVGIGVCGV
SASITAAGAI GVPAIYPTVV SAAILIYAAV ELIILPYVAQ WLVKAGIMSP ATAGAWMGLS
VKTDGAAAAS AEIVTRYVGV DEPLRVGVMA KVLIDIWMGV IAFVLALIWV FVVEVRRGVA
SGRRPSPMEL WYRFPKFVLG YFFTSLVISA LIMSLAGSVY ATAPNPVD