Gene Pars_2307 details

Gene Information

Locus tag: Pars_2307
Symbol:
ID: 5056098
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 2063132
End bp: 2064742
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 57%
IMG OID: 640469859
Product: hypothetical protein
Protein accession: YP_001154503
Protein GI: 145592501
COG category: [S] Function unknown
COG ID: [COG3356] Predicted membrane protein
TIGRFAM ID:
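
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(2063132, 2064742)
print(length)                     # 1611
print(protein_length_aa(length))  # 536
```

Both values agree with the Gene Length and Protein Length fields of this record.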


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.00335528
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGGTCTT TTGAGAAGGG GTATAGCATT CTCTTTGGGC GTTCGCCGAG GAGGGTTGCG 
CTTTACGCAA CGGCCCTCTT AGCCTTCTTA GCGGCTTTGA AGGCGCTGTC CGCACAGCGG
GCGCCACTTC TTTACGCCTT GTTCGGCGGC GTGATTCTCC TCATACTGCT CTCAGCGGAT
CGCGCCGTGA TTAACCCGCG CAGATCTTAC TACGTCGCGG TTATATCGAC GCTGGTGGTC
TCCTTCTTCG ATTTATTATT TCAAAAAGCT CCGCTGACCT TTGCCCTAGT CGGCGCCGTG
ATAACCGCGG TGGTCCTGCA GTCGCTTAAA TGCAGGAGCT TTTGGTACAT CGCTCCACTC
GTCGCGGTCT CAGCTATTTA CTACGCGGTG GGGGAGCTGT ACCTTTTCGC CATTTCTCTC
CTTTACATAT CGGTGCTCCA GCTGACTAGG TTTGTTATAA ACAAGATGGT GAGGGGTCTC
GACGCCATGT GCATGTTTTC GAGCTTTATC TACTCCGTCT TTGCAGAAGA TGACGTTTTG
GAAGACGCCT TCAGGGAGTT GGGCAGGTTG GAAAGGGTGC CTCTCCACGT CTTTATCATC
GGCGGGAGGC ACGTCGTCGT TGTGTCGGAC TTCCACCCAG GGCCGTTTAG GCACATCGGC
GGCGGTATGC TGGTAGATGA GTTGCAGAAA GCGGTTGAGG GTATGGGGTA CAGCTTCACC
TTTCTCCACG GCGTTGGTAG CCACGAGCGC GACCCCGTGG ACGGGGAATC CCTCAGGAGA
ATAGTAAACG CGGTCAAGAC TGTCTTGGCC TACGGGCGAA ACGGAGCCCC GCCCAGGGGG
ATCTATCCGC AGAGCCACAT TGTTGGGGAC GTAAAGGTAG TGGGCCTCAG CCTCGGCGCA
CCGCCGTACC TAGCAGTGGT GAGCAGGGTG AACTCCGCCT CGGACGACAT CCCCACCTGG
GTTAGCCGGC TTGTGGACAC CGGCGCGTAT ATACTAATCG ACGCACAGAA CAAATTCGAC
GGCGCGGTGC AGTGGCGCGA GGTGGACGTG GCGTCGCTCT CCAAGGGGCT GAAAGCCCTC
CAGGAGGCCC CGCAGTGCCG CGTCTTCAAA ATCGGCGTGG GCAAAGTAAG TGCGCACCAC
CTCGATGTCC TGGGCTACGA GATTGGGCCG GCGGGGATAT CGGCAATAGT AGGCGAGTGC
GACGGGGCGA GGAGCTTGCT GGTAGTTTTT GACGGGAACA ACCTACACAG CGAGTTGTAC
AACAAGATCG TAGACACGTT CGAGAGCCGT GGCTACAAGC TGGTTGAAGT AGTAACCACC
GACACTCACA GGGCCACGGG AATTGGCATC GGCAAGGGAT ACCGCATAGT GGGCGAGCGC
ATAGACCATG GACAGATCTT AAAGGCCGTA GAAGAGGCTG TGTCCATCGC CGAGAGATCG
CTCGGCGACC ACAACGTAGA CTACAAGAGG GTAGAGGTTG AGGCGTACGT CTTGGGTGAG
GAGGGCTTTA GGAAGATCCA AGACGCCGTG AGGATGTACA AGAAAGTCGG GGTGTTGATC
GCGGCGGTTG TATTCGCCCT GCCAATTCTC CTAATTTCGC TTTTAGCATA A
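
The gene sequence is displayed in blocks of 10 bases, 60 per row, so the spaces and line breaks must be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage like the one reported in the record. The helper names are hypothetical, and the exact rounding used by the source database is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove display whitespace (spaces, line breaks) and uppercase the bases."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in seq)
    return 100.0 * gc / len(seq)


# First displayed row of the gene sequence: six blocks of 10 bases.
first_row = "ATGAGGTCTT TTGAGAAGGG GTATAGCATT CTCTTTGGGC GTTCGCCGAG GAGGGTTGCG"
print(len(clean_sequence(first_row)))  # 60
print(round(gc_percent(first_row), 1))
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.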
 
Protein sequence
MRSFEKGYSI LFGRSPRRVA LYATALLAFL AALKALSAQR APLLYALFGG VILLILLSAD 
RAVINPRRSY YVAVISTLVV SFFDLLFQKA PLTFALVGAV ITAVVLQSLK CRSFWYIAPL
VAVSAIYYAV GELYLFAISL LYISVLQLTR FVINKMVRGL DAMCMFSSFI YSVFAEDDVL
EDAFRELGRL ERVPLHVFII GGRHVVVVSD FHPGPFRHIG GGMLVDELQK AVEGMGYSFT
FLHGVGSHER DPVDGESLRR IVNAVKTVLA YGRNGAPPRG IYPQSHIVGD VKVVGLSLGA
PPYLAVVSRV NSASDDIPTW VSRLVDTGAY ILIDAQNKFD GAVQWREVDV ASLSKGLKAL
QEAPQCRVFK IGVGKVSAHH LDVLGYEIGP AGISAIVGEC DGARSLLVVF DGNNLHSELY
NKIVDTFESR GYKLVEVVTT DTHRATGIGI GKGYRIVGER IDHGQILKAV EEAVSIAERS
LGDHNVDYKR VEVEAYVLGE EGFRKIQDAV RMYKKVGVLI AAVVFALPIL LISLLA
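
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. Below is a minimal sketch, assuming the codon-to-amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code. Table 11 also allows alternative start codons; the sketch does not handle them, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G) paired with their amino acids.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spaces and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate("ATGAGGTCTT TTGAGAAGGG GTATAGCATT"))  # MRSFEKGYSI
# Last displayed row of the gene sequence, which ends in the stop codon TAA.
print(translate("GCGGCGGTTG TATTCGCCCT GCCAATTCTC CTAATTTCGC TTTTAGCATA A"))  # AAVVFALPILLISLLA
```

The two outputs match the first 10 and last 16 residues of the protein sequence above.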