Gene Pars_1659 details


Gene Information

Locus tag: Pars_1659
Symbol:
ID: 5056046
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1497457
End bp: 1499439
Gene Length: 1983 bp
Protein Length: 660 aa
Translation table: 11
GC content: 54%
IMG OID: 640469202
Product: signal transduction protein
Protein accession: YP_001153864
Protein GI: 145591862
COG category: [K] Transcription
COG ID: [COG2524] Predicted transcriptional regulator, contains C-terminal CBS domains
TIGRFAM ID:
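
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the gene is a stop codon that is not counted in Protein Length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1497457, 1499439)
print(length, protein_length_aa(length))  # prints: 1983 660
```

Under these assumptions the result agrees with the Gene Length (1983 bp) and Protein Length (660 aa) fields.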


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCTCAGT CTCAGATTAC TAGAAAGGAG GTTGAAAAGC TTGTCGAAAA GCCAGTCCCC 
CAGTACGTGA GGGATGTGGT TGAGCACCCA CCTTTTAGGG TAACGCCTAA TACAACACTA
GAGACTCTTG CCGCCTATCT GAGAAAATTC CCAGTGGACG TAGTACCCGT GTTCAGGTCG
GTGTTTTCGG AAGAGGTGGC AGGGGTTGTG TACCCGCACA CAGCTTTGCT TCTCAAAAGC
AAGAAATCAG ACGACAAGGT GGGCGAGTTT GTTAACCCGC CGATTTTTGT AAAGGAGAGC
TGGAAAATAG AAAACGTGCT CGAAATCCTC ACGAGCGAAG CTAAGTGGGG GGCGGTGGTC
GTGGACGAGG AGGGGAAATA CGTGGGGGTG GTAACTCTTA GAGGTCTGCT CTCCGCGCTG
TTGCTTAGGG AGCCTAAGGC GAAATCCGTC GCCGCCGTCT ACACCCCCGC CGAGGAGAAG
AAATCTCGGG CAGGTTTTGT CAAGGCAATA GAAAAGGTGT CGAGGATTTT CAAAAAACTG
ACCGGAGGCG AGGTAGATGG GTACGTCGTT TTGAACAGAG AGGGGGGAGT GGCGGGTATC
CTTACTGTGT GGGATTTGGT GAAGTCGCGG AGGTGGTACA AGGGCGCCGG GGCGCCTAGA
GCCATATTCG GCACGCGTGT GACAAGAGGC GAGAGCAAAA GCTCCGGCGT CGCGAGGGTA
TGGAGGCTTA TGTTCAGAGG CGCGGCGGTG GCGACGCCGG ATACGCCGAT AGAGGAGGTG
GCAAGGTTTA TGGCGACTAC TGGTCTATAC ATAGCTCCTG TTGTAAACAG AGAAGGCAAA
GCCATCGGCG TAGTGACCGC GTGGGATGTT ATACATGCCT ACCTATACGG GCCTAAGGAG
GGTAGAGAAG ATGTGGAAGT AAAGAGGGTA GCGGAGGTGC AAGTGGCTAA GCCCCACGTT
GAGGAGGCTA TTAGGCTACG GCCTTCTAAA CACGTTACTG GGCTTAGAGC CCGCGACGTC
ATGCTCACAG ACGTGCCTGT AGTAAATCTA AGAGATCCGC TATCCAGAAT ACGTAAAGTG
TTCCTCCGCA CTGGGGCCTC CATATTAGCC GTTGTAGACG ACGAGGGCAA AGTTACGGGT
TTTATCACTA GGAGGGACTT CTTGACTTAT ATCGCAGAGA AGTCGCTGGG ATACTGGAAG
AGACAGAGGG GCAAAATGCT TATTCTGAAA GAGCAGGTAA TGCCAGGGGA GAGGGCCAAG
TTGCTTGTAG AGGAGGGTAC TGCAGGAGAC GTAATGAAGA CAGAATACCC GACGGCGCCT
CCAGACGCCA CGGTAGAGGA GATAGCGTAC AAGATGCTAG CGGCGGGGAC GGACTACGTG
GTTATTGTAG ATGAGAAAGG TACTTCCGTG GGCGTCGTGA CGAAAAACGA GTTACTTAAG
GCGTTTAAGG AGCGGGGCCG CGACGTAAAA GTCGGCGAGT TGATGGCCCC GGCGGAGGTG
GCCACCGCCG ACCTCTTCCA CAGCCTACAC TCAGTAATAC GCAAAATAAA CGCCTACGAG
CTAGACGGCG TTGTCGTAAA GGAAGGCGCC GAGGTTAAGG GAGTCATGAC CGTGGACGAC
TTGACTCTGA GGCCTGTGGA GGAGAGCCTT AGAGGGGAGA AGCTGGTGTT TTTCACAAAG
ACAGGTGTAA GGAGGGCGGT GAGGACGGGT CTTAGCAGAT TGAGATATTC CAAAATCGGG
ATAATTACCG CAATTGACGT CATGAGGCCG GTTGACCACG CTGTGAGCGC CGACGCAAAC
GCCAAGGAGG TGATAGACAA GTTGTTCGAA CAAGGCGTCC TGCCTGTATA CAACGAAAAA
GGCGAGTTGG TAGGTGTGTT GAACAAAATG GACGTGGTGA AGGAGCTGGC TAGGGTCTAC
GTGACGTACG CCATGCCCGA AAAAGTTAAG GAGGCAGAGA AGGCGGAGGC TCGGGCGAGG
TAA
 
Protein sequence
MSQSQITRKE VEKLVEKPVP QYVRDVVEHP PFRVTPNTTL ETLAAYLRKF PVDVVPVFRS 
VFSEEVAGVV YPHTALLLKS KKSDDKVGEF VNPPIFVKES WKIENVLEIL TSEAKWGAVV
VDEEGKYVGV VTLRGLLSAL LLREPKAKSV AAVYTPAEEK KSRAGFVKAI EKVSRIFKKL
TGGEVDGYVV LNREGGVAGI LTVWDLVKSR RWYKGAGAPR AIFGTRVTRG ESKSSGVARV
WRLMFRGAAV ATPDTPIEEV ARFMATTGLY IAPVVNREGK AIGVVTAWDV IHAYLYGPKE
GREDVEVKRV AEVQVAKPHV EEAIRLRPSK HVTGLRARDV MLTDVPVVNL RDPLSRIRKV
FLRTGASILA VVDDEGKVTG FITRRDFLTY IAEKSLGYWK RQRGKMLILK EQVMPGERAK
LLVEEGTAGD VMKTEYPTAP PDATVEEIAY KMLAAGTDYV VIVDEKGTSV GVVTKNELLK
AFKERGRDVK VGELMAPAEV ATADLFHSLH SVIRKINAYE LDGVVVKEGA EVKGVMTVDD
LTLRPVEESL RGEKLVFFTK TGVRRAVRTG LSRLRYSKIG IITAIDVMRP VDHAVSADAN
AKEVIDKLFE QGVLPVYNEK GELVGVLNKM DVVKELARVY VTYAMPEKVK EAEKAEARAR
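
The gene and protein sequences are printed in space-separated blocks of 10 characters, so the blocks must be joined before any analysis. The following is a minimal sketch of how that could be done, together with a GC-content calculation and a translation using NCBI translation table 11. The function names are hypothetical. The sketch assumes an unambiguous A/C/G/T sequence and treats the first codon as methionine when it is one of the initiator codons of table 11; both are simplifying assumptions, not a description of how the database derived its values.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses
# the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Initiator codons of table 11; an initiator at position 0 is read as Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Join the 10-character blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append("M" if i == 0 and codon in START_CODONS else aa)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = (
    "GTGTCTCAGT CTCAGATTAC TAGAAAGGAG GTTGAAAAGC TTGTCGAAAA GCCAGTCCCC"
)
print(translate(first_line))  # prints: MSQSQITRKEVEKLVEKPVP
```

The 20 residues produced from the first line of the gene sequence match the first two blocks of the protein sequence. The leading GTG codon is read as M because it is the initiator.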