Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1659 |
Symbol | |
ID | 5056046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1497457 |
End bp | 1499439 |
Gene Length | 1983 bp |
Protein Length | 660 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640469202 |
Product | signal transduction protein |
Protein accession | YP_001153864 |
Protein GI | 145591862 |
COG category | [K] Transcription |
COG ID | [COG2524] Predicted transcriptional regulator, contains C-terminal CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCTCAGT CTCAGATTAC TAGAAAGGAG GTTGAAAAGC TTGTCGAAAA GCCAGTCCCC CAGTACGTGA GGGATGTGGT TGAGCACCCA CCTTTTAGGG TAACGCCTAA TACAACACTA GAGACTCTTG CCGCCTATCT GAGAAAATTC CCAGTGGACG TAGTACCCGT GTTCAGGTCG GTGTTTTCGG AAGAGGTGGC AGGGGTTGTG TACCCGCACA CAGCTTTGCT TCTCAAAAGC AAGAAATCAG ACGACAAGGT GGGCGAGTTT GTTAACCCGC CGATTTTTGT AAAGGAGAGC TGGAAAATAG AAAACGTGCT CGAAATCCTC ACGAGCGAAG CTAAGTGGGG GGCGGTGGTC GTGGACGAGG AGGGGAAATA CGTGGGGGTG GTAACTCTTA GAGGTCTGCT CTCCGCGCTG TTGCTTAGGG AGCCTAAGGC GAAATCCGTC GCCGCCGTCT ACACCCCCGC CGAGGAGAAG AAATCTCGGG CAGGTTTTGT CAAGGCAATA GAAAAGGTGT CGAGGATTTT CAAAAAACTG ACCGGAGGCG AGGTAGATGG GTACGTCGTT TTGAACAGAG AGGGGGGAGT GGCGGGTATC CTTACTGTGT GGGATTTGGT GAAGTCGCGG AGGTGGTACA AGGGCGCCGG GGCGCCTAGA GCCATATTCG GCACGCGTGT GACAAGAGGC GAGAGCAAAA GCTCCGGCGT CGCGAGGGTA TGGAGGCTTA TGTTCAGAGG CGCGGCGGTG GCGACGCCGG ATACGCCGAT AGAGGAGGTG GCAAGGTTTA TGGCGACTAC TGGTCTATAC ATAGCTCCTG TTGTAAACAG AGAAGGCAAA GCCATCGGCG TAGTGACCGC GTGGGATGTT ATACATGCCT ACCTATACGG GCCTAAGGAG GGTAGAGAAG ATGTGGAAGT AAAGAGGGTA GCGGAGGTGC AAGTGGCTAA GCCCCACGTT GAGGAGGCTA TTAGGCTACG GCCTTCTAAA CACGTTACTG GGCTTAGAGC CCGCGACGTC ATGCTCACAG ACGTGCCTGT AGTAAATCTA AGAGATCCGC TATCCAGAAT ACGTAAAGTG TTCCTCCGCA CTGGGGCCTC CATATTAGCC GTTGTAGACG ACGAGGGCAA AGTTACGGGT TTTATCACTA GGAGGGACTT CTTGACTTAT ATCGCAGAGA AGTCGCTGGG ATACTGGAAG AGACAGAGGG GCAAAATGCT TATTCTGAAA GAGCAGGTAA TGCCAGGGGA GAGGGCCAAG TTGCTTGTAG AGGAGGGTAC TGCAGGAGAC GTAATGAAGA CAGAATACCC GACGGCGCCT CCAGACGCCA CGGTAGAGGA GATAGCGTAC AAGATGCTAG CGGCGGGGAC GGACTACGTG GTTATTGTAG ATGAGAAAGG TACTTCCGTG GGCGTCGTGA CGAAAAACGA GTTACTTAAG GCGTTTAAGG AGCGGGGCCG CGACGTAAAA GTCGGCGAGT TGATGGCCCC GGCGGAGGTG GCCACCGCCG ACCTCTTCCA CAGCCTACAC TCAGTAATAC GCAAAATAAA CGCCTACGAG CTAGACGGCG TTGTCGTAAA GGAAGGCGCC GAGGTTAAGG GAGTCATGAC CGTGGACGAC TTGACTCTGA GGCCTGTGGA GGAGAGCCTT AGAGGGGAGA AGCTGGTGTT TTTCACAAAG ACAGGTGTAA GGAGGGCGGT GAGGACGGGT CTTAGCAGAT TGAGATATTC CAAAATCGGG ATAATTACCG CAATTGACGT CATGAGGCCG GTTGACCACG CTGTGAGCGC CGACGCAAAC GCCAAGGAGG TGATAGACAA GTTGTTCGAA CAAGGCGTCC TGCCTGTATA CAACGAAAAA GGCGAGTTGG TAGGTGTGTT GAACAAAATG GACGTGGTGA AGGAGCTGGC TAGGGTCTAC GTGACGTACG CCATGCCCGA AAAAGTTAAG GAGGCAGAGA AGGCGGAGGC TCGGGCGAGG TAA
|
Protein sequence | MSQSQITRKE VEKLVEKPVP QYVRDVVEHP PFRVTPNTTL ETLAAYLRKF PVDVVPVFRS VFSEEVAGVV YPHTALLLKS KKSDDKVGEF VNPPIFVKES WKIENVLEIL TSEAKWGAVV VDEEGKYVGV VTLRGLLSAL LLREPKAKSV AAVYTPAEEK KSRAGFVKAI EKVSRIFKKL TGGEVDGYVV LNREGGVAGI LTVWDLVKSR RWYKGAGAPR AIFGTRVTRG ESKSSGVARV WRLMFRGAAV ATPDTPIEEV ARFMATTGLY IAPVVNREGK AIGVVTAWDV IHAYLYGPKE GREDVEVKRV AEVQVAKPHV EEAIRLRPSK HVTGLRARDV MLTDVPVVNL RDPLSRIRKV FLRTGASILA VVDDEGKVTG FITRRDFLTY IAEKSLGYWK RQRGKMLILK EQVMPGERAK LLVEEGTAGD VMKTEYPTAP PDATVEEIAY KMLAAGTDYV VIVDEKGTSV GVVTKNELLK AFKERGRDVK VGELMAPAEV ATADLFHSLH SVIRKINAYE LDGVVVKEGA EVKGVMTVDD LTLRPVEESL RGEKLVFFTK TGVRRAVRTG LSRLRYSKIG IITAIDVMRP VDHAVSADAN AKEVIDKLFE QGVLPVYNEK GELVGVLNKM DVVKELARVY VTYAMPEKVK EAEKAEARAR
|
| |