Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1532 |
Symbol | |
ID | 5054182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1389421 |
End bp | 1390620 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640469073 |
Product | GntR family transcriptional regulator |
Protein accession | YP_001153738 |
Protein GI | 145591736 |
COG category | [E] Amino acid transport and metabolism [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.192742 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGATAG GACAACTCCT CTCGTCCCGC ACGCGCTACA TGTCCGCGAG CGAGATTAGA GAACTGTTAA AGTGGGCGAC GGCGGACGTC ATCTCCTTCG GCGGCGGCAT GCCTGACCCC TCCACCTTCC CCGTGGAGGA CATCGCGAAG ATCGCGTCCT ACGTGCTGGA GGCCTACCCC CACAAGGCGC TCCAGTACGG TCCTACTGAA GGGGTTTACG AACTACGCGA AGAAATTGCA AAATTCAGCG AATCTTTCAG AGGGATTAAG ACCCGTGCAG AGAATATTAT AATCACTGTG GGTAGCCAGG AAGCTTTGGA GCTGTTGGGC AGGGTGTTCA TCAACCCTGG GGACGTAGTA ATTACGGAGA ACCCCACATA TCTCGCCGCT TTGCAGGCGT GGCGGGTCTA CGAGCCTAGG CTTGTGGGTA TTCCCATGGA CGAAAGCGGC ATGGTCGTGG AGATCCTGGA GGAGAGGGTT AAGCAACTCA AGGCGGAGGG GGCCCGCATC AAATTCATAT ACACAATACC GACAGCGCAG AACCCCACCG GCCTCACGAT GTCGCAGGAC AGAAGGAAGT ACCTCCTGGA GGTGGCTGAG AGGTACGATC TCCTCGTGGT GGAGGATGAC CCCTATTCCT ACTTCCTCTT CGAGCCTATC CAGGTCTCCC CTATAAAGGC CCTCGACAAG TCGGATAGGG TCATATACCT CTCCACCGCT TCTAAGATCT TCGCGCCGGG CTTACGCCTA GGCTGGGTTA TTGCCAGCGA GGAGGTGATC AGGTGGTTTA ACCTAGCGAA GCAGTCTCTT AACCTGAACA CCTCTAATCT AGTGCAGTAC ATGTTCCTTG AGGGGCTGAG GCGTAATGTT GTGCTTAAGA ATTTGCCCAA CGTGAGGGAT CTCTACAAGC GGAAGAGAGA TGCAATGCTG GCGGCGCTGG AGACATACAT GCCGCAGGGG GTCAGCTGGA CGAGGCCCTC CGGCGGGATG TTCATCTGGG TGACGGCGCC GCCTCAAATC GACACTAGAG AGCTGTTGAA GGTCGCTGTG ACGCAGTACA AGGTGGCCTT CGTGCCGGGC CACGGCTTCT TTGTTGACCA GTCAGTTAGA AACGCCATGA GGCTGAACTT CACGTATCCC ACCTTTGAGC AGATAAATGA AGGGATCCGC CGCCTCGCCT TGGCGCTCCG GGGCGCATGA
|
Protein sequence | MEIGQLLSSR TRYMSASEIR ELLKWATADV ISFGGGMPDP STFPVEDIAK IASYVLEAYP HKALQYGPTE GVYELREEIA KFSESFRGIK TRAENIIITV GSQEALELLG RVFINPGDVV ITENPTYLAA LQAWRVYEPR LVGIPMDESG MVVEILEERV KQLKAEGARI KFIYTIPTAQ NPTGLTMSQD RRKYLLEVAE RYDLLVVEDD PYSYFLFEPI QVSPIKALDK SDRVIYLSTA SKIFAPGLRL GWVIASEEVI RWFNLAKQSL NLNTSNLVQY MFLEGLRRNV VLKNLPNVRD LYKRKRDAML AALETYMPQG VSWTRPSGGM FIWVTAPPQI DTRELLKVAV TQYKVAFVPG HGFFVDQSVR NAMRLNFTYP TFEQINEGIR RLALALRGA
|
| |