Gene Information |
Locus tag | Acry_2033 |
Symbol | |
ID | 5161415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidiphilium cryptum JF-5 |
Kingdom | Bacteria |
Replicon accession | NC_009484 |
Strand | + |
Start bp | 2228843 |
End bp | 2229988 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640553956 |
Product | signal transduction histidine kinase, nitrogen specific, NtrB |
Protein accession | YP_001235152 |
Protein GI | 148261025 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3852] Signal transduction histidine kinase, nitrogen specific |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
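The coordinate and length fields above can be cross-checked with a short calculation. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends in exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in the record.
length = gene_length_bp(2228843, 2229988)
print(length, protein_length_aa(length))  # prints: 1146 381
```

Under these assumptions the result agrees with the listed Gene Length (1146 bp) and Protein Length (381 aa).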
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.825681 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGGCC TGCGCGGCGC ATTCCGGCGC CGGGGGGCGG TGACGAGGCA GCCGGAACCG CCGATCGATT CCCGCCAGTT GCTGGCGGCG CTGCCTCTCG CCGTCATCGA ACTCGACGCG GACGACCGGT TCCTCTTCGC GAATTACGCG GCGGAGGAAA TGTTCGGCTC GTCGCAGGCC TTCCTTGCCG GCAAGCCGCT GCACGAATTC ATCCCGGCCG ACCATCCCAT CTTCATCCTG CTCGACCGCG CCCGACGCGA CGAAGCGCCG ATCGCCGAAC ACGACCTCGT GCTCGAAGGC CCGCGCTTCG CCCGGCGCGG CGTGTCGATG CAGGTTGCCG CGATCGTGGA TGCGCCGGGC CATCTCGCGG TCACGATGCA GGACAGTTCG GCCGCCCGGG CGCTCGACCA GCAGCTTTCG GCCCGCAACG CGGCCCGCAG CATCACCGGC ATGGCCGCCG TTCTCGCGCA CGAGGTGAAG AATCCCTTGT CGGGCATCCG CGGCGCCGCG CAGCTGCTCG AGGCCAACGC GGCGCCCGAG GACCGCGAAC TCGCCGTGCT GATCCGCGAC GAGGTCGACC GCATCCGCGA ACTCGTCGAA CGCATCGAGG TGTTCAGCGA CAAGCCGATC GATGTGACGG CCGTAAACAT TCACCGGGTG CTCGAGCATG TCAGGCGCCT GGCCCAGTCC GGCTTCGGCG CGCGGATCCG CTTCGTCGAG GCCTATGATC CGTCACTGCC GCCGGTGCTC GGCAATCGCG ACCAGCTCGT CCAGGTTCTG CTGAACCTCA TGAAGAACGC GGCCGAGGCG ATCAGCGAGA CGGAACGGCC GGATGGCGAG ATCACGCTGG CCACCGGCTT CCAGCATGGC GTGCGGCTCG CCGCCTCCCC TGCCCGCGGC CAGCGCAACC TGCCGATCTT CATTTCCGTG CGCGACAACG GCGCGGGAAT CCCCGAGGAT ATCCGCCGCC ACCTCTTCGA GCCCTTCGTC AGCACCAAGG CGGCCGGCTC CGGCCTCGGC CTCGCGCTGG TCGCCAAGAT CGTGGCCGAT CACGGCGGGC TCATCAATGT CGACAGCCGG CCCGGCCGCA CCGAATTCCG TATCCATCTG CCGCAGTTCG AAGATCGCGC CGAGGGCCCG CCATGA
|
Protein sequence | MSGLRGAFRR RGAVTRQPEP PIDSRQLLAA LPLAVIELDA DDRFLFANYA AEEMFGSSQA FLAGKPLHEF IPADHPIFIL LDRARRDEAP IAEHDLVLEG PRFARRGVSM QVAAIVDAPG HLAVTMQDSS AARALDQQLS ARNAARSITG MAAVLAHEVK NPLSGIRGAA QLLEANAAPE DRELAVLIRD EVDRIRELVE RIEVFSDKPI DVTAVNIHRV LEHVRRLAQS GFGARIRFVE AYDPSLPPVL GNRDQLVQVL LNLMKNAAEA ISETERPDGE ITLATGFQHG VRLAASPARG QRNLPIFISV RDNGAGIPED IRRHLFEPFV STKAAGSGLG LALVAKIVAD HGGLINVDSR PGRTEFRIHL PQFEDRAEGP P
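The gene and protein sequences can be compared by translating the CDS and by computing its GC fraction. This is a minimal sketch, not the portal's own method. It assumes the codon assignments of translation table 11, which match the standard code for sense and stop codons (the tables differ in permitted start codons, which does not matter here because the CDS starts with ATG). The helper names `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Amino acids for the 64 codons in NCBI order (bases T, C, A, G).
# These assignments are shared by translation tables 1 and 11.
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product("TCAG", repeat=3), _AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and uppercase the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 21 bases of the gene sequence listed above.
print(translate("ATGAGCGGCC TGCGCGGCGC A"))  # prints: MSGLRGA
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_fraction` on the same input can be compared against the GC content field.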
|
| |