Gene Acry_2033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcry_2033 
Symbol 
ID5161415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidiphilium cryptum JF-5 
KingdomBacteria 
Replicon accessionNC_009484 
Strand
Start bp2228843 
End bp2229988 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content69% 
IMG OID640553956 
Productsignal transduction histidine kinase, nitrogen specific, NtrB 
Protein accessionYP_001235152 
Protein GI148261025 
COG category[T] Signal transduction mechanisms 
COG ID[COG3852] Signal transduction histidine kinase, nitrogen specific 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.825681 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGGCC TGCGCGGCGC ATTCCGGCGC CGGGGGGCGG TGACGAGGCA GCCGGAACCG 
CCGATCGATT CCCGCCAGTT GCTGGCGGCG CTGCCTCTCG CCGTCATCGA ACTCGACGCG
GACGACCGGT TCCTCTTCGC GAATTACGCG GCGGAGGAAA TGTTCGGCTC GTCGCAGGCC
TTCCTTGCCG GCAAGCCGCT GCACGAATTC ATCCCGGCCG ACCATCCCAT CTTCATCCTG
CTCGACCGCG CCCGACGCGA CGAAGCGCCG ATCGCCGAAC ACGACCTCGT GCTCGAAGGC
CCGCGCTTCG CCCGGCGCGG CGTGTCGATG CAGGTTGCCG CGATCGTGGA TGCGCCGGGC
CATCTCGCGG TCACGATGCA GGACAGTTCG GCCGCCCGGG CGCTCGACCA GCAGCTTTCG
GCCCGCAACG CGGCCCGCAG CATCACCGGC ATGGCCGCCG TTCTCGCGCA CGAGGTGAAG
AATCCCTTGT CGGGCATCCG CGGCGCCGCG CAGCTGCTCG AGGCCAACGC GGCGCCCGAG
GACCGCGAAC TCGCCGTGCT GATCCGCGAC GAGGTCGACC GCATCCGCGA ACTCGTCGAA
CGCATCGAGG TGTTCAGCGA CAAGCCGATC GATGTGACGG CCGTAAACAT TCACCGGGTG
CTCGAGCATG TCAGGCGCCT GGCCCAGTCC GGCTTCGGCG CGCGGATCCG CTTCGTCGAG
GCCTATGATC CGTCACTGCC GCCGGTGCTC GGCAATCGCG ACCAGCTCGT CCAGGTTCTG
CTGAACCTCA TGAAGAACGC GGCCGAGGCG ATCAGCGAGA CGGAACGGCC GGATGGCGAG
ATCACGCTGG CCACCGGCTT CCAGCATGGC GTGCGGCTCG CCGCCTCCCC TGCCCGCGGC
CAGCGCAACC TGCCGATCTT CATTTCCGTG CGCGACAACG GCGCGGGAAT CCCCGAGGAT
ATCCGCCGCC ACCTCTTCGA GCCCTTCGTC AGCACCAAGG CGGCCGGCTC CGGCCTCGGC
CTCGCGCTGG TCGCCAAGAT CGTGGCCGAT CACGGCGGGC TCATCAATGT CGACAGCCGG
CCCGGCCGCA CCGAATTCCG TATCCATCTG CCGCAGTTCG AAGATCGCGC CGAGGGCCCG
CCATGA
 
Protein sequence
MSGLRGAFRR RGAVTRQPEP PIDSRQLLAA LPLAVIELDA DDRFLFANYA AEEMFGSSQA 
FLAGKPLHEF IPADHPIFIL LDRARRDEAP IAEHDLVLEG PRFARRGVSM QVAAIVDAPG
HLAVTMQDSS AARALDQQLS ARNAARSITG MAAVLAHEVK NPLSGIRGAA QLLEANAAPE
DRELAVLIRD EVDRIRELVE RIEVFSDKPI DVTAVNIHRV LEHVRRLAQS GFGARIRFVE
AYDPSLPPVL GNRDQLVQVL LNLMKNAAEA ISETERPDGE ITLATGFQHG VRLAASPARG
QRNLPIFISV RDNGAGIPED IRRHLFEPFV STKAAGSGLG LALVAKIVAD HGGLINVDSR
PGRTEFRIHL PQFEDRAEGP P