Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_18640 |
Symbol | |
ID | 7760798 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 1845102 |
End bp | 1846520 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643804762 |
Product | sensory histidine protein kinase,two-component |
Protein accession | YP_002799051 |
Protein GI | 226943978 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCTCGA TCCGCGCCCG CACCCTGTTT CTCGTCCTCG CTGTGCTGGC CTTCACCCTC TCGCTGATCT CCTACAAGAG CTACAGGGAC GCCCAGCACG AGATCGAGGA GTTGTTCGAT GCGCAACTGG CGCAGACCGC CCGCCTGCTC GCCGGGCTGG TCGGCCGCGA AATGCTCGAA CGCGAGCGCG AGGAGGTGCA GGCCATGCTC GACGAGGCGC TGGCGATGCA GACCCTGCCG GGCGGGCACG CGGCACTGTC GCTGGGACAC CGCTACGAGG GCAAGCTGGC CTTCCAGGTG TTCGACGAGC GGGGCGAATT GATCCTGCAT TCGGCGAGCG CCCCGCGCAC CCTGTTGTCC AGCCTGCTGA CGCAGGTGCC GGACGCCCTG GCCGACGAGC GGCTGATCGG CTACCACCTG CTGGATCTGG AGCCGTTTCG CTGGCGAGTC TTCGTTCTCC ACGACCGGGC TGACCAGCGC TGGATTCTGG TCGGCGAGCG CGAGGACGTG CGCGGCGAAC TGGCGGACAA GATCGCCAAG CGCAGCCTGT TGCCGGACCT GTTCGGCCTG CCGCTGCTGG CGTTGCTGGT CTGGCTGGCG ATCGGCTGGG GCCTGCGTCC GCTCGAACGC ATGGTCGGGC TGATCCGGGC GCGCGACCCG GACAACCTGG CGCCGTTGCT GCTGGCGCCC CTGCCGCGGG AGCTGGAGCC GGTGGTGGCG GCCCTCAACC GCCTGTTGCT GCAGGTCACC CAGTTGCTGG AGCGCGAGCG CCAGTTGCTC GCCGCCGCCG CCCACGAGCT GCGCACGCCG CTGGCGGTGC TGCGCATTCA TGCGCAGAAC GCCCTCGAGG CGCCCGATCC GGCCGACCGC GCCGAAGCGC TTCGCCAACT GGGGCCGGGC GTCGAGCGGG CGACCCGGGT GGTCGGGCAG TTGCTGGCGC TGGCCCGCCT GGAGCCGGCG GCGGTGCAAC TGCACATGAC CCGTCTCGAC CTGGCCGCCT TCCTGCGCAG CGAACTGGCC GAACTGACGC CGCTGGCGCT GGCCAAGGGG CAGGAGCTGA CCCTGGAAAC CACCGGCGCA GGCGGCTATG TGCTGCGCGG CGATGCGCCC AGCCTGGCGA TCCTGGTGCA GAACCTGGTC GCCAATGCGG TGCAGTACAC CCCGCAAGGG GGTTGTATCC GATTATTGCT CGAGGCCGAT GCGGACGACC TCCTGCTGCG CGTGCAGGAC AGCGGGCCGG GAATTCCGCC GGCGCTGCGC GAGAAGGTGT TCGAGCGCTT CTTCCGTGCC GGAGACGGCC AGGGCGCCGG GCTCGGCCTG TCCATCGTGC GACGGGCGGT GGAGTTGCAC GGCGGCGAGA TCGCCCTGGG CGCTTCGCCC CTCGGCGGAC TGGAGGTTTC CGTTCGCCTG CCCCGTTAG
|
Protein sequence | MRSIRARTLF LVLAVLAFTL SLISYKSYRD AQHEIEELFD AQLAQTARLL AGLVGREMLE REREEVQAML DEALAMQTLP GGHAALSLGH RYEGKLAFQV FDERGELILH SASAPRTLLS SLLTQVPDAL ADERLIGYHL LDLEPFRWRV FVLHDRADQR WILVGEREDV RGELADKIAK RSLLPDLFGL PLLALLVWLA IGWGLRPLER MVGLIRARDP DNLAPLLLAP LPRELEPVVA ALNRLLLQVT QLLERERQLL AAAAHELRTP LAVLRIHAQN ALEAPDPADR AEALRQLGPG VERATRVVGQ LLALARLEPA AVQLHMTRLD LAAFLRSELA ELTPLALAKG QELTLETTGA GGYVLRGDAP SLAILVQNLV ANAVQYTPQG GCIRLLLEAD ADDLLLRVQD SGPGIPPALR EKVFERFFRA GDGQGAGLGL SIVRRAVELH GGEIALGASP LGGLEVSVRL PR
|
| |