Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_42670 |
Symbol | cbrA |
ID | 7763142 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 4298743 |
End bp | 4301688 |
Gene Length | 2946 bp |
Protein Length | 981 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643807122 |
Product | Sensory histidine protein kinase CbrA |
Protein accession | YP_002801365 |
Protein GI | 226946292 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.416099 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTTTA GCCTGAGTGA ACTGCTGCTG ATCAGCGCCG CCTACCTGCT GACCCTGTTC GGCGTCGCCT GGCTCAGCGA GCAGGGCCTG ATCCCGCAGC GGCTGATCCG CCATCCGCTG GTCTACACCC TATCCCTCGG CGTGTATGCC AGCGCCTGGG CCTTCTACGG CGCGGTCGGC ATGGCCTACC AGTACGGCTA CGGCTTTCTC GCCTGCTACC TGGGGGTTTC CGGCGCCTTC ATGCTGGCGC CGGTGCTGCT CTACCCGATC CTGCGCCTGA CCCGCACCTA CCAACTGTCG TCCCTGGCCG ACCTGTTCGC CTTCCGCTTC CGCAGCACCT GGGCCGGCGC GCTGACCACC CTGTTCATGC TGGTCTCGGT ACTGCCGCTG CTGGCCCTGC AGATCCAGGC CGTCACCGAC TCGGTGGGCA TCCTCACCCA GGAGCCGGTG AAAGCGCGCG TGGCCCTGAG CTTCTGCGCC CTGATCATGC TCTTCACCAT CCTCTTCGGC TCGCGGCATA TCGCCACCCG CGAGAAACAC CAGGGCCTGG TCTGCGCCAT CGCCTTCGAG TCGCTGGTCA AGCTGGTGGC CCTCGGCGGT ATCGGGCTCT ATACCCTGTA CCGGGTGTTC GACGGTCCCC AGGGCCTGGA GCAGTGGCTG CTGCACCAGC GGGAAGTGCT GGGCACGCTG CACACGCCGC TGCAGGAAGG CCCCTGGCGC ACCCTGCTGC TGCTGTTCTT CGCCTCGGCC ATCGTCATGC CGCACATGTA CCACATGGCC TTCACCGAGA ACCTCAACCC CCGCGGCCTG GCCAGCGCCA GTTGGGGCCT GCCGCTGTTC CTGTTGCTGA TGAGCCTGGC CGTCCCGCTG ATCCTCTGGG CCGGCATGCA CCTGGAAGTC GCCACCGGCC CGGAATATTT CACCCTGGGC GTGGGCCTGG CGCTGGAAAA CGCGCCCCTC ACTCTGCTGG CCTATGTCGG TGGACTGTCC GCCTCCAGCG GGCTGATCAT CGTCCTGACC CTCGCCCTGT CCGGGATGAC CCTCAATCAC CTGGTGCTGC CGCTGTACCA GCCACCGGCC GAAGGCAACA TCTACCGCTG GCTGAAATGG ACGCGGCGTA CGCTGATCAT CGCCATCATC GCCGCCGGCT ACGCCTTCTA CCGAATGCTG GACGCCAGCC AGAACCTGTC CAACCTCGGC GTCACCGCCT TCGTCGCCAC CTTGCAGTTC CTTCCCGGCG TGCTTTCGAC CCTCTACTGG CAGACCGCCA ACCGTCGCGG TTTCATCGCT GGCCTCCTGG TCGGCATGAC GGTATGGCTG GGCGGCATGA TGCTGCCCAT GCTCGGCAAC CTGCAGGACG TGCACCTGCC GTTGCTGAAT TTCGTCTATG CGCTGGACGA CACCAGTTGG CACCTGGCGG CGATCACCTC GCTGGCCGCC AACGTGCTGG TGTTCTCCCT GGTGTCGATA TTCACCGAGC CGAGCCCCGA GGAGCGCAGC GCCGGCGAGG CCTGCGCCGT GGACAACGTG CGCCGGCCGC AGCGCCGCGA ACTGCTGGCC ACCTCCGCCC AGGAGTTCGC CGCCCGGCTG ACCCGCCCGC TCGGCGCCAG GACCGCTCAG CGCGAGGTGA AGCAGGCCCT GCGCGACCTG CACCTGCCGT TCGACGAGAG CCGCCCCTTC GCCCTGCGAC GCCTGCGCGA CCGCATCGAG GCCAACCTGT CCGGCCTGAT GGGGCCGAGC GTGGCGCAGG ATATCGTCGA GACCTTCCTG CCCTACAAGA CCGCCAGCGA AGACTACATC ACCGAGGACA TCCACTTCAT CGAAAGCCGC CTGGAAGACT ACCGCTCGCG CCTGACCGGC CTGGCCGCCG AACTCGACGC GTTGCGCCGC TACCACCGGC AGACCCTGCA GGAACTGCCG ATGGGCGTCT GCTCGCTGTC CAAGGACAAG GAGATCCTCA TGTGGAACCG GGCCATGGAG GAACTCACCG GCATATCCGC CGAGCGCGTG GTCGGCTCCT GTCTGACCAC CCTCGACCCC CCCTGGCGGA ACCTGCTGGA GGACTTCGTC CAGCGCTCCG ACGAACACCT GTACAAGCAG CGGCTGTCGC TGGACGGCCA GATGCGCTGG CTCAACCTGC ACAAGGCGGC GATCGACGAG CCCCTGGCCC CGGGCAGCAG CGGCCTGGTC CTGCTGGTCG AAGATCTCAC CGAAACCCAA TTGCTGGAAA ACAAACTGGC GCACTCCGAG CGCCTGGCCA GCATCGGCCG CCTGGCCGCC GGCGTGGCTC ACGAGATCGG CAATCCGATC ACCGGCATCG CCTGCCTGGC GCAGAACCTG CGCGAGGAAC ACGAGGACGA TGCCGAACTG GGCGAGATCA GCAGCCAGAT CATCGACCAG ACCAAGCGGG TGACGCGCAT CGTCCGCTCG CTGATGAGCT TCGCCCATGC CGGCAGCCAC ACGCAGGCCA GCGAAGCGGT GTGCCTGGCC GACATCGCCC GCGAGGCGAT CGGCCTGCTG TCGCTCAACC GCGAAGGCAC CGAGGTCAGC TTCCTCAATC TCTGCGATCC CGGCCACTGC GTGGTCGGCG ACCCTCAGCG GCTGGTGCAG GTACTGGTCA ACCTGCTTTC CAACGCCCGC GACGCCTCCC CTCCCGGCGG AGTCATCCGC GTGCGCAGCG AGGCCGTGGA ACAGAGCATC GACCTGATCG TCGAGGATGA GGGCAGCGGC ATCCCCAAGG CCATCGCCGG CCGTCTGTTC GACCCCTTCT TCACCACCAA GGACCCAGGC AAGGGGACCG GCCTCGGCCT CGCACTGGTC TATTCGATCG TGGAAGAGCA TTATGGCCGG ATCGCCGTCG ACAGCCCGGC CGACCCCGAG CGGCAACGCG GGACCCGCAT CCGGGTCACC CTGCCACGGC ATGTCGACAT GGCGCCGGTG CCGTGA
|
Protein sequence | MSFSLSELLL ISAAYLLTLF GVAWLSEQGL IPQRLIRHPL VYTLSLGVYA SAWAFYGAVG MAYQYGYGFL ACYLGVSGAF MLAPVLLYPI LRLTRTYQLS SLADLFAFRF RSTWAGALTT LFMLVSVLPL LALQIQAVTD SVGILTQEPV KARVALSFCA LIMLFTILFG SRHIATREKH QGLVCAIAFE SLVKLVALGG IGLYTLYRVF DGPQGLEQWL LHQREVLGTL HTPLQEGPWR TLLLLFFASA IVMPHMYHMA FTENLNPRGL ASASWGLPLF LLLMSLAVPL ILWAGMHLEV ATGPEYFTLG VGLALENAPL TLLAYVGGLS ASSGLIIVLT LALSGMTLNH LVLPLYQPPA EGNIYRWLKW TRRTLIIAII AAGYAFYRML DASQNLSNLG VTAFVATLQF LPGVLSTLYW QTANRRGFIA GLLVGMTVWL GGMMLPMLGN LQDVHLPLLN FVYALDDTSW HLAAITSLAA NVLVFSLVSI FTEPSPEERS AGEACAVDNV RRPQRRELLA TSAQEFAARL TRPLGARTAQ REVKQALRDL HLPFDESRPF ALRRLRDRIE ANLSGLMGPS VAQDIVETFL PYKTASEDYI TEDIHFIESR LEDYRSRLTG LAAELDALRR YHRQTLQELP MGVCSLSKDK EILMWNRAME ELTGISAERV VGSCLTTLDP PWRNLLEDFV QRSDEHLYKQ RLSLDGQMRW LNLHKAAIDE PLAPGSSGLV LLVEDLTETQ LLENKLAHSE RLASIGRLAA GVAHEIGNPI TGIACLAQNL REEHEDDAEL GEISSQIIDQ TKRVTRIVRS LMSFAHAGSH TQASEAVCLA DIAREAIGLL SLNREGTEVS FLNLCDPGHC VVGDPQRLVQ VLVNLLSNAR DASPPGGVIR VRSEAVEQSI DLIVEDEGSG IPKAIAGRLF DPFFTTKDPG KGTGLGLALV YSIVEEHYGR IAVDSPADPE RQRGTRIRVT LPRHVDMAPV P
|
| |