Gene Avin_42670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_42670 
SymbolcbrA 
ID7763142 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp4298743 
End bp4301688 
Gene Length2946 bp 
Protein Length981 aa 
Translation table11 
GC content67% 
IMG OID643807122 
ProductSensory histidine protein kinase CbrA 
Protein accessionYP_002801365 
Protein GI226946292 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.416099 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTTTA GCCTGAGTGA ACTGCTGCTG ATCAGCGCCG CCTACCTGCT GACCCTGTTC 
GGCGTCGCCT GGCTCAGCGA GCAGGGCCTG ATCCCGCAGC GGCTGATCCG CCATCCGCTG
GTCTACACCC TATCCCTCGG CGTGTATGCC AGCGCCTGGG CCTTCTACGG CGCGGTCGGC
ATGGCCTACC AGTACGGCTA CGGCTTTCTC GCCTGCTACC TGGGGGTTTC CGGCGCCTTC
ATGCTGGCGC CGGTGCTGCT CTACCCGATC CTGCGCCTGA CCCGCACCTA CCAACTGTCG
TCCCTGGCCG ACCTGTTCGC CTTCCGCTTC CGCAGCACCT GGGCCGGCGC GCTGACCACC
CTGTTCATGC TGGTCTCGGT ACTGCCGCTG CTGGCCCTGC AGATCCAGGC CGTCACCGAC
TCGGTGGGCA TCCTCACCCA GGAGCCGGTG AAAGCGCGCG TGGCCCTGAG CTTCTGCGCC
CTGATCATGC TCTTCACCAT CCTCTTCGGC TCGCGGCATA TCGCCACCCG CGAGAAACAC
CAGGGCCTGG TCTGCGCCAT CGCCTTCGAG TCGCTGGTCA AGCTGGTGGC CCTCGGCGGT
ATCGGGCTCT ATACCCTGTA CCGGGTGTTC GACGGTCCCC AGGGCCTGGA GCAGTGGCTG
CTGCACCAGC GGGAAGTGCT GGGCACGCTG CACACGCCGC TGCAGGAAGG CCCCTGGCGC
ACCCTGCTGC TGCTGTTCTT CGCCTCGGCC ATCGTCATGC CGCACATGTA CCACATGGCC
TTCACCGAGA ACCTCAACCC CCGCGGCCTG GCCAGCGCCA GTTGGGGCCT GCCGCTGTTC
CTGTTGCTGA TGAGCCTGGC CGTCCCGCTG ATCCTCTGGG CCGGCATGCA CCTGGAAGTC
GCCACCGGCC CGGAATATTT CACCCTGGGC GTGGGCCTGG CGCTGGAAAA CGCGCCCCTC
ACTCTGCTGG CCTATGTCGG TGGACTGTCC GCCTCCAGCG GGCTGATCAT CGTCCTGACC
CTCGCCCTGT CCGGGATGAC CCTCAATCAC CTGGTGCTGC CGCTGTACCA GCCACCGGCC
GAAGGCAACA TCTACCGCTG GCTGAAATGG ACGCGGCGTA CGCTGATCAT CGCCATCATC
GCCGCCGGCT ACGCCTTCTA CCGAATGCTG GACGCCAGCC AGAACCTGTC CAACCTCGGC
GTCACCGCCT TCGTCGCCAC CTTGCAGTTC CTTCCCGGCG TGCTTTCGAC CCTCTACTGG
CAGACCGCCA ACCGTCGCGG TTTCATCGCT GGCCTCCTGG TCGGCATGAC GGTATGGCTG
GGCGGCATGA TGCTGCCCAT GCTCGGCAAC CTGCAGGACG TGCACCTGCC GTTGCTGAAT
TTCGTCTATG CGCTGGACGA CACCAGTTGG CACCTGGCGG CGATCACCTC GCTGGCCGCC
AACGTGCTGG TGTTCTCCCT GGTGTCGATA TTCACCGAGC CGAGCCCCGA GGAGCGCAGC
GCCGGCGAGG CCTGCGCCGT GGACAACGTG CGCCGGCCGC AGCGCCGCGA ACTGCTGGCC
ACCTCCGCCC AGGAGTTCGC CGCCCGGCTG ACCCGCCCGC TCGGCGCCAG GACCGCTCAG
CGCGAGGTGA AGCAGGCCCT GCGCGACCTG CACCTGCCGT TCGACGAGAG CCGCCCCTTC
GCCCTGCGAC GCCTGCGCGA CCGCATCGAG GCCAACCTGT CCGGCCTGAT GGGGCCGAGC
GTGGCGCAGG ATATCGTCGA GACCTTCCTG CCCTACAAGA CCGCCAGCGA AGACTACATC
ACCGAGGACA TCCACTTCAT CGAAAGCCGC CTGGAAGACT ACCGCTCGCG CCTGACCGGC
CTGGCCGCCG AACTCGACGC GTTGCGCCGC TACCACCGGC AGACCCTGCA GGAACTGCCG
ATGGGCGTCT GCTCGCTGTC CAAGGACAAG GAGATCCTCA TGTGGAACCG GGCCATGGAG
GAACTCACCG GCATATCCGC CGAGCGCGTG GTCGGCTCCT GTCTGACCAC CCTCGACCCC
CCCTGGCGGA ACCTGCTGGA GGACTTCGTC CAGCGCTCCG ACGAACACCT GTACAAGCAG
CGGCTGTCGC TGGACGGCCA GATGCGCTGG CTCAACCTGC ACAAGGCGGC GATCGACGAG
CCCCTGGCCC CGGGCAGCAG CGGCCTGGTC CTGCTGGTCG AAGATCTCAC CGAAACCCAA
TTGCTGGAAA ACAAACTGGC GCACTCCGAG CGCCTGGCCA GCATCGGCCG CCTGGCCGCC
GGCGTGGCTC ACGAGATCGG CAATCCGATC ACCGGCATCG CCTGCCTGGC GCAGAACCTG
CGCGAGGAAC ACGAGGACGA TGCCGAACTG GGCGAGATCA GCAGCCAGAT CATCGACCAG
ACCAAGCGGG TGACGCGCAT CGTCCGCTCG CTGATGAGCT TCGCCCATGC CGGCAGCCAC
ACGCAGGCCA GCGAAGCGGT GTGCCTGGCC GACATCGCCC GCGAGGCGAT CGGCCTGCTG
TCGCTCAACC GCGAAGGCAC CGAGGTCAGC TTCCTCAATC TCTGCGATCC CGGCCACTGC
GTGGTCGGCG ACCCTCAGCG GCTGGTGCAG GTACTGGTCA ACCTGCTTTC CAACGCCCGC
GACGCCTCCC CTCCCGGCGG AGTCATCCGC GTGCGCAGCG AGGCCGTGGA ACAGAGCATC
GACCTGATCG TCGAGGATGA GGGCAGCGGC ATCCCCAAGG CCATCGCCGG CCGTCTGTTC
GACCCCTTCT TCACCACCAA GGACCCAGGC AAGGGGACCG GCCTCGGCCT CGCACTGGTC
TATTCGATCG TGGAAGAGCA TTATGGCCGG ATCGCCGTCG ACAGCCCGGC CGACCCCGAG
CGGCAACGCG GGACCCGCAT CCGGGTCACC CTGCCACGGC ATGTCGACAT GGCGCCGGTG
CCGTGA
 
Protein sequence
MSFSLSELLL ISAAYLLTLF GVAWLSEQGL IPQRLIRHPL VYTLSLGVYA SAWAFYGAVG 
MAYQYGYGFL ACYLGVSGAF MLAPVLLYPI LRLTRTYQLS SLADLFAFRF RSTWAGALTT
LFMLVSVLPL LALQIQAVTD SVGILTQEPV KARVALSFCA LIMLFTILFG SRHIATREKH
QGLVCAIAFE SLVKLVALGG IGLYTLYRVF DGPQGLEQWL LHQREVLGTL HTPLQEGPWR
TLLLLFFASA IVMPHMYHMA FTENLNPRGL ASASWGLPLF LLLMSLAVPL ILWAGMHLEV
ATGPEYFTLG VGLALENAPL TLLAYVGGLS ASSGLIIVLT LALSGMTLNH LVLPLYQPPA
EGNIYRWLKW TRRTLIIAII AAGYAFYRML DASQNLSNLG VTAFVATLQF LPGVLSTLYW
QTANRRGFIA GLLVGMTVWL GGMMLPMLGN LQDVHLPLLN FVYALDDTSW HLAAITSLAA
NVLVFSLVSI FTEPSPEERS AGEACAVDNV RRPQRRELLA TSAQEFAARL TRPLGARTAQ
REVKQALRDL HLPFDESRPF ALRRLRDRIE ANLSGLMGPS VAQDIVETFL PYKTASEDYI
TEDIHFIESR LEDYRSRLTG LAAELDALRR YHRQTLQELP MGVCSLSKDK EILMWNRAME
ELTGISAERV VGSCLTTLDP PWRNLLEDFV QRSDEHLYKQ RLSLDGQMRW LNLHKAAIDE
PLAPGSSGLV LLVEDLTETQ LLENKLAHSE RLASIGRLAA GVAHEIGNPI TGIACLAQNL
REEHEDDAEL GEISSQIIDQ TKRVTRIVRS LMSFAHAGSH TQASEAVCLA DIAREAIGLL
SLNREGTEVS FLNLCDPGHC VVGDPQRLVQ VLVNLLSNAR DASPPGGVIR VRSEAVEQSI
DLIVEDEGSG IPKAIAGRLF DPFFTTKDPG KGTGLGLALV YSIVEEHYGR IAVDSPADPE
RQRGTRIRVT LPRHVDMAPV P