Gene Avin_18010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_18010 
SymboldctB 
ID7760736 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp1790768 
End bp1792546 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content73% 
IMG OID643804700 
ProductC4-dicarboylate transport sensory histidine protein kinase, two-component; DctB 
Protein accessionYP_002798989 
Protein GI226943916 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.892386 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGCCCG TCCGCCCGCG CGTGCCGCTG GTCGTCCTGG CGATCTTCGC CGGGCTGCTG 
GCCAGCGTCT GGCTGGCGGG TCTCTATGCC GAGCGGCGCC ACTGGAGCGA ACTCGATATC
GAGGCGCGCG GCCAACTGGA GCTGTACGCC CAGTCGCTGC GCATTCTGGT CGAGCGCTTC
CGTGCGCTGC CGGCGATCAT CGCCCTGGAC ACCGAGATCC AGGCGCTGCT GCAGGAGTCC
TCCGATCCGT CCGCGCGCGA GCGGCTCAAT CGGCGCCTGC AACGGCTCAA CCGGGAGGCG
GGGGTCGGGG TGATCTTCCT GCTCGATCTG CAGGGCACGG CCATCGCCGC CAGCAACTGG
CGGGAGCCGG GCAACTTCGT CGGCGGCAAC TACGCCTTCC GCCCGTATTT CCAGGGAGCC
TTGCGCGAGG GCAGCGCCCG CTATTTCGGC GTCGGCGCGA CCACCGGGGT GCCCGGCTAC
TTCCTGTCCC GCGCGGTGCG CGACGCGGGG GGACGCACGC TCGGCGTGCT GGCGCTCAAG
CTGGTGATGG AGGACCTGCA GCGCGACTGG GCCGGACAGC CGGGCATCCT GCTGGTCGTC
GACCGGCAGG CGGTGGCCAT TCTCGCCAGC CGGCCGGCCT GGCGGTTCCG TCTGCTGCAG
GAGCCGGCGG CGGCGGAGCC CCCCCGTCCG GTCGATGCCC GCCGCTATGG CATCCGCGAA
GTGCGGCCGC TGGCGCTGCA GCCGTTGCGC CCGCTCGACG GCGGCGGCGA GCTGCTGCGC
ATCGACGGCC CGGAGGGCCG GCGCGACTAC CTGCGCCTGA GCCAGCCGCT GGCCGACGAG
GGCTGGACCC TGGAGTTGCT GCGCGAGGCG CGCGTGCCGG GTGCCGCGGT GCGCAGCTAC
GGGCTGGCCG CCGCCGGCAT CTGGCTGGCG TTGGCGTTCC TCTTCCTGTT CCTCGGCCAG
CGGCAGAAGA ACCGCCGCCT GCTGCTGCGC CGCCGGGCCG AGCTGGAGCG GTTGGTCGAG
GAGCGCACCG CGGCGTTGCG CAAGGCCCAG GACGAACTGG TGCAGGCGGC CAAGATGGCT
GCCCTGGGGC AGATGTCGGC GGCCCTGGCC CATGAGATCA ACCAGCCGCT GACCGCCATG
CGCATGCAAC TCGGCAGCCT GCAACTGCTG CTGGCGAAGA ACGACCCGGC GGCGATGCGC
ACCTGCCTGG AGCGCATCGA TGGGCTGCTC ACGCGCATGG CGGCGCTCAC CGGCCATCTC
AAGACCTTCG CCCGCGATAC CCCCGGCGGC CTGCGCGAGC GCCTGGCGCT GGATACCGTC
GTCGACCATG CGCTGCTCCT GCTGGAGGCG CGTATCCGCC AGGACGCGGT GCGGGTGCTG
CGCCTGCGCG CGCCGCGGGC CTGGGTCGAG GGCAATGCGA TCCGCTTCGA ACAGGTGCTG
GTCAACCTGC TGCACAATGC TCTGGACGCC ATGGCCGGGC GGGCGCGCCG CGAACTGCGC
ATCGCGCTGC GTCGCGACGG CGGCGACTGG CTGCTGAGCG TCGCCGACAG CGGCGGCGGC
ATCGCCGCCG AGCACCTGTT GCGCGTGTTC GAGCCGTTCT TCACCACCAA GCCGGTGGGC
GAGGGGCTCG GCCTCGGCCT GGCCGTATCC TATGGCATCG TCCGCGAATC GGGCGGTCAG
ATGGAGGTCG GCAATACCGG CGACGGCGCG CAGTTCGTCA TCCGGCTGCC GGCCGCGGCG
CCGCCGGATA CACTGCAGGC CGGCTCGCAG GAGGGTTAG
 
Protein sequence
MPPVRPRVPL VVLAIFAGLL ASVWLAGLYA ERRHWSELDI EARGQLELYA QSLRILVERF 
RALPAIIALD TEIQALLQES SDPSARERLN RRLQRLNREA GVGVIFLLDL QGTAIAASNW
REPGNFVGGN YAFRPYFQGA LREGSARYFG VGATTGVPGY FLSRAVRDAG GRTLGVLALK
LVMEDLQRDW AGQPGILLVV DRQAVAILAS RPAWRFRLLQ EPAAAEPPRP VDARRYGIRE
VRPLALQPLR PLDGGGELLR IDGPEGRRDY LRLSQPLADE GWTLELLREA RVPGAAVRSY
GLAAAGIWLA LAFLFLFLGQ RQKNRRLLLR RRAELERLVE ERTAALRKAQ DELVQAAKMA
ALGQMSAALA HEINQPLTAM RMQLGSLQLL LAKNDPAAMR TCLERIDGLL TRMAALTGHL
KTFARDTPGG LRERLALDTV VDHALLLLEA RIRQDAVRVL RLRAPRAWVE GNAIRFEQVL
VNLLHNALDA MAGRARRELR IALRRDGGDW LLSVADSGGG IAAEHLLRVF EPFFTTKPVG
EGLGLGLAVS YGIVRESGGQ MEVGNTGDGA QFVIRLPAAA PPDTLQAGSQ EG