Gene Avi_5539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_5539 
Symbol 
ID7381451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011988 
Strand
Start bp536254 
End bp538287 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content59% 
IMG OID643649123 
Productsignal transduction histidine kinase 
Protein accessionYP_002547360 
Protein GI222106569 
COG category[T] Signal transduction mechanisms 
COG ID[COG2203] FOG: GAF domain
[COG3920] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCCAAT CATTAAACGA CGACCAGGCC GAGCAGCGGG AGAGACGGCG CCTTGAGGCA 
CTCAGGGCCT ATGACGTGCT TGACACGCCG CGCGAGAAGG ATTTTGACGA TATTGCCGCC
CTCGCCTCGC GGATCTGCGC GACGCCGATC GCCGTCGTCA ATCTGATCGA TGACAGCAGG
CAATTCTTCA AGGCGGAAGT CGGTCTTGGC GTGCGCGAGA CACCGTTCGA CAGTTCCTTC
TGCGCCAAGG CTATTCTGGA AGACGATTTC CTGATGATAC CGGATGCAAG TCAAGACAGC
CGGTTCAATC GCAATCCGCT GGTCACCGGC GAACCCCATC TGCGGTTTTA TGCCGGAGCC
GTCCTGAAAA CCGCTGATAA TTTACCCATC GGCACCGTCT GCGTGCTGGG TTTCGAACCC
AAGCAGCTGG ATGAGCTTCA ACAGGACACG TTGAAGGTTC TTGCCCGACA GGTGATGGTG
CAGCTGGAAT TACGCAAGGC GCTGAAGGAA AAAGCGCGTG AGGCGGAGGT ACAGCGCCGA
TTGAGCGAGC GCAGGCTGGC ACGCGTCACG GCCATGGAAC AACAGGACGA GCGCTCGCGA
AGCGCCCAGG CGGCCGGCCG GGTCGGCACG TTCGAATTGG ACATCGCTAC CAATACCATG
ACGGTTTCCA GCGAATTCTG TCGGGTCTTC GGCATTCCCG TGCAGACGTC CTATCCGGCT
TCGGCGATTG AGGATCTGGT GCATGAGGAC GATCGTGGCC TGCGCTCAAA CCCGGTCACA
CGGCGAGAGG GTTCGGCCAG CCCTGACGTG GATTATCGGA TCATCCGCGC CGACGATGAT
GAATTGCGCT GGATTTCGCG GCGGGCGCGC TTCGTTCACA ACGAAAAAGG CGAGATCACC
CGCATGGTCG GCGTGGTCTT CGACACCACC GATGCCAAGC TGAAAGAAGC GAAAAAGGCT
GCACTGTTAA AGCTGGGAGA CGAACTTCGC GCCGCCAGCA CTGTGGAAGA GATCACCCAA
AGCGCCGCCG CCATCCTGTC GGACGGTCTT GGCGTTGCCC GCGCAGGCTA TGCCGTGGTG
GATCGGACCG ACAATAGCTT TGCGGTCGCC TTCGACTGGG CATCCCATGG CACGGTCTCA
CTGGCGGGGC GCCATTCTCT CGAGAGATTT TCAGAAACGA TTGAGCACCT GCGCAAGGGA
GAGACATTGT CGATCCCCAA CGTCGCCTCA AGCAAATGGC CGACCACAGA AAAGGATGCC
TATGCCTCCA TCGGGGTTTC GTCGCTCATC AAGGTACCGA TCCTCAAAGG CGGAAACCTG
GTCGGTATTC TCTTTGCCCA TGACGACAAG CCCCGGACAT GGAGCCAGGA TGAACTGGAC
TTTACGCGCG GCATTGCCGA CAGAACCTAT GCAGCCCTTG CCAAGGTTCA GGCCGAGGAA
GAACAGGAAC TTCTCAACCA CGAACTGAGC CACCGGCTGA AAAATACGCT GGCCATGGTC
CAGGCCATTG CCGGGCAGAC ATTGAAGGAC GTCTCGGAAA AGGAAGCCGT CAACGCCTTC
CTGGCCCGTC TGCATGCCCT GGGCGCCGCC CATGACGTTC TTCTGCGGCA AAACTGGTCG
GCGGCGAAAA TGCGCGATGT GATCGAGAAA GTGCTGGCCT TGCATGCCGA CGGCGACCGT
ATTTACGTGG ATGGTCCCGA TCTGGCGCTT GGCCCGAAGG CCGGTCTGTC GCTATCTTTG
CTGCTGCATG AACTGGCGAC CAATGCCCTC AAATATGGTG CGCTTTCCAA CGCTAGCGGC
CGCGTTTCCG TCAAATGGTG GATCGCCGAC AACGACGCCA TTCCGACCCT GTCGATGACA
TGGACGGAAA GCGGTGGACC GACCGTGAGC GAACCGACCC GCAAAGGCTT CGGCTCACGC
CTGATCCGCA TGGGACTGGC TGGAACAGGC AATGCCCATA AGGACTATCG CCCGTCCGGC
CTGACGGCAA CGTTCCATGC ACCGCTCACC ACAATCCAGG AAACCGGAGA CTAG
 
Protein sequence
MRQSLNDDQA EQRERRRLEA LRAYDVLDTP REKDFDDIAA LASRICATPI AVVNLIDDSR 
QFFKAEVGLG VRETPFDSSF CAKAILEDDF LMIPDASQDS RFNRNPLVTG EPHLRFYAGA
VLKTADNLPI GTVCVLGFEP KQLDELQQDT LKVLARQVMV QLELRKALKE KAREAEVQRR
LSERRLARVT AMEQQDERSR SAQAAGRVGT FELDIATNTM TVSSEFCRVF GIPVQTSYPA
SAIEDLVHED DRGLRSNPVT RREGSASPDV DYRIIRADDD ELRWISRRAR FVHNEKGEIT
RMVGVVFDTT DAKLKEAKKA ALLKLGDELR AASTVEEITQ SAAAILSDGL GVARAGYAVV
DRTDNSFAVA FDWASHGTVS LAGRHSLERF SETIEHLRKG ETLSIPNVAS SKWPTTEKDA
YASIGVSSLI KVPILKGGNL VGILFAHDDK PRTWSQDELD FTRGIADRTY AALAKVQAEE
EQELLNHELS HRLKNTLAMV QAIAGQTLKD VSEKEAVNAF LARLHALGAA HDVLLRQNWS
AAKMRDVIEK VLALHADGDR IYVDGPDLAL GPKAGLSLSL LLHELATNAL KYGALSNASG
RVSVKWWIAD NDAIPTLSMT WTESGGPTVS EPTRKGFGSR LIRMGLAGTG NAHKDYRPSG
LTATFHAPLT TIQETGD