Gene Dvul_2947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_2947 
Symbol 
ID4663795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp3451775 
End bp3454024 
Gene Length2250 bp 
Protein Length749 aa 
Translation table11 
GC content58% 
IMG OID639821209 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_968385 
Protein GI120603985 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein
[COG4564] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.732257 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCGGT CTATCATTAT TCAGCTCATC ATCATCGTGG TTATCGGTCT CGTGGTCGAA 
GGTGGTATCT TTTTGGGGTA TGTCTCTTTC GACTTGTCGC AATTTGCGGC TTTTCAGGCG
CAGAAGACCC GCGAGAGCAT CTATACTCAC GAACAGTACT CGCTTCGCGA TATGGTGGAC
TCGGCAAGCA GCATTGTGGA GAAATACTAC GCGCAGTCCA AGGACATGGA AGCCCTGAAG
CGCCTGAAGC GCGATGAACT GAAACTGGTG GTCGATGCTG CGGCATCCAT GGTTCAGGCT
CAGGTGGCCG CAGCACCTCA GGAACGGAAG GCGTTTGTCG CGCGCGAAAT GCTCGCATCC
TTGCGCGGGT TGCGTTTTGC GGGTGATAAT TACCTGTGGG TCAACGATCT GGACACGGTC
GTCCTCATGC ATCCTGTATC GCCGCAGATG GAAGGGAAGC CTCAACGGGA CATCCGTGAC
GAACAGGGCA AGGCCATATT CACGGAATTC GTTTCCATTG CGAGGTCGAA GGGCGAGGGC
ATGGTCGACT ATATGTGGCC CCGCCCCGGC CAGAAGCAGC CCCAGGTCAA GGTGTCATTC
GTCAAACTCA TTCCGGAACT GGGCTGGGTC GTCGGTGCAG GAGCCTATCT CGATGATATG
ACAGCGCAAC TCAAGCAGGA GGCTCTGGAC CAGATCGCCC GTATCAGGCT CTCTGACGGA
AACTACTTTT GGGTGAACGA TTTGCGTCCC TATATGGTCA TGCACCCTGT CAGACCTGAA
CTGGACGGCA CCGACCTTTC ACGTACGACA GATGCTGAAG GCAAGCTGCT CTTCGTCGAG
ATGGCGAAGG TGGCACGGGA CAAGGGAGCG GGTACCGTAG CCTACAGATT CGGCAAGCCG
GGTGCCGCAG GTGATTTCCC CAAGCTCTCG TATGTGAAGC TGTTCGAGCC ATGGGGATGG
GTCATCGGGA TGGGTGTCTA CATGGATGAT GTCGACAGGG AGATCGTGGC GGAGCAGGAG
CGTTTCAGTC AGGCTATTGG CGGGCTGATG ACGCGTAGCG GTCTCATCGG ATTGATGATC
GCAACGGCCA TGGTGGGGCT GGTGCTCTTC TACATCCACT ACCGTCTGCG TCAGCCCATG
AATCAGCTTG TTCGCTATGC CGGGCAGGTC GCCTCTGGTG AACTCGATGC CAGTGTCAGC
GGACGTTTTC AGGGAGAACT CCTGCTTCTG GCGGATGCCT TGCGTTCGAT GGTGGCAAGC
CTCGGAGAGC GCATGGAAGA GGCGCAGCTC AAAGGCCGTC TGGCCGAGGA GGAGGCTGCT
CATGCGCGCG CTGCCACGGC AGAGGCTGAC GATGCCCGGC GTAAGGCGGA AGGGGCAAAG
GCCGAGGGAA TGCTCGCGGC TGCAGGCATC CTCGACACCA TTGTCACGTC ATTGACCCAG
GCATCACAGA AGGTTGCAGC GTTGTCGGAG GAAATCAGCG ACGGGGCAGA GGGACAACGG
CAGCGTATTA CGGAAACGGC CACTGCCATG GAGGAGATGA ACGCGACCAT TCTCGAGGTG
GCTGGCAATT CGAGCAAGGC GGCTGAGAAT GCGGACCATG CGCGCGGTCG AGCAGAGGAG
GGCGCCAGTA CGGCCAGTGC CTCAGTAGCG GCCATTGAAG AGGTGCAGCG GCTGGCCGAT
GCACTCAAGG TGAATGTCGC TGAATTGGGA GTGAAGGCCG AGGCCATAGG TAGAATAATA
AATGTCATCA ATGATATAGC TGACCAGACG AATCTGCTGG CGCTCAACGC GGCCATCGAG
GCTGCCCGTG CCGGTGATGC TGGCCGGGGG TTTGCGGTGG TGGCAGACGA GGTGCGGAAG
CTCGCTGAAA AGACGATGCA CGCTACAAGC GAGGTCGGCG AGGCTATCAG GGCCATCCAG
CAGGCGACCC GCGACAACAC GCACAGTGTC GAACGCGCGG CAGCGGCAGT AGACAAGGCC
ACAGAACTGG TGGTTCGTTC CGGCAAGGCG TTGAGTGAGA TCGTTGTTCT TTCGGAACAG
TCGGCAGACA GGGTGCGTTC GATCGCCACG GCGTCGGAGG AACAGTCGGC AGCCAGTGAG
GAGATTACGC GTGCACTGGA TGAAATCAAC GGTCTCTCGG GACGCATTAC GGAGGGTATC
GGGCAGGCTG CCGGTGCACA GCGGGATATG AGTCAGCAGT GCAGGAAGCT CAATGAGTTG
ATCGAAAAGA TCAAGCTTGA GAACAAGTAG
 
Protein sequence
MRRSIIIQLI IIVVIGLVVE GGIFLGYVSF DLSQFAAFQA QKTRESIYTH EQYSLRDMVD 
SASSIVEKYY AQSKDMEALK RLKRDELKLV VDAAASMVQA QVAAAPQERK AFVAREMLAS
LRGLRFAGDN YLWVNDLDTV VLMHPVSPQM EGKPQRDIRD EQGKAIFTEF VSIARSKGEG
MVDYMWPRPG QKQPQVKVSF VKLIPELGWV VGAGAYLDDM TAQLKQEALD QIARIRLSDG
NYFWVNDLRP YMVMHPVRPE LDGTDLSRTT DAEGKLLFVE MAKVARDKGA GTVAYRFGKP
GAAGDFPKLS YVKLFEPWGW VIGMGVYMDD VDREIVAEQE RFSQAIGGLM TRSGLIGLMI
ATAMVGLVLF YIHYRLRQPM NQLVRYAGQV ASGELDASVS GRFQGELLLL ADALRSMVAS
LGERMEEAQL KGRLAEEEAA HARAATAEAD DARRKAEGAK AEGMLAAAGI LDTIVTSLTQ
ASQKVAALSE EISDGAEGQR QRITETATAM EEMNATILEV AGNSSKAAEN ADHARGRAEE
GASTASASVA AIEEVQRLAD ALKVNVAELG VKAEAIGRII NVINDIADQT NLLALNAAIE
AARAGDAGRG FAVVADEVRK LAEKTMHATS EVGEAIRAIQ QATRDNTHSV ERAAAAVDKA
TELVVRSGKA LSEIVVLSEQ SADRVRSIAT ASEEQSAASE EITRALDEIN GLSGRITEGI
GQAAGAQRDM SQQCRKLNEL IEKIKLENK