Gene VC0395_A0967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A0967 
Symbol 
ID5136764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp1000308 
End bp1003238 
Gene Length2931 bp 
Protein Length976 aa 
Translation table11 
GC content48% 
IMG OID640532425 
ProductGGDEF family protein 
Protein accessionYP_001216913 
Protein GI147673928 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain
[COG3292] Predicted periplasmic ligand-binding sensor domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.1127 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGGAA TTAGGCTTTT AGCTTGCTGC GGAGTGCAGA TCTGTTCAGT GTTTTTGCTG 
ATCATCAGTT CATCTGCTTA TGCCGTAGTC GAACATATTT CGGACTATTT CGTGGAGACG
TGGACTTCCA CTGATGGTTT ACCGCACAAC AGTATTAATA GTATTGCCCA GACACGCGAT
GGTTATTTGT GGTTTGCCAC TTGGGAAGGC GTGGTGCGAT ATAACGGCAT CAATTTTCAA
GTGTTTGATC GCAATCCACA AACTGGAATG GTTGACTCCG GAACTCGCAC CTTATTACCC
ATGGATAACA ATGGGTTGTT GGTGGCTGGT GCTCGAGGGA GTATCACGCA CAGACAGAGT
TATGGCTGGC AATCTTTTCG TTCTGCGCCC AGTTTAGTCA ATGGTGCATT GATTGATGAT
GCACAAAACT TGTGGCTCGC GATTGAAGGT CTCGGCGTCG TAGTCCGACC TTATTTGGGT
AATAACCTCT ATGGGCCTGA TCGATGGTTA CTGACTGATG CCAGTGTGTA TCGTTTAGTA
AAAAATCAGC ATGGAGTGAT TTTTGCGGCC ACCGATAAAG GGTTGTATCG CTTTAATGAT
TGGCAAGCGT CAGCCATGGA ACTGCCGATT GGGCGCTATG AGCGAATTAA CTATATTTCC
ATCAGTTTGA ATCAAGAACT TGTCGTGGCA ACAGATCAAG GTGTTTGGAT CAATACTGAA
GAAGGCTTCC AATCGTTGTT TGTACCGCTT GAGAAAGAGG TGGTGACGCT GGCGGAGCAA
GATCGTGCCG GGAACTGGTG GATTGGCACG GTTAATCGCG GCATTGCACG TTTGAAAAAT
CAGCAACTGC AATTTTTGGA GCCTAGTCGA GGTTTGCCTT ATAGCCGAGT GCTTTCTTGG
TATCAGGATA TCGAAGGCAG TGTATGGGTG GGGACCAATG CTGGCATTAT GCGACTGCGT
GATGCGCCTT TCATTAATAT CAACAGTGAT AAAGGATTAG TGGGCGATTA TGTCCGAACC
TTGTTGCCGC TAGATGACCG ACGAGTTATG GTCGGTACCA GTCGTGGATT AAGCATTATT
GAAGAGGGCT TGGCACATAG CGCGTTAATG CCTCAAGTCG GTGCTCGTCC TTCGATACTC
AGTCTGGCCA AAGCAGACCA GCAAGGTGAA AAAGTCTGGG TTGGGACGGT ACAAAAAGGA
TTGCTCCTTT GGCAAAACGG CCAGTTGCGT CCTGTGCTAG ATGAAAATAA CGGTTTGCCA
AGCAGCGAAG TCAGAGCCAT CGTCACCGAT CCACAAGATA ATCTTTGGAT TGGTACTTCA
AATGGTATTG TCAAAAGAAC GCCGCAAGGT GTGCTGACGA CATACAACAA AGACAACTCC
CCCCTGCCTG ATGATTACAT CATGGCACTC GCTGTGGATT CGGAAGGGAA GCTGTGGGTT
GGTAGTGCCG TTGGTGTGGC TTATTTTGAT GCTCAAGGCA AGATTCGTCC GGTTGATCTT
ACTAAGCAAG AGCAAGCACA GTATGTGTTT GGCTTCTACA TGGAATCCGA TTATGTCTGG
ATGGCCACAG ACAGAGGCAT CGTGCGTTAT CGCTTGAGTG ATGCAAGCGT TTCCCTTGTG
GGTCGTGCCG CAGGCTTACC AATCGATAAA TTCTTCCAAA TGCTACGTGA TAGTGAAGGG
CATGTGTGGC TATCCAGCAA CCGTGGGGTA TGGAAGCTCA ACTATGATCA GATGTTAGCG
GTCGCTGATG GCATGAGCAC ACAGCTTGAA TTCGAACATT TTGATGAAGG TGATGGCATG
GCAACCAGTC AGGCGAATGG AGGGACCAAC CCTGCGTCTG CAAGTTTGCC AAATGGTGAG
CTGCTTTTTG CCACTGCGAA AGGGGTGGCG AGCATCAAGG TACAGCGTCT CCAGCAACTG
AGTGAACTGC GTTTACCGGT CGTTCTAGAA TCGGTCAGTT TTGACAGCGA GATTATCAAT
CCTGATCAGC AGTACATTGC TGCGGCTGGA ACCAATCGGG TGAGCTTCGG ATACGTGGGG
CTAGGTTTTG TGATGTCAGA GCGTTTGCAA TATCGCACGA AACTTGAAGG GTTCGATCGA
GATTGGTCAT ATCGGGGTCA TAATACCCAA GCGGAATACA CCAATTTAGC GCCGGGTAAA
TATCGATTCT TTGTCAGCGC CCGTTATCCG TATGGTGAGT GGAATGATGC CACGTTTAGT
TACGTATTTA TCATTGAACC TCATTGGTGG CAGCGTAAAG AGGTGATTGT CATGGCTGGC
ATGCTGTTTT TGGCGCTGGC GGTGTCACTG GTGATGTGGC GAATTCGTAT TTTGAAACGG
CGTGAGTTGT ATCTCGTTGA GCAGGTTGCG CTGCAAACCC AAAAATTGCG CCTACAAGCG
GAAAAGTTTG AGAGACTCTC GAAAGAAGAT GATTTAACTG GGCTTGCCAA TCGCCGCGCA
TTTGATATGT ACCTCAAACA AGCCTTTTCT CGTCTGCAAA ATCCAGATCA GCAAGTCAGT
ATTGCGCTGC TGGATATTGA TCACTTCAAG CAGATTAATG ATCGTTATTC GCACATTATC
GGCGACCAAG CCATTGTAGC GGTATCACAA GAGTTACTTG GGTATGTTGG CGACAAAACC
CGAGTGGCGA GATGGGGCGG TGAAGAGTTC ACCATCTTAT ATGTAGGAGA TCCTAAGCAA
GCTTGGGGTT ATTTTGAAAA GCTGCGTTGT AAGATTGAAC AGGTTGATTT GTCTGCAGTA
GCACCGGGAT TAAACGTGAC AGTGAGTATT GGCTTTGCCG ATGCGCAGCA GGCGGAGAGT
TATGAAACCG TTTTGAAGCT GGCGGATCAT GCATTATTGA CGGCGAAGAA ACTCGGGCGT
AACCGCGTGG TGAAAAGTGA AGAGTGGCAA GTAAAAAGCG GCCTCCATTG A
 
Protein sequence
MTGIRLLACC GVQICSVFLL IISSSAYAVV EHISDYFVET WTSTDGLPHN SINSIAQTRD 
GYLWFATWEG VVRYNGINFQ VFDRNPQTGM VDSGTRTLLP MDNNGLLVAG ARGSITHRQS
YGWQSFRSAP SLVNGALIDD AQNLWLAIEG LGVVVRPYLG NNLYGPDRWL LTDASVYRLV
KNQHGVIFAA TDKGLYRFND WQASAMELPI GRYERINYIS ISLNQELVVA TDQGVWINTE
EGFQSLFVPL EKEVVTLAEQ DRAGNWWIGT VNRGIARLKN QQLQFLEPSR GLPYSRVLSW
YQDIEGSVWV GTNAGIMRLR DAPFININSD KGLVGDYVRT LLPLDDRRVM VGTSRGLSII
EEGLAHSALM PQVGARPSIL SLAKADQQGE KVWVGTVQKG LLLWQNGQLR PVLDENNGLP
SSEVRAIVTD PQDNLWIGTS NGIVKRTPQG VLTTYNKDNS PLPDDYIMAL AVDSEGKLWV
GSAVGVAYFD AQGKIRPVDL TKQEQAQYVF GFYMESDYVW MATDRGIVRY RLSDASVSLV
GRAAGLPIDK FFQMLRDSEG HVWLSSNRGV WKLNYDQMLA VADGMSTQLE FEHFDEGDGM
ATSQANGGTN PASASLPNGE LLFATAKGVA SIKVQRLQQL SELRLPVVLE SVSFDSEIIN
PDQQYIAAAG TNRVSFGYVG LGFVMSERLQ YRTKLEGFDR DWSYRGHNTQ AEYTNLAPGK
YRFFVSARYP YGEWNDATFS YVFIIEPHWW QRKEVIVMAG MLFLALAVSL VMWRIRILKR
RELYLVEQVA LQTQKLRLQA EKFERLSKED DLTGLANRRA FDMYLKQAFS RLQNPDQQVS
IALLDIDHFK QINDRYSHII GDQAIVAVSQ ELLGYVGDKT RVARWGGEEF TILYVGDPKQ
AWGYFEKLRC KIEQVDLSAV APGLNVTVSI GFADAQQAES YETVLKLADH ALLTAKKLGR
NRVVKSEEWQ VKSGLH