Gene VC0395_A0907 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A0907 
SymbolmdoD 
ID5135468 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp923029 
End bp924666 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content48% 
IMG OID640532365 
Productglucan biosynthesis protein D 
Protein accessionYP_001216853 
Protein GI147675383 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3131] Periplasmic glucans biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000201606 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTCGTG TGTCCAGCGC CGTGCAACGT CATGCGCAAA AACTGATTGT ACTTTTCTCC 
CTGCTGTTTG GGGCTTCTTT GCTGATGTCT GATAATGGTT TTGCTACAGA CATTAAAAAT
ACTAATGCCT CTTCTCCAGT GAATTCAGAG TCTACTAAGC CAACAAAAGC TGGCGAAGTT
AAAAATGTGG TTCGCTTTGC CAAAACGGGA TCGTTTGATA ACGACACCGT TGTTCGCCTA
GCTCGCCAAC TGGCGAAAAA GCCTTATGTT GCCTTAAAAG ATCCGCTACC AGAGAGTTTG
GCCAATATCA GTTATGATGA GTACCGCGAT ATTCGTTTTA AACCCGACAG TGCAGTGTGG
AAAGCAGATG GCCTACCATA TCAAATGCAA CTATTCCATC GCGGTTTCTT CTTCCAAGAT
CTGATTGAAA TTGCACTTGT TGAAGGCAAC CAAGCCACTC ACCTAAGTTA CGACCCCAAT
ATGTTCACCG CTGGCGAAGT TCTACAACAG AACCTGCCGA CTGAAGATAT TGGTTATAGT
GGTCTTCGTG TGCATTACCC TCTCAACAGC CCATCCTATT TTGATGAACT GTTTGTATTC
CAAGGAGCAA GCTACTTCCG TGCTCTGGGT AAAGGCAATG CGTATGGCTT GTCTGCGCGT
GGCTTGGCCA TCAAAACTGC CGATCCAGCG GGTGAAGAGT TCCCTATTTT CCGTGCCTTC
TGGGTGGAAA AACCAAACTA CGACACCAAC TTGATTGTGG TCCATGCCCT ACTGGATAGC
CCAAGCGTGT CTGGTGCGTA TCGTTTCTCT ATTCGTCCAG GAGAAAATAC TCGTATGGAC
GTTGAGGCGG TACTCTTCCC ACGCGTGGAG TTAAGCAAAG TTGGTCTAGC TCCGGCAACC
AGTATGTTCA TGCATTCGCC AAATGGCCGT GAGAAGACCG ATGATTTCCG TCCTTCTGTG
CATGATTCTG ATGGTTTATT GATGATCAAC GGACGTGGTG AACGTTTGTG GCGTCCATTG
GCTAACCCTA GCACACTGCA AGTGAGCGCC TTTATGGACA ACTCACCGCA AGGCTTTGGT
TTGATGCAGC GTGAGCGCGA TTACGCCAAC TACCAAGATT TGGAAGCCCA TTACGAAAAA
CGTCCAAGTC TGTGGGTTGA ACCGGTCGGT AACTGGGGTC CTGGTGCTGT CGTGTTGACA
GAAATTCCAA CTCAATCAGA AATTCACGAC AACATTGTCG CCTTCTGGAA GCCAGCACAA
CCTCTTGCAG CAGGCAGTGA ATATCGTTTC TCTTATCACC TCAACTGGGG TGCGCAACCA
GAAGCGAATC CACAAGCGAT CACTGTAAGC CGTACTGCGA GTGGACGTGC CGATATTGCC
AAACCAACGC CAAAACGTTT GTTCGTGATT GATTACCAAG TCCAAGGTGC CAAGCCTGCA
CAGATGCCAG AACCGAAAGT GCGCAGCAAT GCTGGGGTAA TCAGTAACGT TGTACTGCGT
GATAACCCTG CCAATAATGG CTATCGCCTC TCATTTGAAT TTGATCCAGG CGAAGTGACG
CTGGCTGAAC TACGGGCAGA GCTCACTTTG CAAGAAGCGC GTCCTGTAGA AACTTGGTTG
TATCGTTGGA CCCTGTAG
 
Protein sequence
MIRVSSAVQR HAQKLIVLFS LLFGASLLMS DNGFATDIKN TNASSPVNSE STKPTKAGEV 
KNVVRFAKTG SFDNDTVVRL ARQLAKKPYV ALKDPLPESL ANISYDEYRD IRFKPDSAVW
KADGLPYQMQ LFHRGFFFQD LIEIALVEGN QATHLSYDPN MFTAGEVLQQ NLPTEDIGYS
GLRVHYPLNS PSYFDELFVF QGASYFRALG KGNAYGLSAR GLAIKTADPA GEEFPIFRAF
WVEKPNYDTN LIVVHALLDS PSVSGAYRFS IRPGENTRMD VEAVLFPRVE LSKVGLAPAT
SMFMHSPNGR EKTDDFRPSV HDSDGLLMIN GRGERLWRPL ANPSTLQVSA FMDNSPQGFG
LMQRERDYAN YQDLEAHYEK RPSLWVEPVG NWGPGAVVLT EIPTQSEIHD NIVAFWKPAQ
PLAAGSEYRF SYHLNWGAQP EANPQAITVS RTASGRADIA KPTPKRLFVI DYQVQGAKPA
QMPEPKVRSN AGVISNVVLR DNPANNGYRL SFEFDPGEVT LAELRAELTL QEARPVETWL
YRWTL