Gene Dvul_2595 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_2595 
Symbol 
ID4664149 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp3031005 
End bp3033056 
Gene Length2052 bp 
Protein Length683 aa 
Translation table11 
GC content52% 
IMG OID639820843 
Productcapsule polysaccharide biosynthesis 
Protein accessionYP_968037 
Protein GI120603637 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3563] Capsule polysaccharide export protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00683877 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACACTAT ATTTTCACTC AAAAAACAAA AAATCCTTTG TACTCTTTTC GCAAACACTC 
TATCGCATTC CTCATTTGAA CCTTTTTTTG CACGACATGG AAAACTGTAC GATACAAAGG
AATACCGTAC TCGGCTGGGG TCATAAACCT ACAGCTGACA AAGCCCGGCG CTATGCAGCG
GAGCACAACC TCCCTTACAT CGCTGTGGAG GACGGTTTTT TGCGTTCCCT AGACTTGGGT
TGTCGGGGCG CACAGCCTCT TTCACTAGTA GTGGACCATA CGGGTATCTA CTATGATGCT
AGTGGACCGT CCGACTTGGA GGACTTTCTT AATTCCTCCG GCTGGGAAAC GCCGGAGTTG
ATGGACTCCG CTAGGCTCGC CATGCGAGAG ATTATTCGAC ATAGCCTGAG CAAGTACAAC
CACGCCCCTG AATCATCGGA ATCAGTATGG GGAGATGGGC TAGCTCCCCG CGTGCTTGTC
CTCGACCAGA CGGTAGGCGA CGCCAGCGTT ACACTAGGCA TGGCAGATGA AGAGTGTTTC
CACGCTATGC TGGAAAAGGC TTTAACTGTA CACCCGCACT CACATGTCTG TGTAAAAACG
CACCCGGATG TCATTGCCGG CAAAAAAAAA GGCTATTTAA CCGAATACGC CACAAAACAC
GGTGTCAAAA TCCTTGCGGT GGAGCATGCA CCGCTTTCCC TGCTGTCTAG AGCCGATGTG
GTATACACGG TCACGTCACA AATGGGCTTT GAAGCTCTCA TGCTCGACAA AGAAGTCCAT
TGTTTTGGCA TGCCCTTCTA TGCCGGTTGG GGGCTGACTA ACGACATGAA GACCTGCTCT
CGCAGGAAAA GACAACGGAG CCTCGAGGAA GTCTTTACTG CGGCATACCT GCTGTATGCA
CGCTACGTGA ACCCGATACG GAGCGAACGG TGCGACATCC ATGACACCAT TAGCCTGCTA
ACCGAACAAC GCAGACAAAA TGAAAGTAAT CGTTGCTTTC ACGCCTGCGT GGGCTTTCGT
TGGTGGAAAC GGCCTTACGC TAGAGCCTAT TTGCAATCCA CCGGAGGGCA AACCGTCTTT
TACCGGAATG CGAAACAAGC AATATACGAT GCATGGACAA AGGGCGGCGA ACTCGTGACA
TGGTCGTCCA ACGTAGACAC AGAACTCCAG AAAGCATGTG ATGAAAGGGG AATACGATTG
GCACGCATGG AGGACGGTTT CATTCGTTCA GTGGGATTGG GCTCGAATTT CAACTGGCCC
TATTCCTTAG TTGTGGACAG AAAGGGGATT TATTACGATC CCTCCCTGCC CAGCGAACTT
GAAGACATCC TCAACGCCAT ACATGAACAC CCTGAGCATG CGGCGCTGCT GAAGCGCGCC
GCCACGCTGT GTGCCATAAT TTTAGAAAAG GGGCTAACTA AATACAATAC AGGGTTTCGT
GTCGAATTCT TGCCAAAATT GCCGAAAGAA AAAACCATCA TACTTGTTCC GGGCCAGGTT
GAGGACGACG CTTCCGTACG TTGCGGCGGC TTCGGCATGA CGAATCTAGA TCTGCTGCGG
GCCGCGCGTG AAGCCAGACC GGACGCCTTC ATCATCTACA AGCCGCACCC AGATGTGGAA
AGTGGCAACC GCCAGGGCGC GTTGCCCGAC ACGACGACTT TACATTACGC TGACAGTATT
CTGCACGATT TTCCTATGGG TAGCCTTCTG CCTCTGGTCA ATGAGGTCCA CACGTTGACC
TCCCAAACCG GATTTGAGGC ACTGTTACGC GGGGTCAAGG TATGTACTTA CGGAGGACCG
TTTTATGCGG GATGGGGTCT AACGGAAGAC AACAGAACCT TTCCTCGCCG CAAGGCACGC
CTGAATCTGA ATGAATTGGT AGCCGGAGCC CTATTGCTCT ACCCTTCCTA TTATGACTGG
CAGACGCGGA ACTTCTGCCG GGCGGAGGAT GTCTGTTGCC GTCTGCTCCA GCCGGACGGT
CAGATGCGGG GGCGTGTCTG GACGCGGTTT GTCACTGCCA CGCGCGGATT CCTTCAGAGA
ATCGGACGAT GA
 
Protein sequence
MTLYFHSKNK KSFVLFSQTL YRIPHLNLFL HDMENCTIQR NTVLGWGHKP TADKARRYAA 
EHNLPYIAVE DGFLRSLDLG CRGAQPLSLV VDHTGIYYDA SGPSDLEDFL NSSGWETPEL
MDSARLAMRE IIRHSLSKYN HAPESSESVW GDGLAPRVLV LDQTVGDASV TLGMADEECF
HAMLEKALTV HPHSHVCVKT HPDVIAGKKK GYLTEYATKH GVKILAVEHA PLSLLSRADV
VYTVTSQMGF EALMLDKEVH CFGMPFYAGW GLTNDMKTCS RRKRQRSLEE VFTAAYLLYA
RYVNPIRSER CDIHDTISLL TEQRRQNESN RCFHACVGFR WWKRPYARAY LQSTGGQTVF
YRNAKQAIYD AWTKGGELVT WSSNVDTELQ KACDERGIRL ARMEDGFIRS VGLGSNFNWP
YSLVVDRKGI YYDPSLPSEL EDILNAIHEH PEHAALLKRA ATLCAIILEK GLTKYNTGFR
VEFLPKLPKE KTIILVPGQV EDDASVRCGG FGMTNLDLLR AAREARPDAF IIYKPHPDVE
SGNRQGALPD TTTLHYADSI LHDFPMGSLL PLVNEVHTLT SQTGFEALLR GVKVCTYGGP
FYAGWGLTED NRTFPRRKAR LNLNELVAGA LLLYPSYYDW QTRNFCRAED VCCRLLQPDG
QMRGRVWTRF VTATRGFLQR IGR