Gene Dvul_2596 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_2596 
Symbol 
ID4664140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp3033053 
End bp3034294 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content54% 
IMG OID639820844 
Productcapsule polysaccharide biosynthesis 
Protein accessionYP_968038 
Protein GI120603638 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3562] Capsule polysaccharide export protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.322338 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0183226 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTC ATCTCTTTCT TCAAGGGCCC CACGGTCATT TTTTTCGGCG GCTTGGGCAG 
ACACTGATAC GGCACGGGGA TAGAGTATTT CGTGTCAACT GCTGCGGTGG CGACGTTGTG
GACTGGCCCT GGCCTTACAC CCGCCTGTTT CGCAAAAAAG CTACTTACTG GGGGGAATGG
ATCGCCCGCC TCATGGATGA AGAACGGGTG ACAGACCTAC ATGTCTTCGG TGACTGGCGG
CCCCTGCACC GGGAAGCTGT ACTGCTGGCC AAAGTCCGAG GTATCCGAGT CTGGGCCTAT
GAGGAAGGCT ACCTCCGCCC AGACTATATC ACAATGGAGC AAGACGGCGT AAATGGTCTT
TCCTCGTTGC CGAACACCAG AGAAGGCATG GCGGAACTGG CCACCCGGTG CCCGGATCCA
CCGGAAGCCC GCAAAGTAGG CAATCCACAA AAGGTTAAGA CATGGCGTGC GATTGCTCAT
TATGCCGGCA CCATCTTTCT TTGGCCACTG TTCCGACATT TTCAGACGCA CCGCCCGCAG
AACGCCAGCC GCGAAGTCTG GGGCTGGTTC CTGCGCGTCC TTAGTCGTTC CGCGCGTCGG
GAGCGTTCTT CCAGAGCTCT TCGGGCTGCC TACCGTTCCC GTGCACCATA TTTTCTCTTT
CCCCTCCAGC TTGACGCCGA TTCCCAGGTC CGGCGCTATT CTCCGTATAG CGGCATGAAG
GAGGCCATCG CATGTGTGCT GGCCTCTTTT GCACAGGGTG CCCCGGCTGA TACCCACTGC
ATTATTCGCA ACCACCCTCT GGATAACGGT CTTATTGACT ATGCGAGCTT CATTGATTCT
TTCGCGACTG CCTGCGGCAT TCGCGAAAGA ATCCATTTTG TTGAGGGAGG CAAAGCACAT
CAAATGATGG ACAAAAGCGT CGGCGTAGTG ATCCTAAACT CCACGATGGG AATTTCCGCA
TTACGTCACG GCAAGCCAGT ATATTGCGTT GGAACGTCCA TCTACGCCGT AGAAGGACTG
GCGGTGAGCA GTGCAGAAAT GTCCTTAAAT GCCTTCTGGA ACAATCCACG CCGGCCGGAA
GATGACGCCC TCGCGGATTT CGAACGAGTT CTCAAAGCCC AAGCTCTGAT CAACGGAAAC
TTTTATACCC ATGAAGGAAT TGAAACAGCC ATCGAAGGAG TTTTGAAAAG GCTTGGTGAT
GCCTCTAAAC AGACTTTGCC AAGCCAGAAT GGAATGATTT AA
 
Protein sequence
MKIHLFLQGP HGHFFRRLGQ TLIRHGDRVF RVNCCGGDVV DWPWPYTRLF RKKATYWGEW 
IARLMDEERV TDLHVFGDWR PLHREAVLLA KVRGIRVWAY EEGYLRPDYI TMEQDGVNGL
SSLPNTREGM AELATRCPDP PEARKVGNPQ KVKTWRAIAH YAGTIFLWPL FRHFQTHRPQ
NASREVWGWF LRVLSRSARR ERSSRALRAA YRSRAPYFLF PLQLDADSQV RRYSPYSGMK
EAIACVLASF AQGAPADTHC IIRNHPLDNG LIDYASFIDS FATACGIRER IHFVEGGKAH
QMMDKSVGVV ILNSTMGISA LRHGKPVYCV GTSIYAVEGL AVSSAEMSLN AFWNNPRRPE
DDALADFERV LKAQALINGN FYTHEGIETA IEGVLKRLGD ASKQTLPSQN GMI