Gene Dvul_1654 details


Gene Information

Locus tag: Dvul_1654
Symbol:
ID: 4663803
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris DP4
Kingdom: Bacteria
Replicon accession: NC_008751
Strand:
Start bp: 1962572
End bp: 1963606
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 62%
IMG OID: 639819893
Product: OmpA/MotB domain-containing protein
Protein accession: YP_967098
Protein GI: 120602698
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2885] Outer membrane protein and related peptidoglycan-associated (lipo)proteins
TIGRFAM ID:
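
The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it is consistent with the listed length) and that the CDS ends in a single stop codon. The function names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1962572
end_bp = 1963606
gene_length_bp = 1035
protein_length_aa = 344


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the ones listed in the record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Both comparisons should hold if the record is internally consistent under these assumptions.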


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000122259
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.141809
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTTG CCCGTTATCT CGCACTGCTG GCAGCGCTGG TCTTCGCTTG CGCCTCCAGC 
GCCCTGGCCG CGTCCCCCGA AAAGGTCCTC GTCCCCAAGA TTTCGAGCTT CGACTTCTTC
GTGGACTACT CCGGCTCCAT GGCGATGAAG CACGCCAAGC TGGGCAAGGA CAAGATCGTG
GTCGCCAAGG AACTGATGAC CGCCATCAAC GCCAAGATTC CCGCCCTCGG CTACAAGGGC
GGCCTGCACA CCTTCGCGCC CTCTTCCGAA GTGATCGCGC AGGGTCCCTG GGATCGTGCC
GGCTACGAAA AGGGCATCAA GTCCCTCAAG TCCAACTTCG ACATCTTCGG TCGCCTGACT
CCCATGAGCC AGGGCTTCGA AAAGGCTCTG CCCTACGTCC AGAACATGCA GCGCAAGGCC
GCCGTCATTC TCATCAGTGA CGGTGAAGCC AACGTGGGCA TGGATCCCGT GACCGCCGCC
AAGGCCGTTG CCGCCCTCAA GGACGTATGC ATCCATGTCA TCTCCCTTGC CGACACCGCC
AAGGGTCAGG CCACCCTCGA CGCCATCTCC AAGCTCAGCC CCTGCTCCGT CAGCGTGAAC
GGCGCCGACC TGCTGAACAA CGAGAAGGCC CTCGACAAGT TCGTCCGCGA CGTGTTCTAC
GACGAGCAGG CCCCCAAGGC TGCCGCCGCC CCCATGGAAG AAGTCATCGT GCTGCGTAAC
GTCCAGTTCG CACTGAACTC CGCCAAGCTC GACGCTTCCG CCACCAGCAT CCTCACCGAG
GCCGCCCGCA TCATCAAGGC GAACCCCGGC AAGAAGCTGC TCGTCGCCGG TCACACCTGC
AACCTCGGCA CCGACGCCTA CAACCTCAAG CTGTCCGACG CACGCGCCAA GTCCGTCAAG
GACTTCCTCG TGAAGCAGGG CGTTGAGGCC AGCCGTCTTG ACACCTTCGG CTTCGGCGAA
AGCCAGCCCA AGTACGACAA CGGCACCGAA CAGGGTCGCA GCATGAACCG CCGCGTGGAA
CTCTCGTTCC GCTAA
 
Protein sequence
MKFARYLALL AALVFACASS ALAASPEKVL VPKISSFDFF VDYSGSMAMK HAKLGKDKIV 
VAKELMTAIN AKIPALGYKG GLHTFAPSSE VIAQGPWDRA GYEKGIKSLK SNFDIFGRLT
PMSQGFEKAL PYVQNMQRKA AVILISDGEA NVGMDPVTAA KAVAALKDVC IHVISLADTA
KGQATLDAIS KLSPCSVSVN GADLLNNEKA LDKFVRDVFY DEQAPKAAAA PMEEVIVLRN
VQFALNSAKL DASATSILTE AARIIKANPG KKLLVAGHTC NLGTDAYNLK LSDARAKSVK
DFLVKQGVEA SRLDTFGFGE SQPKYDNGTE QGRSMNRRVE LSFR
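
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It uses only the Python standard library and the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons; that is not handled here, and this gene begins with ATG in any case). The names `CODON_TABLE`, `clean`, `translate` and `gc_fraction` are illustrative, and `SAMPLE` is only the first 60 nucleotides of the gene sequence, not the full gene.

```python
# Build the codon table from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as it appears in the listing.
SAMPLE = "ATGAAATTTG CCCGTTATCT CGCACTGCTG GCAGCGCTGG TCTTCGCTTG CGCCTCCAGC"
print(translate(SAMPLE))
```

Pasting the full gene sequence into `translate` and `gc_fraction` allows a comparison with the protein sequence and the GC content value listed in the record.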