Gene DvMF_0831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvMF_0831 
Symbol 
ID7172720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris str. 'Miyazaki F' 
KingdomBacteria 
Replicon accessionNC_011769 
Strand
Start bp1003800 
End bp1005158 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content71% 
IMG OID643539332 
Productprotein of unknown function DUF21 
Protein accessionYP_002435255 
Protein GI218885934 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value0.0300168 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGACA CAGGCACATC CGGCTCCATC TACTTCGAGG CAGGCGTCAT TCTCCTGCTC 
ATCCTGATCA ACGGGTTCTT TTCGCTGGCG GAAATGTCGC TGGTGGCCTC GCGCAAGGTG
CGACTGCGCC AGGACGCGGA ACGCGGCGTC AAGGGCGCGG CCACGGCCCT GCGCCTGCTG
CGCGAGCCCG ACCGGCTGTT CTCCACCGTG CAGATCGGCA TCACCCTGGT GGGCATCCTT
ACCGGTGCCT ACGGCGGCGC CGCGCTGGCC GAGCACCTTT CCGCCGTGCT GGCCCGCGTG
GACGTGCTGC GCCCCTACAG CGGCCCGCTG GGCTTCGGCC TAGTGATCCT GCTCATCACC
TATTTCACGC TGATTCTTGG CGAACTGGTG CCCAAGCGCA TGGCCTTCGG CAACCCGGAG
GCCTGCGCCC GCCGCACCGC GCCGGTCATG GCGCTGCTGC TGCGCTTGGC CCTGCCGCTG
GTGCACCTGC TCAGCGCCTC GTCGCGCGCG GCCTCGCGCC TGCTGCGCCT GCCCGAGGGC
GGCGACAGGG CCGTGACGGA AGAGGACATC CGGGGGCTCA TCGGCGAGGG TGCGGCATCG
GGCGTGGTGG AGCACGCCGA GCGCGACATG CTGGAACGCA TCTTCCGGCT GGGCGACCGG
CGGGCGGGGT CGCTGATGAC CCACCGTTCG CAGGTGGAAT GGCTGGACCT GGACATGCCC
GACGCGGAGA ACATGCAGCG CATCGCGCAG TCGTCCCATT CCTGCTTTCC CGTGGCGCGG
GGCGACATCG CCGCCGCCAC CGGGGTGCTG AAGGCGCGCG ACTTCCTGGC CGCACGGCTG
GTCACCCCGG ACATTCCCGT GGACGGCTTC ATCCGGCAAC CCCTGTACAT CCCCGAAACG
GCCCGCGCCC TGACCCTGCT GGACCTGTTC CGCCACTCCG AAGGCCTGCC CTTCGCCCTG
GTGGTGGACG AATACGGCGA GGTGCAGGGG GTGGTCACCC CCAACGACGT GCTGGAAGCC
GTGGTGGGCG AACTGCCCGA CGAAGGCGGC GACCCCGACC CGGCGGCGGT GCGCCGCGAG
GACGGCAGCT GGCTGCTGGA CGGGTTGCTG CCCTTCGACG AGATGTGCTC GCTGGCGGGG
CTGGGCGCTG CGGAGGATCC TGACGACCGG CCCGGCTCGT ACGAAACCCT GGCCGGGTTC
ATGCTGCACC GGCTGGGACG CATGCCCGCC ATGGGCGATG CCCTGCGCTG GCGCGGCCAC
CGCTTCGAGA TCGTGGACAT GGATGGCCGC CGCATCGACC GCGTGCTGGT GAGCCCCGAT
CCGGAACGCG CCGACGACGT GGGCGACGAC GCGCCGTAG
 
Protein sequence
MDDTGTSGSI YFEAGVILLL ILINGFFSLA EMSLVASRKV RLRQDAERGV KGAATALRLL 
REPDRLFSTV QIGITLVGIL TGAYGGAALA EHLSAVLARV DVLRPYSGPL GFGLVILLIT
YFTLILGELV PKRMAFGNPE ACARRTAPVM ALLLRLALPL VHLLSASSRA ASRLLRLPEG
GDRAVTEEDI RGLIGEGAAS GVVEHAERDM LERIFRLGDR RAGSLMTHRS QVEWLDLDMP
DAENMQRIAQ SSHSCFPVAR GDIAAATGVL KARDFLAARL VTPDIPVDGF IRQPLYIPET
ARALTLLDLF RHSEGLPFAL VVDEYGEVQG VVTPNDVLEA VVGELPDEGG DPDPAAVRRE
DGSWLLDGLL PFDEMCSLAG LGAAEDPDDR PGSYETLAGF MLHRLGRMPA MGDALRWRGH
RFEIVDMDGR RIDRVLVSPD PERADDVGDD AP