Gene DvMF_1001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvMF_1001 
Symbol 
ID7172896 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris str. 'Miyazaki F' 
KingdomBacteria 
Replicon accessionNC_011769 
Strand
Start bp1216312 
End bp1217406 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content69% 
IMG OID643539507 
Productbiotin synthase 
Protein accessionYP_002435424 
Protein GI218886103 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0502] Biotin synthase and related enzymes 
TIGRFAM ID[TIGR00433] biotin synthetase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones80 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCCCC TGCTGGAACG TCTGTGCGCG CGCCTTTCGG ACACCATTCC CCCCGGAACG 
GAAACTCCGT ACGCATCGCC AAGCCACGCC CGCACGGAGG AAGCCCCCTG GTCCGGCATC
ACCGGAGAAG AGGCGCTCGC CGTGGCGCGG CTGCCCGCGT CCGACATCCT GGACATCCTG
GCCGTTGCGC AGGCCGTGCG TTCGGCCCGC AAGGGGCCGC TGGCGACCAC TTGCGGCATC
GTCAACGCCA AGTCGGGCCG GTGCGGCGAG GATTGCGCCT TTTGCGCCCA ATCGTCGCAC
CACGACACGG GAGCCCCGGT GCATGCGCTG CTCGGCCCCG ACGCGTTGCT GCGCCATGCG
GAAGAACTGG CCCGCGCGGG GGTGCGCCGC TTCGGCATAG TGACCAGCGG CAACGCCCTT
TCGGAACGGG AGTTCGACGC GGTCTGCCAT GCGGCGCGCC TGCTGCGCGA CCGTGTGGAC
ATCGGCCTGT GCGCCTCTCT GGGGCAACTC GCCACCGGGT CCGCCGAGAG CGGAAACCGG
GGAGAACGAG CGCGCCGCCT GAAGGACGCA GGCATCTCCA GCTACCACCA CAATCTTGAA
ACGGCCAGAA GTTTTTTCCC GCAGGTATGT ACCACGCACC CTTACGACGA CGACATCGCC
ACCGTGCGCG AGGCCGCGCG GGCGGGGCTG CGCACCTGTT GCGGCGGCAT CCTGGGCCTT
GGCGAAACGT GGGAACACCG TGTGGAACTG GCCCTGACCC TGCGTGAACT GGACGTGGAC
TCCATCCCGC TGAACTTCCT GCATCCCGTT CCGGGAACAC GGCTGGGCCA CCGCAGTCCG
CTGCCCCCCA TGGAAGCCCT GCGGGCCATT GCCGTGTTCC GGCTGCTGCA CCCGCAGAGG
GACATCCTGG TGTGCGGCGG ACGCGAGACG ACCCTTGGCC AGTGGCAGTC GTGGGTATTC
GCCGCCGGGG CCAACGGACT GATGGTGGGC AACTACCTGA CCACGGCGGG CCGCGCCCTT
GCCGAGGACA TGGAGATGCT GGCCGCGCTG GGCGTGGGCG AAATTCCCCG CAATGGCGAG
GAGGCACGGG CATGA
 
Protein sequence
MNPLLERLCA RLSDTIPPGT ETPYASPSHA RTEEAPWSGI TGEEALAVAR LPASDILDIL 
AVAQAVRSAR KGPLATTCGI VNAKSGRCGE DCAFCAQSSH HDTGAPVHAL LGPDALLRHA
EELARAGVRR FGIVTSGNAL SEREFDAVCH AARLLRDRVD IGLCASLGQL ATGSAESGNR
GERARRLKDA GISSYHHNLE TARSFFPQVC TTHPYDDDIA TVREAARAGL RTCCGGILGL
GETWEHRVEL ALTLRELDVD SIPLNFLHPV PGTRLGHRSP LPPMEALRAI AVFRLLHPQR
DILVCGGRET TLGQWQSWVF AAGANGLMVG NYLTTAGRAL AEDMEMLAAL GVGEIPRNGE
EARA