Gene DvMF_2449 details

Gene Information

Locus tag: DvMF_2449
Symbol:
ID: 7174384
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris str. 'Miyazaki F'
Kingdom: Bacteria
Replicon accession: NC_011769
Strand:
Start bp: 3083870
End bp: 3084880
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 70%
IMG OID: 643540980
Product: formamidopyrimidine-DNA glycosylase
Protein accession: YP_002436858
Protein GI: 218887537
COG category: [L] Replication, recombination and repair
COG ID: [COG0266] Formamidopyrimidine-DNA glycosylase
TIGRFAM ID: [TIGR00577] formamidopyrimidine-DNA glycosylase (fpg)
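
The start, end, gene length and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the gene record above.
START_BP = 3083870
END_BP = 3084880
GENE_LENGTH_BP = 1011
PROTEIN_LENGTH_AA = 336


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

If either comparison were false for a record, the likely causes would be a different coordinate convention, a partial CDS, or a frameshift.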


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 97
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGAGT TGCCAGAGGT GGAGACCATT GCCTGCGGTT TGCGGCCCCA GTTGACGGGG 
CGGCGCATCG TGTCCGTTTT GGTGCGCAAC GAAGGGACGG TGCAGGGCGA CGCAGCGGCG
TTCGCCCGGT GCGTGCCGGG GCGTGTCATT GCCGGGGTGG GGCGGCGCGG CAAGTTGCTG
CTGATGGAAT TGGCCCCACC GGGCGGACCG GACGCCCCGG ATGCCACAGG CGGTGGAGCG
GGCGGGCCGG ATGCGGCAGG GGCCGCCGGG ACGGGAACCA ATGTTCCGCC AGACGCGGCA
GACCTGATGG CCTGCCGCGA CGCGGCGGGC AAGATGGCCG GTTCCAGCGC TACAGGCGCT
ACAGGCGCGG CGGGCGGCAA CCGCGTGCCG CATCTGCTGG GCGTGCATCT GAAGATGACC
GGGCGGCTGT TTGTCTACGG GCCGGAGGTG GCGCCCAACA CCCACACCCG CGTGGTCTTC
GGGCTGGATG ACGGCAATCG GCTGTTTTTC GACGATGCGC GCAAGTTCGG TTACGTGCGC
GCCCTGTCCG ACGCCGATCT GGCCACGTGG GACTTCTGGC GGTCGCTGGG GCCGGAGCCG
CTGGAGATTG CCGCGCCGGA CTTCGCGGCG CTGTTCCGGG GGCGGCGGGG GCGCATCAAG
GCGCTGTTGC TGGACCAGAC GGTCATCGCG GGCATCGGCA ACATTTACGC CGACGAATCG
CTGTTCCGGG CGTCCATCCG GCCCGATGCG CAGGCCGGGG AGCTTTCGCC CGAGCGGCTG
TGCGTGTTGC ACGGGCATCT GGTGGACGTG CTGCGCGAAT CCATCGCCGA GTGCGGCAGT
TCCATCCGCG ACTACCGCGA TGCCCACGGC GATGCCGGGG CCTTCCAGAA CCGCTTCCGG
GTGTACGGCA GGTCCGGGCA GCCGTGCGTG GCCTGCGGGC GCGCGCTGAC CACGGGCAAG
GTGGCCGGGC GCACCACGGT GTTCTGCGAG CGCTGCCAGA AGGCGAAGTG A
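
The gene sequence above is laid out in blocks of ten bases separated by spaces. The following is a minimal sketch of how that layout could be stripped and a GC fraction computed from the result. The helper names are hypothetical, and only the first line of the sequence is used as input, so the printed value need not match the 70% reported for the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGCCTGAGT TGCCAGAGGT GGAGACCATT GCCTGCGGTT TGCGGCCCCA GTTGACGGGG"
print(f"{len(clean_sequence(first_line))} bases, GC = {gc_fraction(first_line):.1%}")
```

Passing the full listing as one multi-line string works the same way, because `str.split()` with no arguments splits on any run of whitespace.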
 
Protein sequence
MPELPEVETI ACGLRPQLTG RRIVSVLVRN EGTVQGDAAA FARCVPGRVI AGVGRRGKLL 
LMELAPPGGP DAPDATGGGA GGPDAAGAAG TGTNVPPDAA DLMACRDAAG KMAGSSATGA
TGAAGGNRVP HLLGVHLKMT GRLFVYGPEV APNTHTRVVF GLDDGNRLFF DDARKFGYVR
ALSDADLATW DFWRSLGPEP LEIAAPDFAA LFRGRRGRIK ALLLDQTVIA GIGNIYADES
LFRASIRPDA QAGELSPERL CVLHGHLVDV LRESIAECGS SIRDYRDAHG DAGAFQNRFR
VYGRSGQPCV ACGRALTTGK VAGRTTVFCE RCQKAK