Gene Dvul_2698 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_2698 
Symbol 
ID4662806 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp3140127 
End bp3141317 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content65% 
IMG OID639820944 
ProductA/G-specific adenine glycosylase 
Protein accessionYP_968136 
Protein GI120603736 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR00586] mutator mutT protein
[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.265639 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCATG CTATGCGCAG CACACCCGCC GGGCTTGCCG GTAGCACGCC CCTGCGCTAC 
ACCCGCACCA TGCACGACAA CGCCCCGCAG CACGAATACG ACGCCTTCGC CAAGGCCCTT
CTCGACTGGT TCGCCGCCGC CCGCAGACCC CTGCCGTGGC GTGAGCATTA CACCCCCTAC
GGTGTGTGGA TTTCGGAAAT CATGCTCCAG CAGACGCAGA TGGAGCGCGG CGTGGACTAC
TACCTGCGCT GGATGGAACG CTTTCCCGAC GTGGCAAGCG TGGCCACAGC ACCTGAAGCC
GACCTGCTCA AGGCATGGGA GGGACTCGGC TACTACCGCC GTGTACGCAA TCTGCAAGCG
GCGGCGCGTG TCATCATGGA GCAGCACGAG GGCATCTTCC CCGACCTGCC CGATGCCATC
CGCGCCCTGC CCGGTATCGG CCCCTATACG GCGGGCGCCA TCGCCAGCAT CGCCTTCAAC
CACGACGTCA TCGCCGTAGA CGGCAATGTG GAACGCGTCT TTTCAAGGGT GTTCGACATC
GACACCCCGG TGCGTGAGAA GACGGCAGCC ACACGCATCC GCATGCTGAC GGCACGCACC
CTGCCCAAGG GCCGCGCCCG CGACTTCAAT CAGGCCCTCA TGGAACTTGG CGCCCTCGTC
TGCCGTAAGA AGCCCGACTG CACAGCCTGC CCGGTGGCAC GATTCTGCGA AAGCCTCCAT
CTTGGCATTC CGCATGAACG CCCTGTGCCG GGCCGCAGAC AGCCCATCGT CCCGCTGGAT
GTGGTCTCGG GAGTACTCGT CCATGAAGGC CGCATCTTCG TGCAACGTCG CCCCGACACC
GGAGTCTGGG CCGGATTCTG GGAATTCCCC GGCGGGCGCA TCGAACCGGG AGAGACACCG
GAAGAGGCCA TCATCCGCGA ATTCCGCGAA GAGACGGACT TCGCCGTACG CACCACAGAC
AAACTGGCTG TCATCCGGCA TGGCTACACG ACCTACAGGG TGGTACTGCA CTGCTATCTG
CTGCACATCG ACGCCAGCAG CCGTGGCGCC CCCCCTGAAC ATCCCGTCAT CACTGCCGCC
ACCGACCATC GATGGGCCAC ATTGGCAGAT ATCGACGCCC TCACCCTGCC CGCTGGCCAT
CGCAAGCTGG CGGACCTGCT TGCCGCGGAC CTGCGCTTCG CAGGGCTGTG A
 
Protein sequence
MIHAMRSTPA GLAGSTPLRY TRTMHDNAPQ HEYDAFAKAL LDWFAAARRP LPWREHYTPY 
GVWISEIMLQ QTQMERGVDY YLRWMERFPD VASVATAPEA DLLKAWEGLG YYRRVRNLQA
AARVIMEQHE GIFPDLPDAI RALPGIGPYT AGAIASIAFN HDVIAVDGNV ERVFSRVFDI
DTPVREKTAA TRIRMLTART LPKGRARDFN QALMELGALV CRKKPDCTAC PVARFCESLH
LGIPHERPVP GRRQPIVPLD VVSGVLVHEG RIFVQRRPDT GVWAGFWEFP GGRIEPGETP
EEAIIREFRE ETDFAVRTTD KLAVIRHGYT TYRVVLHCYL LHIDASSRGA PPEHPVITAA
TDHRWATLAD IDALTLPAGH RKLADLLAAD LRFAGL