Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_2698 |
Symbol | |
ID | 4662806 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | + |
Start bp | 3140127 |
End bp | 3141317 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639820944 |
Product | A/G-specific adenine glycosylase |
Protein accession | YP_968136 |
Protein GI | 120603736 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR00586] mutator mutT protein [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.265639 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCATG CTATGCGCAG CACACCCGCC GGGCTTGCCG GTAGCACGCC CCTGCGCTAC ACCCGCACCA TGCACGACAA CGCCCCGCAG CACGAATACG ACGCCTTCGC CAAGGCCCTT CTCGACTGGT TCGCCGCCGC CCGCAGACCC CTGCCGTGGC GTGAGCATTA CACCCCCTAC GGTGTGTGGA TTTCGGAAAT CATGCTCCAG CAGACGCAGA TGGAGCGCGG CGTGGACTAC TACCTGCGCT GGATGGAACG CTTTCCCGAC GTGGCAAGCG TGGCCACAGC ACCTGAAGCC GACCTGCTCA AGGCATGGGA GGGACTCGGC TACTACCGCC GTGTACGCAA TCTGCAAGCG GCGGCGCGTG TCATCATGGA GCAGCACGAG GGCATCTTCC CCGACCTGCC CGATGCCATC CGCGCCCTGC CCGGTATCGG CCCCTATACG GCGGGCGCCA TCGCCAGCAT CGCCTTCAAC CACGACGTCA TCGCCGTAGA CGGCAATGTG GAACGCGTCT TTTCAAGGGT GTTCGACATC GACACCCCGG TGCGTGAGAA GACGGCAGCC ACACGCATCC GCATGCTGAC GGCACGCACC CTGCCCAAGG GCCGCGCCCG CGACTTCAAT CAGGCCCTCA TGGAACTTGG CGCCCTCGTC TGCCGTAAGA AGCCCGACTG CACAGCCTGC CCGGTGGCAC GATTCTGCGA AAGCCTCCAT CTTGGCATTC CGCATGAACG CCCTGTGCCG GGCCGCAGAC AGCCCATCGT CCCGCTGGAT GTGGTCTCGG GAGTACTCGT CCATGAAGGC CGCATCTTCG TGCAACGTCG CCCCGACACC GGAGTCTGGG CCGGATTCTG GGAATTCCCC GGCGGGCGCA TCGAACCGGG AGAGACACCG GAAGAGGCCA TCATCCGCGA ATTCCGCGAA GAGACGGACT TCGCCGTACG CACCACAGAC AAACTGGCTG TCATCCGGCA TGGCTACACG ACCTACAGGG TGGTACTGCA CTGCTATCTG CTGCACATCG ACGCCAGCAG CCGTGGCGCC CCCCCTGAAC ATCCCGTCAT CACTGCCGCC ACCGACCATC GATGGGCCAC ATTGGCAGAT ATCGACGCCC TCACCCTGCC CGCTGGCCAT CGCAAGCTGG CGGACCTGCT TGCCGCGGAC CTGCGCTTCG CAGGGCTGTG A
|
Protein sequence | MIHAMRSTPA GLAGSTPLRY TRTMHDNAPQ HEYDAFAKAL LDWFAAARRP LPWREHYTPY GVWISEIMLQ QTQMERGVDY YLRWMERFPD VASVATAPEA DLLKAWEGLG YYRRVRNLQA AARVIMEQHE GIFPDLPDAI RALPGIGPYT AGAIASIAFN HDVIAVDGNV ERVFSRVFDI DTPVREKTAA TRIRMLTART LPKGRARDFN QALMELGALV CRKKPDCTAC PVARFCESLH LGIPHERPVP GRRQPIVPLD VVSGVLVHEG RIFVQRRPDT GVWAGFWEFP GGRIEPGETP EEAIIREFRE ETDFAVRTTD KLAVIRHGYT TYRVVLHCYL LHIDASSRGA PPEHPVITAA TDHRWATLAD IDALTLPAGH RKLADLLAAD LRFAGL
|
| |