Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bind_0452 |
Symbol | |
ID | 6199221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beijerinckia indica subsp. indica ATCC 9039 |
Kingdom | Bacteria |
Replicon accession | NC_010581 |
Strand | + |
Start bp | 508434 |
End bp | 509300 |
Gene Length | 867 bp |
Protein Length | 288 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641704444 |
Product | formamidopyrimidine-DNA glycosylase |
Protein accession | YP_001831594 |
Protein GI | 182677448 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0266] Formamidopyrimidine-DNA glycosylase |
TIGRFAM ID | [TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.112388 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGAAT TGCCTGAAGT CGAAACCGTC CGCCGGGGGC TGGCGCCGGT CATGGTCGGG GCTTCCTTCA CGACTGTAGA GCAGCGACGG GCCGATCTTC GCTTTCCCTT TCCGGACAAT TTCGCGGCTC GCCTTGAGGG TCGCCGTGTG GAAGCGCTCG GCCGCCGCGC CAAATATCTG CTCGCCGATC TCGATGATGC GCAAGTGCTG GTCATGCATC TCGGCATGTC CGGTTCGTTT CGCATCGAGA AGGCGGGCGA TCTCGCCTCC CCGCCTCCTG GCAAGAACGC TGCGCATGAT CATGTGGTCT TCGGGCTTTC GACGGGCACG CGCATCATTT ACAATGATCC ACGCCGTTTC GGTTTCATGC ATTTGATCGC GCGCCAGGAC CTGGCGGGGC ATCCTTTGTT CCGCAATGTC GGCATCGAGC CGCTCGGCAA TGAGCTCGAA GGGGCCTTGC TGGCGCGATT GTTTGCAGGC AAGACGACGC CGCTCAAGAC CGCGCTGCTC GATCAGACCC TGATTGCCGG CCTCGGCAAT ATTTATGTAT GCGAGGCTTT GCACCGGGCG GGGCTTTCAC CGCGCCGGGC GGCGGGGACG CTCGCCGGCA AGAAGGGCCA GCCGACAGAG CGCGCGCATC GTCTTTCCGA AATCATCCGC GCGGTTTTGG AAGAAGCGAT CGAGGCGGGC GGTTCATCCT TGCGCGATCA TCGCCAAGCC GATGGCGCGC TCGGCTATTT CCAGCACCGT TTCCGTGTCT ATGATCGGGA AGCAGAGCCC TGCCCGCGAG AGGGGTGTGG TGGCACGATC AAGCGTATCG TGCAGGCCGG ACGATCGACG TTCTTTTGTG CCAAATGCCA GCGGTGA
|
Protein sequence | MPELPEVETV RRGLAPVMVG ASFTTVEQRR ADLRFPFPDN FAARLEGRRV EALGRRAKYL LADLDDAQVL VMHLGMSGSF RIEKAGDLAS PPPGKNAAHD HVVFGLSTGT RIIYNDPRRF GFMHLIARQD LAGHPLFRNV GIEPLGNELE GALLARLFAG KTTPLKTALL DQTLIAGLGN IYVCEALHRA GLSPRRAAGT LAGKKGQPTE RAHRLSEIIR AVLEEAIEAG GSSLRDHRQA DGALGYFQHR FRVYDREAEP CPREGCGGTI KRIVQAGRST FFCAKCQR
|
| |