Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bind_2028 |
Symbol | |
ID | 6199942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beijerinckia indica subsp. indica ATCC 9039 |
Kingdom | Bacteria |
Replicon accession | NC_010581 |
Strand | + |
Start bp | 2318622 |
End bp | 2319494 |
Gene Length | 873 bp |
Protein Length | 290 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641706015 |
Product | phage SPO1 DNA polymerase-related protein |
Protein accession | YP_001833139 |
Protein GI | 182678993 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1573] Uracil-DNA glycosylase |
TIGRFAM ID | [TIGR00758] uracil-DNA glycosylase, family 4 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGATT CCCTCACCGC TCTTTTGCGC TGGTATGCGG CGATGGGCGT CGATATCGCT CTCGACGAAG CACCACACGA CCGGTTCGCG GAATCCGCCG CCATGGCCAG CCGGCGAGCC CCCTCCGTCC CCCAAGACGA CCGGCAGGCG CGGGCAGATC ATCCACCACC ATCCAGCGTC TCGCCATCTC GGCAACAGGC CCCCGCTCCT CGGCAGCTCC CAGGCTTCGC GCAAGAGGCT CTCTCCCAAG AGGCTCTGTC CCAAAATGCC GAGGAAGCCG CCGCTTCGGC CAAGACACTC GATGAATTGC GCGAGAAACT CGATGCCTTT CAAGGTTGCG GCCTGAAAAA CACGGCGACA CAACTCGTCT TCGCCGATGG CAATCCAAAC AGTTCCGTCA TGATCATCGG CGAGGCGCCT GGTGCCGACG AGGACCGCCA GGGCCGCCCC TTTGTCGGCC GCGCCGGGCA ATTGCTCGAT CGCATGCTGG CAGCCGTGGA TCTCGACCGC ACCCAGGTCT ATATCGCCAA TATCGTGCCC TGGCGCCCGC CGGGCAATCG CACGCCGACC CCGCTCGAAA TCGCGGCCTG CCTGCCCTTC ATCCGCCGTC AGATCGAATT GGTTTCGCCA CGTTTCATTG TCTGCCTCGG CGCGCCCTCC GCACAAACGC TCCTCGGCAC CAAGGAGGCC ATCACCCGGC TGCGCGGCCG CTGGCGCGAT TGGTCCTGCG GCGGCAAGAC CATTCACGTC CTGCCGATGC TGCATCCAGC CTATCTGCTG CGTCAGCCGG CGGAGAAAAA ACGCGCATGG GCGGATTGGC GCCTACTCGC CAAGGCGCTT CGCGATGACA GTCCACAGAC TCTGAAGGAT TAA
|
Protein sequence | MSDSLTALLR WYAAMGVDIA LDEAPHDRFA ESAAMASRRA PSVPQDDRQA RADHPPPSSV SPSRQQAPAP RQLPGFAQEA LSQEALSQNA EEAAASAKTL DELREKLDAF QGCGLKNTAT QLVFADGNPN SSVMIIGEAP GADEDRQGRP FVGRAGQLLD RMLAAVDLDR TQVYIANIVP WRPPGNRTPT PLEIAACLPF IRRQIELVSP RFIVCLGAPS AQTLLGTKEA ITRLRGRWRD WSCGGKTIHV LPMLHPAYLL RQPAEKKRAW ADWRLLAKAL RDDSPQTLKD
|
| |