Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_1539 |
Symbol | |
ID | 4445940 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 1714818 |
End bp | 1715768 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639689354 |
Product | DNA-(apurinic or apyrimidinic site) lyase / formamidopyrimidine-DNA glycosylase |
Protein accession | YP_831033 |
Protein GI | 116670100 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0266] Formamidopyrimidine-DNA glycosylase |
TIGRFAM ID | [TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0751871 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACATCA CCCTTCCCCG CAGGGAGCGC CTTGGCGGAC CCGTGTGGCA CAGTAGTGGC ATGCCAGAAC TTCCCGAAGT GGCCGGGCTG GGCGCCTTCC TGGGCGACCG GCTTCGCGGA GCTGTGCTGA CGAAAATCCA GATCGTTTCG TTCGCGGTCC TCAAAACGGC GGACCCGCCA TATACGGCGC TGGAAGGCCG CACCATTTCC GGCGTCCAGC GCCGGGGCAA GTTCATCATC ATTGATGCCG ACGGCATCTA TCTCGCGTTC CACCTCGCCA AGGCCGGCTG GCTGCGGTAC ACCGAATCGC CGTCGAACGC CCTTTTGCCA CGGGGCAAAG GGTATATAGC CGCAAGGTTC GAGTTCTCCA GGATCCGCCC TGATGCCGAC GGCGGCGAAG CCCATCTGGG GATCGACCTC ACCGAGGCGG GAACAAAGAA AAGCCTGGCC CTCTACGTAG TCCGCGACCC GGAGGATATC CCCGGCATCG CAAGCCTCGG CCCGGATCCG TTGAGCGCCT CGTTCACCCT TGACGCCTTC GCTGAAATTC TTTCCTCGAG CAGCCAGCAG ATTAAGGGAC TGTTACGAAA CCAGGGGGTG ATCGCCGGCA TCGGCAACGC CTACAGCGAC GAAATCCTCC ACGCTGCCCG GATATCCCCC TTCGCCACCG CGAAGTCACT CGACCCGGAG TCCGTCCGCG TCCTGTACGA CTCGGTGCAC AACATTCTGG GGGCCGCCGT GGCGGAGGCT GTGGGAAAGG CTCCGAACGA ATTGAAGGAC GCGAAGCGGA GCACCATGCG GGTCCATGGC CGGACCGGCC AGGCGTGCCC GGTCTGCGGG GACACGGTCC GGGAGGTGTC ATTTGCGGAC AGGGCGCTCC AGTATTGCCC GCGCTGCCAG ACAGGCGGCA AGATCCTCGC GGACCGGCGG ACGTCGCGTT TCCTGAAGTA G
|
Protein sequence | MHITLPRRER LGGPVWHSSG MPELPEVAGL GAFLGDRLRG AVLTKIQIVS FAVLKTADPP YTALEGRTIS GVQRRGKFII IDADGIYLAF HLAKAGWLRY TESPSNALLP RGKGYIAARF EFSRIRPDAD GGEAHLGIDL TEAGTKKSLA LYVVRDPEDI PGIASLGPDP LSASFTLDAF AEILSSSSQQ IKGLLRNQGV IAGIGNAYSD EILHAARISP FATAKSLDPE SVRVLYDSVH NILGAAVAEA VGKAPNELKD AKRSTMRVHG RTGQACPVCG DTVREVSFAD RALQYCPRCQ TGGKILADRR TSRFLK
|
| |