Gene Information |
Locus tag | Arth_2500 |
Symbol | |
ID | 4444906 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 2803749 |
End bp | 2804729 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639690315 |
Product | formamidopyrimidine-DNA glycosylase |
Protein accession | YP_831979 |
Protein GI | 116671046 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0266] Formamidopyrimidine-DNA glycosylase |
TIGRFAM ID | [TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) |
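
The coordinate, length, and protein-length fields above are mutually consistent, which can be checked with simple arithmetic. The sketch below assumes that the start and end coordinates are inclusive (1-based) and that the final codon is a stop codon that is not counted in the protein length; both assumptions match the reported values but are not stated in the record itself. The variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 2803749
end_bp = 2804729

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not encode a residue.
protein_length = gene_length // 3 - 1

print(f"Gene length: {gene_length} bp")        # matches the "Gene Length" field
print(f"Protein length: {protein_length} aa")  # matches the "Protein Length" field
```

Because the gene lies on the minus strand, the listed sequence is presumably the reverse complement of the replicon between these coordinates; the length calculation is unaffected by strand.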
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCCGAAC TGCCGGAAGT CGAAGTTGTC CGCCGCGGCC TGGTGAGCTG GGTCCGCGGC AGGACGATCA CTTCTGTCGA CGTCCTGGAT CCGCGTTCAA TCCGCCGGCA CGCCCTCGGC GCCCAGGACT TCACCGGCAA CCTCGAAGGC TCCCGGGTCC TGGATGTGGT GCGTCGCGGA AAATTCCTCT GGCTACCGCT GGAGGAGGCG GCAGCAGTCC AGCCAGGTAC TGACGGCATT CCGGCCGCAG GCACGTCCCG GCCGCGAGTG GCGCTCATGG CCCACCTGGG AATGAGCGGC CAGCTGCTGA TGCAGGATTC CGTGGTACCG GATGAAAAGC ACCTAAAAGT CCGCCTGCGG CTGAGCCCCG CCCACGGCAT GCCGGAACAA CTCAGATTCG TGGACCAACG CATCTTTGGG GGTCTGTTTG TCACGTCGCT GGTGCCAACG GCCGACGGCG GACCCGGCGG CCTTGGGGAG GTCCCGGAGC CGTTTATTGC CGAAGAGGCG TCCCACATCG CCCGGGATCC CCTGGATCCC TATTTTTCCT TCGATTCCTT TTACCGCCGG CTGCGGAGCC GTAAGACTGG ACTCAAACGT GCGCTGCTGG ACCAGGGACT CGTTTCCGGG ATCGGCAACA TCTATGCAGA CGAGGCACTG TGGCGGGCGC GCCTCCACTA CGCCCGGCCC ACCGAAACAC TCCGCCGCGC CGATGCGCTG CGGGTTCTCG ACGCCGCCCG TGAGGTGATG CTGGACGCCC TTGCCGCCGG CGGGACAAGC TTCGACTCCC TCTACGTCAA TGTAAACGGC GCCTCCGGGT ACTTTGACCG GTCGCTTAAC GCGTACGGCA GGGAAAACCA GGAGTGCAAA CGCTGCGCCG CTGCAGGCAT CGTAAGCCTG ATGAAGCGCG AACAATTCAT GAACCGGTCC TCCTATACCT GCCCCGTTTG CCAGCCCCGT CCCCGCAACG GCCGGTGGTG A
|
Protein sequence | MPELPEVEVV RRGLVSWVRG RTITSVDVLD PRSIRRHALG AQDFTGNLEG SRVLDVVRRG KFLWLPLEEA AAVQPGTDGI PAAGTSRPRV ALMAHLGMSG QLLMQDSVVP DEKHLKVRLR LSPAHGMPEQ LRFVDQRIFG GLFVTSLVPT ADGGPGGLGE VPEPFIAEEA SHIARDPLDP YFSFDSFYRR LRSRKTGLKR ALLDQGLVSG IGNIYADEAL WRARLHYARP TETLRRADAL RVLDAAREVM LDALAAGGTS FDSLYVNVNG ASGYFDRSLN AYGRENQECK RCAAAGIVSL MKREQFMNRS SYTCPVCQPR PRNGRW
|
| |
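
The gene sequence begins with GTG and ends with TGA, while the protein begins with M. Under translation table 11 (bacterial, archaeal, and plant plastid code), GTG is an accepted alternative start codon and is read as methionine at initiation, so the two sequences agree. The following is a minimal sketch for sanity-checking a sequence pasted from a record like this one; the helper names (`clean`, `gc_fraction`, `looks_like_cds`) are hypothetical, and the start-codon set is only the commonly used subset of those allowed by table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Assumption: a common subset of the start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the whitespace that splits the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def looks_like_cds(seq: str) -> bool:
    """Check length divisible by 3, a plausible start codon, and a stop codon."""
    s = clean(seq)
    return len(s) % 3 == 0 and s[:3] in START_CODONS and s[-3:] in STOP_CODONS


# Toy input for illustration only: the first three codons of the gene plus its stop codon.
toy = "GTGCCCGAA TGA"
print(looks_like_cds(toy))
print(round(gc_fraction(toy), 2))
```

Passing the full gene sequence from the record to `gc_fraction` should give a value close to the reported GC content, and `looks_like_cds` should accept it if the sequence was copied intact.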