Gene Arth_2500 details

Gene Information

Locus tag: Arth_2500
Symbol:
ID: 4444906
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2803749
End bp: 2804729
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 65%
IMG OID: 639690315
Product: formamidopyrimidine-DNA glycosylase
Protein accession: YP_831979
Protein GI: 116671046
COG category: [L] Replication, recombination and repair
COG ID: [COG0266] Formamidopyrimidine-DNA glycosylase
TIGRFAM ID: [TIGR00577] formamidopyrimidine-DNA glycosylase (fpg)
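
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive at both ends (the record does not state this, but it is the reading under which the listed 981 bp comes out), and that the last codon is a stop codon that adds no residue. The variable names are illustrative only.

```python
# Coordinates copied from the record (assumed 1-based, inclusive on both ends).
start_bp = 2803749
end_bp = 2804729

# Inclusive span: both endpoints count, hence the +1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is assumed to be the stop codon,
# which does not encode an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 981 0 326
```

A remainder of 0 means the span is a whole number of codons, which is consistent with the record listing the gene as neither spliced nor a pseudogene.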


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCCCGAAC TGCCGGAAGT CGAAGTTGTC CGCCGCGGCC TGGTGAGCTG GGTCCGCGGC 
AGGACGATCA CTTCTGTCGA CGTCCTGGAT CCGCGTTCAA TCCGCCGGCA CGCCCTCGGC
GCCCAGGACT TCACCGGCAA CCTCGAAGGC TCCCGGGTCC TGGATGTGGT GCGTCGCGGA
AAATTCCTCT GGCTACCGCT GGAGGAGGCG GCAGCAGTCC AGCCAGGTAC TGACGGCATT
CCGGCCGCAG GCACGTCCCG GCCGCGAGTG GCGCTCATGG CCCACCTGGG AATGAGCGGC
CAGCTGCTGA TGCAGGATTC CGTGGTACCG GATGAAAAGC ACCTAAAAGT CCGCCTGCGG
CTGAGCCCCG CCCACGGCAT GCCGGAACAA CTCAGATTCG TGGACCAACG CATCTTTGGG
GGTCTGTTTG TCACGTCGCT GGTGCCAACG GCCGACGGCG GACCCGGCGG CCTTGGGGAG
GTCCCGGAGC CGTTTATTGC CGAAGAGGCG TCCCACATCG CCCGGGATCC CCTGGATCCC
TATTTTTCCT TCGATTCCTT TTACCGCCGG CTGCGGAGCC GTAAGACTGG ACTCAAACGT
GCGCTGCTGG ACCAGGGACT CGTTTCCGGG ATCGGCAACA TCTATGCAGA CGAGGCACTG
TGGCGGGCGC GCCTCCACTA CGCCCGGCCC ACCGAAACAC TCCGCCGCGC CGATGCGCTG
CGGGTTCTCG ACGCCGCCCG TGAGGTGATG CTGGACGCCC TTGCCGCCGG CGGGACAAGC
TTCGACTCCC TCTACGTCAA TGTAAACGGC GCCTCCGGGT ACTTTGACCG GTCGCTTAAC
GCGTACGGCA GGGAAAACCA GGAGTGCAAA CGCTGCGCCG CTGCAGGCAT CGTAAGCCTG
ATGAAGCGCG AACAATTCAT GAACCGGTCC TCCTATACCT GCCCCGTTTG CCAGCCCCGT
CCCCGCAACG GCCGGTGGTG A
 
Protein sequence
MPELPEVEVV RRGLVSWVRG RTITSVDVLD PRSIRRHALG AQDFTGNLEG SRVLDVVRRG 
KFLWLPLEEA AAVQPGTDGI PAAGTSRPRV ALMAHLGMSG QLLMQDSVVP DEKHLKVRLR
LSPAHGMPEQ LRFVDQRIFG GLFVTSLVPT ADGGPGGLGE VPEPFIAEEA SHIARDPLDP
YFSFDSFYRR LRSRKTGLKR ALLDQGLVSG IGNIYADEAL WRARLHYARP TETLRRADAL
RVLDAAREVM LDALAAGGTS FDSLYVNVNG ASGYFDRSLN AYGRENQECK RCAAAGIVSL
MKREQFMNRS SYTCPVCQPR PRNGRW
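
The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG can act as an initiator codon and is then read as methionine. The following is a minimal sketch of how the two sequences relate, not the pipeline that produced this record. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and only GTG and TTG are handled as alternative starts here, although table 11 lists a few more.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. The amino acid assignments
# of translation table 11 match the standard code; only start codons differ.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: only the two most common alternative initiators are handled.
ALT_STARTS = {"GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the 10-base grouping spaces and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in ALT_STARTS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence as printed in the record.
first_row = "GTGCCCGAAC TGCCGGAAGT CGAAGTTGTC CGCCGCGGCC TGGTGAGCTG GGTCCGCGGC"
print(translate(first_row))  # MPELPEVEVVRRGLVSWVRG
```

Passing the full gene sequence to `translate` and `gc_content` in the same way gives values that can be compared against the protein sequence and the GC content field in the record.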