Gene Arth_1539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1539 
Symbol 
ID4445940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1714818 
End bp1715768 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content64% 
IMG OID639689354 
ProductDNA-(apurinic or apyrimidinic site) lyase / formamidopyrimidine-DNA glycosylase 
Protein accessionYP_831033 
Protein GI116670100 
COG category[L] Replication, recombination and repair 
COG ID[COG0266] Formamidopyrimidine-DNA glycosylase 
TIGRFAM ID[TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0751871 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACATCA CCCTTCCCCG CAGGGAGCGC CTTGGCGGAC CCGTGTGGCA CAGTAGTGGC 
ATGCCAGAAC TTCCCGAAGT GGCCGGGCTG GGCGCCTTCC TGGGCGACCG GCTTCGCGGA
GCTGTGCTGA CGAAAATCCA GATCGTTTCG TTCGCGGTCC TCAAAACGGC GGACCCGCCA
TATACGGCGC TGGAAGGCCG CACCATTTCC GGCGTCCAGC GCCGGGGCAA GTTCATCATC
ATTGATGCCG ACGGCATCTA TCTCGCGTTC CACCTCGCCA AGGCCGGCTG GCTGCGGTAC
ACCGAATCGC CGTCGAACGC CCTTTTGCCA CGGGGCAAAG GGTATATAGC CGCAAGGTTC
GAGTTCTCCA GGATCCGCCC TGATGCCGAC GGCGGCGAAG CCCATCTGGG GATCGACCTC
ACCGAGGCGG GAACAAAGAA AAGCCTGGCC CTCTACGTAG TCCGCGACCC GGAGGATATC
CCCGGCATCG CAAGCCTCGG CCCGGATCCG TTGAGCGCCT CGTTCACCCT TGACGCCTTC
GCTGAAATTC TTTCCTCGAG CAGCCAGCAG ATTAAGGGAC TGTTACGAAA CCAGGGGGTG
ATCGCCGGCA TCGGCAACGC CTACAGCGAC GAAATCCTCC ACGCTGCCCG GATATCCCCC
TTCGCCACCG CGAAGTCACT CGACCCGGAG TCCGTCCGCG TCCTGTACGA CTCGGTGCAC
AACATTCTGG GGGCCGCCGT GGCGGAGGCT GTGGGAAAGG CTCCGAACGA ATTGAAGGAC
GCGAAGCGGA GCACCATGCG GGTCCATGGC CGGACCGGCC AGGCGTGCCC GGTCTGCGGG
GACACGGTCC GGGAGGTGTC ATTTGCGGAC AGGGCGCTCC AGTATTGCCC GCGCTGCCAG
ACAGGCGGCA AGATCCTCGC GGACCGGCGG ACGTCGCGTT TCCTGAAGTA G
 
Protein sequence
MHITLPRRER LGGPVWHSSG MPELPEVAGL GAFLGDRLRG AVLTKIQIVS FAVLKTADPP 
YTALEGRTIS GVQRRGKFII IDADGIYLAF HLAKAGWLRY TESPSNALLP RGKGYIAARF
EFSRIRPDAD GGEAHLGIDL TEAGTKKSLA LYVVRDPEDI PGIASLGPDP LSASFTLDAF
AEILSSSSQQ IKGLLRNQGV IAGIGNAYSD EILHAARISP FATAKSLDPE SVRVLYDSVH
NILGAAVAEA VGKAPNELKD AKRSTMRVHG RTGQACPVCG DTVREVSFAD RALQYCPRCQ
TGGKILADRR TSRFLK