Gene Arth_3451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3451 
Symbol 
ID4443849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3883613 
End bp3884725 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content67% 
IMG OID639691275 
Productputative DNA alkylation repair protein 
Protein accessionYP_832926 
Protein GI116671993 
COG category[L] Replication, recombination and repair 
COG ID[COG4335] DNA alkylation repair enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTGCCA TGAACGAACT GATTGACCAG GCCGCCGTCG GCCGGCTGGT TCGGGTCCTC 
GCGGACGCCG CCCCCGGGGC TTGCTGGTCC AACCTCGGCG ACGCAGGGTC TTCCCTGGGG
AACCTGAGCC TTCGCGAACG CACCGATCAT GTGAGCCGGG GACTGCTCGC CGACTTTGCC
GCTGCCGCCA GTCCGGCGGA CTATTCGACG GCGGCGCGCG TCTTCCGGAG CGCCCTCCTG
GATCCTGGCT TCACCGGCTG GACGCTCTGG CCGGTTACGG AAACAGCGGT AACGCTGGCC
TTGAATTCGA CCCGGTCCGC GGATTTTGAA GACTGCCTCC AGCTTCTGGC CGAACTGACT
CCGCGGCTGA CCGGGGAATT CGCCATCCGG CGGATGCTGG CCGCCGACCT GGACCGTGCA
CTCGCCGTCG TCCTGACCTG GACCGCCCAC CCTGACCAGC ATGTGCGCCG CCTCGCCAGC
GAAGGCACCC GACCGTATCT CCCGTGGGCG GTCCGGATTC CCGGCCTGGT CCAGCGCCCG
GACGCCACGA TTCCCATCCT GGACGCGCTC TACCGGGATC CACACGAGTA CGTCCGGCGT
TCAGTGGCCA ATCACCTCAA CGACCTGGCA CGCCATTCTC CCGAGGCGGT GCTGGCCGCA
GCTGCCGGCT GGACTGCCGC GCCGGACGCC AATACTCCGT GGGTGGTCCG GCATGGACTC
CGCACCCTCG TGAAGAAGGC CCACCCGGGC GCACTGGCCC TGCAGGGGTT CGCTCCCGCG
TCCCTCTCGG TATCCCCGCC GAGGCTGGAC CGGCACACCG TGGCCCTGCC GGCGGACCTC
GCCTTCGAAT TCGAGATCTC CAACACGGGT GTCGATCCGG CCAGGCTCGC GGTGGATTAC
ATCGTGCACT ACATGAAGGC AAACGGCTCA CAAACGGAGA AGGTCTTCAA ACTGGCGGCC
CTGACCCTGA ATCCCGGCGA AACCCGGACA GTGTCCAAAC GCCATGCGTT CCGCCAGATG
ACCACCCGGG TGCACCATCC GGGCAGCCAC GCTCTGGAGC TCCAGATCAA CGGCGTCCGG
TACGCCCACA CGCAGTTCCT CGTCGAGATC TGA
 
Protein sequence
MGAMNELIDQ AAVGRLVRVL ADAAPGACWS NLGDAGSSLG NLSLRERTDH VSRGLLADFA 
AAASPADYST AARVFRSALL DPGFTGWTLW PVTETAVTLA LNSTRSADFE DCLQLLAELT
PRLTGEFAIR RMLAADLDRA LAVVLTWTAH PDQHVRRLAS EGTRPYLPWA VRIPGLVQRP
DATIPILDAL YRDPHEYVRR SVANHLNDLA RHSPEAVLAA AAGWTAAPDA NTPWVVRHGL
RTLVKKAHPG ALALQGFAPA SLSVSPPRLD RHTVALPADL AFEFEISNTG VDPARLAVDY
IVHYMKANGS QTEKVFKLAA LTLNPGETRT VSKRHAFRQM TTRVHHPGSH ALELQINGVR
YAHTQFLVEI