Gene Arth_2200 details


Gene Information

Locus tag: Arth_2200
Symbol:
ID: 4445261
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2478569
End bp: 2479675
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 63%
IMG OID: 639690009
Product: phage integrase family protein
Protein accession: YP_831680
Protein GI: 116670747
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:
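
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the span includes the stop codon. Neither is stated in the record, but both are consistent with the listed lengths. The function name `cds_lengths` is illustrative only.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive coordinates and a span that ends with a stop codon.
    """
    gene_len = end_bp - start_bp + 1      # inclusive coordinates
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1       # the stop codon encodes no residue
    return gene_len, protein_len


# Start bp and End bp copied from the record above.
gene_len, protein_len = cds_lengths(2478569, 2479675)
print(gene_len, protein_len)  # 1107 368
```

Under these assumptions the result agrees with the Gene Length (1107 bp) and Protein Length (368 aa) fields.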


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.343714
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGGTCG ACGGGTCCGG GCGTGTGCTG CAGCTGAGCG CTGTTCAGCT GCTGCATCCC 
GAGGAACAGA CGCTCGAAGA CATGCTGACC GGCTGGCGCA ACCAGCAGCT CTCCAGGAAC
CTCCAGTTCG ACACAGTCGA CAAGGGCATC GAGTGCGTCC GCCGGTTCGT CAACCATGTG
AACGAGTTCC CGTGGAACTG GGCACCGGAG CAAGTCGAGG AGTATTTCGG TGACCTCCGC
TCGATCCACC AGCTGAAGCA CTCCACTATC CGCGGCTACC AGTCCACGCT CCGCCGGTTC
ACGTCCTACG TGTCGAACCC CGACTACGGC TGGGACCAGG TCTGCGAACA ACGCTTCGGC
ACACACCCCT CCCAGGTCTT CTTTGACTGG AACACCGCCA CCCACACGCA GGAGTACGAA
GGACGCGCCT CCAAGCGGCC CTTCACCAAG ACCGAACTGC AGATGCTGTT CGATCACGCC
GACGACCAGG TCGAACTCAT CGCCGCCTCA GGCAAGAAAG GCTGGCAGGC AGCCTACCGG
GACGCCGTCA TGCTGAAAGT CGCCTACTCG TACGGGCTCA GATTCAACGA GCTCCGGCAC
CTGCAAACCA TCGACTTTGC GGCCAACCCC CAAGCACGAA GGTTCGGCAA GGCAGGCGTC
TGCAAGGTCC GGTTCGGCAA ATCACGCAAG GGCTCCCCCC ACAAACCCCG CAGCGTCCTG
ACGGTCTTCG ACTGGACCGC CGGAGTCATC GAGGACTGGC TCGCCAACGG ACGAGGCACA
CTCGACACCT TGGACCTGTT CCCCAGCGAA CGCGGCGGCC TGATCTGTGA ATCCACCCTG
CTGCGCCGGC TCCGGCGCTA CCTCAACGAG CTGGGCCTGC CAATGGATGG CCTGGACCTG
CATTCGCTCC GGCGCTCCTA TGCAACGCAC CTGCTCGAGG ACGGATGGGA TCCTAGATTC
GTGCAACATC AAATGGGCCA CGAACACGCC TCCACCACCG GGATCTACCA GTTCGTCAGC
GACGACTTCC GCAACACGAC CCTCCGGGCG GCCCTGGACC GCACCATGGA CGAAGTCCTG
GGCGTGCAGA TGCGAGGTCA ATGGTGA
 
Protein sequence
MAVDGSGRVL QLSAVQLLHP EEQTLEDMLT GWRNQQLSRN LQFDTVDKGI ECVRRFVNHV 
NEFPWNWAPE QVEEYFGDLR SIHQLKHSTI RGYQSTLRRF TSYVSNPDYG WDQVCEQRFG
THPSQVFFDW NTATHTQEYE GRASKRPFTK TELQMLFDHA DDQVELIAAS GKKGWQAAYR
DAVMLKVAYS YGLRFNELRH LQTIDFAANP QARRFGKAGV CKVRFGKSRK GSPHKPRSVL
TVFDWTAGVI EDWLANGRGT LDTLDLFPSE RGGLICESTL LRRLRRYLNE LGLPMDGLDL
HSLRRSYATH LLEDGWDPRF VQHQMGHEHA STTGIYQFVS DDFRNTTLRA ALDRTMDEVL
GVQMRGQW
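
Both sequences are printed in space-separated blocks of 10, so the whitespace has to be removed before any downstream use. As a minimal sketch, the code below normalizes the final line of the gene sequence and translates it. The codon table is deliberately partial: it holds only the nine codons on that line, whose assignments are the same in the standard code and in translation table 11. The helper names `normalize` and `translate` are hypothetical, and the reading frame is assumed to start at the first base of the line, which holds because the 18 preceding lines carry 60 bases each.

```python
# Partial codon table: only the codons present on the last gene-sequence line.
CODONS = {
    "GGC": "G", "GTG": "V", "CAG": "Q", "ATG": "M", "CGA": "R",
    "GGT": "G", "CAA": "Q", "TGG": "W", "TGA": "*",
}


def normalize(blocked):
    """Remove the block spacing and line breaks from a displayed sequence."""
    return "".join(blocked.split()).upper()


def translate(seq):
    """Translate in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODONS[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


last_line = "GGCGTGCAGA TGCGAGGTCA ATGGTGA"  # final line of the gene sequence
tail = translate(normalize(last_line))
print(tail)  # GVQMRGQW
```

The translated tail matches the final line of the protein sequence, and the gene ends on the stop codon TGA.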