Gene Arth_4520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4520 
Symbol 
ID4443341 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008537 
Strand
Start bp142683 
End bp143885 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content62% 
IMG OID639687573 
Productphage integrase family protein 
Protein accessionYP_829270 
Protein GI116662215 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.167996 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCTCT CGGATCGCAG TTGGGATAGG CATCGAGCGG AGGGACTGAC TGTCGCGAAC 
GTCGGGCGGG TGATTCCTCG CTCCAGTGTT CCTGGGTTCG TGGTCCTCGA CGCGATGGGC
GAGGAGTTCG CTCCTGCCAC GGAGTATCTG CTTGAGCTGG CAGCGTCGGA CCGTTCGCCT
CAGACGGTAC GAACTTATGC CTTGTCGTTG CTTCGGTTCC TTCGCTTTTT GTGGGCTGTC
GGGGTCAGCT GGGAGCAGGC AACGTCCCTT GAAGCGCGTG ACTTCGTGTT GTGGGCGCGC
CAGGCCGAGA AGTTCGTCGG GAATCGTAAC GTCCCGCAGC GGCGGGGGAG CCGGAACCTT
GTCACGGGAA AGAAGCATCT AGGAATGCGT TACTCACCCT CGACGATCAA CCACACGACG
ACGGTGTGTA AGGAGTTTTA CGCTTTCCAG CTTCGGATGG GTGACGGGCC CATCGTGAAT
CCTTTCGAGC TGCGCCGTGG GCGATCCCAT GCGCATCATG ATCCTCAGCG GGAGTTTGCC
CCGGTGCGGC GCCAGCCCTT GCGTCAGCGC GAGGCGCACC GGGTGCCCAG ATCAATCCCG
GACGGAAAGT TCAACGATCT GTTTCGCCGT TTGCGGTCCA ACAGGGACCG GGCGTTGGTG
GCGTTTTATG TCAGCAGCGG TGCACGGGCG AGCGAGCTGC TCGGGCTCAC GGGTGACCGG
GTCAACGTGG GTGACCAGCT GATCGGCGTT TACCGCAAAG GTGGCCAGCT GCAATGGTTG
CCCGCGGCGC CTGATGCTTT CGTATGGCTT CGGCTCTATC AGCTCGAAGG AGGCGTTGCC
GGCCCGGATG AACCGGTTTG GCTCACGCTG CGGGGCGAGC CCCGTCCTCT GACCTACGAA
GCCATGCGCG CCGTACTGAG ACGCTGCAAC GACCTGCTCG GTTCGAACTG GACGCTGCAC
GATCTGCGGC ACACGTTCGC GATCCGAGCG CTCGAGGGCG GGATGGGCCT TCACGAAGTC
CAGGAGTTAC TGGGTCACCA ATCGCGGACT ACGACCACGG TCTATGCGGT TCCGCATATG
GAGGAAGTCA TCGAGCACTA CCGGACCCAT CTGAGCAGCA GGACTTCCCC TGCCATTGAC
AGTTCACCGG CCGGCCAGCC CTATAACCCC GACGAGTTGC GCGTGCTTTG GGGGAACCAG
TGA
 
Protein sequence
MDLSDRSWDR HRAEGLTVAN VGRVIPRSSV PGFVVLDAMG EEFAPATEYL LELAASDRSP 
QTVRTYALSL LRFLRFLWAV GVSWEQATSL EARDFVLWAR QAEKFVGNRN VPQRRGSRNL
VTGKKHLGMR YSPSTINHTT TVCKEFYAFQ LRMGDGPIVN PFELRRGRSH AHHDPQREFA
PVRRQPLRQR EAHRVPRSIP DGKFNDLFRR LRSNRDRALV AFYVSSGARA SELLGLTGDR
VNVGDQLIGV YRKGGQLQWL PAAPDAFVWL RLYQLEGGVA GPDEPVWLTL RGEPRPLTYE
AMRAVLRRCN DLLGSNWTLH DLRHTFAIRA LEGGMGLHEV QELLGHQSRT TTTVYAVPHM
EEVIEHYRTH LSSRTSPAID SSPAGQPYNP DELRVLWGNQ