Gene Arth_4521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4521 
Symbol 
ID4443342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008537 
Strand
Start bp143882 
End bp146293 
Gene Length2412 bp 
Protein Length803 aa 
Translation table11 
GC content62% 
IMG OID639687574 
Productphage integrase family protein 
Protein accessionYP_829271 
Protein GI116662216 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0704652 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACAACTC CACCCATCAA TAAGATCCAG ACCGCACGAC GAAGCTGGAA TGACCACACG 
CCGCCACCAC AGGAACAACA ACCGCCTGCA CTGCCGCCCA TCCCGCCTGG ACCGCTGACC
AGCGCCAGCA TTGAGGACAT CATCAACCGT CTTGAGGACT GCAGCCCGTA CTGGCCACAG
GTGCATGAGC ACAGGGCGCG CATCCGGCGC TTCCTCGGTC ATTTGAGGCA GTTGCCGGGT
GAAACCTGGC AGGACCGCTG GGCGCTGTTC GAGGGCCATG TCGGGCATGA CGCTCTGACG
TGGAGGCTCA GAATAGCCGG CGGTTCAAGC AAGAACACGA AGAACCGCAA CGAGGTCGAA
GCGCTCAACG CGGCGATCGG ACCGCTGCTG GCCCTCGACG TCATGCGCCC CTCCCACCGG
TGGATCGCGG GTCAGCGCTT TGCCTTCTGG AAAGCACTCA GCCGGTTCCG GGACCCCGCC
CACACAGCTG AACTCGAAAA CGCCCTCCTC CATGCGAAGG CTTCACCGGC CCAAGCCACT
AGCGGGATCC TCACCCTAGG TCGCATTCAG GCCCATACAG GCAAATCGAT CCGCCAACTA
ACAGCCGCAG ACATCCTCGA CCTGACTACC CAGCTTCCCG GGAAGCTGAT CGACGTCAGC
CGCAGCGGGC TGACCGCCGT ATGGCCCGTG CTCCACGGGC TGGGATGGAT CAAACACGAT
TCCGGGACCA TGCCCACACG GCTCCGCGTC GGACCGAAAA CGGTGGATGA GATGGTCGAC
TACTACGACA TCACCGGCCC CTGCCGGGAA CCACTCATCC GCTATCTTCA GGTCCGGTCG
GCTGGCCTCG ACTACGCCTC GCTCTGTGCT TTGGGCAGGT ATCTGGCCGG CCTCTTCTTC
GCCGACATCC TCAAGCACTA CCCTGACCAA CGCAGCTTCG CACTGACCCC GAAGCAGGCG
GCTGGCTGGA AAGCCAGGGT CAAAGTCAAA GCCGACGGCT CGCCGCGCCG GGAAGTTTAC
AGCGTCTTCT TCTCCATCCG GGCGTTCTAT CTCGACATCA GCCAATGGGC ACTCGAAGAC
GCCTTCTGGG CGCAATGGGC CGCCCCAAGC CCTATAACCT TCGCCGAAAC CCGCGGTTAC
GTGAAGCACC GGCGCGCCAC CATCGCAACG ATGCAACAGC GCACCCGTGA ACTGGCCCCC
GTGTTGCCCC GTCTGGTGGA GGCCGCAGAA CAGGCCCTCC GCGCAGCAGA AGCAACCCTG
GCGGATGCCA CCACGGCCGG CGCCGGCCAA ACCATCGACA TGGACGGCGA GGCTTGGAAA
GTCTTCCAGA GCAGTGAACA TGCGCATGTC CGGATTCGGC GCAATGGAAA GGACCGGGAC
CTGTCCTTCG AAGAGGACAA CGCCTTCTGG ACTTGGGCGC TTGTTGAGAC ACTTCGACAC
ACCGGTGCAA GGATCGAAGA AGTCCTCGAG CTGGTGCACA TGAGCATCCA GCCCTACAAG
GTTCCAACGA CCGGGGAGAC CATACCGCTG CTGCACTTCG CTCCGAGCAA GACCGACACA
GAGCGTCTAA TGGTGGCAGG CCCCGAGCTC GTGCACGTGC TCTACCGAGT GCTTTCTCGT
ATCACCAAGG ACACTGGTGG TGCACCTCTG ACACAGCGAT GGGACTCAGC CGAGAAAGTG
CTGAGCTCAC CCCTTCCGCA CCTGTTCGCC CGCCGCCGTG GCGCCGGACC GCCGAACGTG
ATGACGACGG CCACCGTCTC CACGCTGCTC AATGACTTGG CAGGACGCGC ACATATCAAG
GTCAGCGGTG CCGAGGTCAG GTTCACCCCG CACGATTTCC GTCGCATCTT CGCGACCGAG
GCACTTGCCA GCGGCCTGCC TCCACACATC GTGCAGGTGC TGATGGGGCA CGCCAGCCTG
GCGACGACTC AGGGATACGC AGCGATTTAT CCCAGGGACG TCATCCGGCA CCACCGCACC
TTCATCGAAA AGCGACGGGT CATTCGGCCA ACCGAAGAAT ACCGCGAACC AACCGCCGCC
GAATGGGACG AATTCGAAGC CCATTTCGTG CAACGCAAAC TCAGCCTGGG CAGCTGCGGA
CGCGCCTACG GCACAGGCTG CCAACACGAA CACGCCTGCC TCCGCTGCGC CCTTCTCCGT
CCGGATCCCA GCCAGATCAA CCGGCTCCAG GACATCATCG ACAACCTCGA AGAACGCATC
ACCGAAGCCG AACAACACGG CTGGCTCGGC GATGCCGAAG GACTGCGGGT CACGCTCAAC
AGCGCCGAAA TGAAACTGGC CCAAATGTTC AAACTACAGA GCCAACGAAA CAGCAAAATC
GTCGACCTCG GAACTCCACA AATGCGGAAC TCCACGGGTG CCGGTTATCA CCCTCACCCG
CCACAACAGT GA
 
Protein sequence
MTTPPINKIQ TARRSWNDHT PPPQEQQPPA LPPIPPGPLT SASIEDIINR LEDCSPYWPQ 
VHEHRARIRR FLGHLRQLPG ETWQDRWALF EGHVGHDALT WRLRIAGGSS KNTKNRNEVE
ALNAAIGPLL ALDVMRPSHR WIAGQRFAFW KALSRFRDPA HTAELENALL HAKASPAQAT
SGILTLGRIQ AHTGKSIRQL TAADILDLTT QLPGKLIDVS RSGLTAVWPV LHGLGWIKHD
SGTMPTRLRV GPKTVDEMVD YYDITGPCRE PLIRYLQVRS AGLDYASLCA LGRYLAGLFF
ADILKHYPDQ RSFALTPKQA AGWKARVKVK ADGSPRREVY SVFFSIRAFY LDISQWALED
AFWAQWAAPS PITFAETRGY VKHRRATIAT MQQRTRELAP VLPRLVEAAE QALRAAEATL
ADATTAGAGQ TIDMDGEAWK VFQSSEHAHV RIRRNGKDRD LSFEEDNAFW TWALVETLRH
TGARIEEVLE LVHMSIQPYK VPTTGETIPL LHFAPSKTDT ERLMVAGPEL VHVLYRVLSR
ITKDTGGAPL TQRWDSAEKV LSSPLPHLFA RRRGAGPPNV MTTATVSTLL NDLAGRAHIK
VSGAEVRFTP HDFRRIFATE ALASGLPPHI VQVLMGHASL ATTQGYAAIY PRDVIRHHRT
FIEKRRVIRP TEEYREPTAA EWDEFEAHFV QRKLSLGSCG RAYGTGCQHE HACLRCALLR
PDPSQINRLQ DIIDNLEERI TEAEQHGWLG DAEGLRVTLN SAEMKLAQMF KLQSQRNSKI
VDLGTPQMRN STGAGYHPHP PQQ