Gene Arth_4274 details


Gene Information

Locus tag: Arth_4274
Symbol:
ID: 4443442
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008538
Strand:
Start bp: 6788
End bp: 7879
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 66%
IMG OID: 639687595
Product: phage integrase family protein
Protein accession: YP_829292
Protein GI: 116662238
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:
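
The start, end, gene length and protein length fields are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes inclusive 1-based coordinates and a final stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for inclusive 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length(6788, 7879)
print(length, protein_length(length))  # 1092 363
```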


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGCTGG TGGCGTTGCC GGGTGGCGGC GCCGTGGATC CAGCGTTCTG GAGTGTGCAG 
GCAGCCGAGG ACTTTGAACA GGAGATCGTC GACCAGTACG CGCTGGCGAT GGCCGCGGCT
GGTCTGAGTG ACTCGCATAT CCGCAATACG CGGTCGACAA TCATCGAATT CGCCCGTTCG
GTGACCGGCC CGCTGTGGGC GGCAACGTGT CAGGATGCCG ACCGGTTCCT GGCCGAGCAG
AGACGGCTGG GCCGCAGCGT GAGCACCAGG GCCGGCAAGG CAGGGACGCT GGCGTTGTTT
TACGAGTTCA TGATCAGCCG GTACCAGGGC CGGATCCACC GGTTGACCGG GGTACTGGTC
GAGCAGCCTA TCGACGAGTT CAACCGCCAG GCCGGGGCAT CGCTGGGCAA GGTCCGTGTG
CCGCCGTCGG ATGCCGAGAT TGACGCGTTC TTCACCTGTT GGAGGCACTC GATACCCCAG
GCCCGCAAGT ATTTGCCTGC CGCGCGGGAT TACTTCGCTG CTTCGCTGTG GCGCCGGCTG
GGGTTGCGGA TCACCGAGAC GGTGATGCTC GACATCCGTG ATTGGCGCCC TGACCTGGGC
GGGTTCGGCA AGCTCCACGT GCGGTACGGC AAAGGAGCCC ACGGCCGTGG CCCCAAGCCG
CGCCTCGTCC CGGCTATCAA CGGCGCGGCC GAGCTGATCG ACTGGTGGCT GGGCGACGTC
CGGCACCGGT ACGGCGAGGA CTGGGCCGAC CCCGACGCAC CCCTGCTCCC CTCGGAACGG
TTTGACCGTG AGCTGGGACG ATGCGGCCGG GTCGGTGGCA ACGCGCTGCG GCGAAGTCTG
GGGCTGCAGG TCGACCAGTG GCTGCCGGCA TGGTCCGGAA GGATGACTCC CCATGTTCTG
CGTCATTACT GCGCTTCCTC GCTCTACGGG GCAGGGATGG ACATCAAGGC CCTCCAGGAG
CTGCTTGGGC ATCAGTGGCT CTCGACTACC TCGGGCTACA TCCACGTGCG CAGCGAGCAC
GTCGAGCAGG CCTGGAAGAA CGCCAACGAG CGGGTCGAGT CCCGCTTCGC GACCACACAG
AAGGAAGGAT GA
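
The GC content reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted as a string with the 10-base spacing used above; `gc_content` is a hypothetical helper name, and only the first row of the sequence is used here for illustration.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T bases of a sequence."""
    # Keep only nucleotide letters, so spaces and line breaks are ignored.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    return sum(b in "GC" for b in bases) / len(bases)


# First row of the gene sequence only; paste the full sequence to check the
# gene-wide value.
first_row = "ATGGCGCTGG TGGCGTTGCC GGGTGGCGGC GCCGTGGATC CAGCGTTCTG GAGTGTGCAG"
print(f"{gc_content(first_row):.0%}")
```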
 
Protein sequence
MALVALPGGG AVDPAFWSVQ AAEDFEQEIV DQYALAMAAA GLSDSHIRNT RSTIIEFARS 
VTGPLWAATC QDADRFLAEQ RRLGRSVSTR AGKAGTLALF YEFMISRYQG RIHRLTGVLV
EQPIDEFNRQ AGASLGKVRV PPSDAEIDAF FTCWRHSIPQ ARKYLPAARD YFAASLWRRL
GLRITETVML DIRDWRPDLG GFGKLHVRYG KGAHGRGPKP RLVPAINGAA ELIDWWLGDV
RHRYGEDWAD PDAPLLPSER FDRELGRCGR VGGNALRRSL GLQVDQWLPA WSGRMTPHVL
RHYCASSLYG AGMDIKALQE LLGHQWLSTT SGYIHVRSEH VEQAWKNANE RVESRFATTQ
KEG
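
The protein sequence follows from the gene sequence under translation table 11. The sketch below is one way to reproduce it, assuming the amino-acid assignments of NCBI table 11 written in TCAG codon order; `translate` is an illustrative helper, and it does not handle alternative start codons, which is harmless here because the gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments for translation table 11, codons ordered TTT, TTC, TTA, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCGCTGG TGGCGTTGCC G"))  # MALVALP (start of the protein)
print(translate("AAGGAAGGAT GA"))            # KEG (end of the protein)
```

Passing the full gene sequence to `translate` should give the protein sequence listed above, under the same assumptions.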