Gene Arth_4490 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4490 
Symbol 
ID4443436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008537 
Strand
Start bp112294 
End bp113949 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content67% 
IMG OID639687543 
Productmulticopper oxidase 
Protein accessionYP_829240 
Protein GI116662185 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2132] Putative multicopper oxidases 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.248277 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACAGCTG TCCCCCGGCA CACCGGCGCC AGCCCCGGCA AGCCCGACAA CCCCGACTCC 
TCTGAAAGAT ACGAAATTAT GCCCACCCCA ATGAATCCCC GTCTCAACCG GCGCAGTTTC
CTCGGCCTCT CGGTATCGGC GGCTGCGGCC GCGGCGCTGG CTGCCTGCAG CAGCACGCCT
CCCTCCGCGG ACGGCCCGCG GAGGATCATG CCCGGCGATC CGGCCATTGC CGAGCTGGAG
GCCGGCCGGG CCTTCACCGG CACCACCGTC AGCCAGAACC TCACCGCGGC ACCGATATCG
GCCTCGGTCG CCGGCAAGAC CGTGTCCACC TGGGGGTACA ACAACGGTCT GGTCGCGCCC
ACCATCCGGG CGAAGGCCGG CGACAGGCTC CGCGTGGCGC TCAAGAACAA CCTTCCGGAC
CCCACCAGCA TCCACTGGCA CGGTCTGGCG CTGCGGAACG ACCAGGACGG CGTACCCCAC
GTCACCCAGG ATGCCGTCGC CTCGAACGCC GACTATTCCT ACGATTTCCG GATCGCCCAC
CCGGGCACCT ACTGGTATCA CTCGCATGTC GAAATGCAGC GCGAACGCGG GCTCTACGGC
GCCCTCATCG TCGACGACCC GGCCGAGAAA CTGGTCTACG ACCGGGATTG GGTCATTGTC
CTTGACGACT GGCTGGACGG CGTCACGGGC ACCCCCGACG AGGTCCTCAA GGAGGTCTCC
GGCGGCATGG GTATGGGCGG CGGCTCCATG CCCGGAATGG ATCATGGTTC GATGGGCGGA
TCCATGGGTT CAACGCCCTC TCCGTCGCCC GGGTCCGGAA CGCACATGTT GATGGGTTCA
CGCAGTGACT TCCTGGGCGG CGACGCCGGC GACGTCCGTT ACCCGTACCA CCTGTTCAAC
GGCAAGGCTC CCGATCAGGC AGAGGAATTC ACGGCCAAGG CCGGCGAGAT TATCCGGCTG
AGGATCATCA ACGCGGCCGG GGACACTGCT TACCGGGTCG GCATCCCCGG CCGGAAGCTC
ACCCTCACCC ACACGGACGG CTTCCCTGTA GTGCACAAGG ACGTCGACGC CGTGGTCCTC
GGCATGGGCG AACGCATTGA TGCCCTGGTC ACGGTGGGCG ACGGCTACAG CCCGGTCCTG
GCCTTGCCCG AGGGCAAAAC CGGCCAGTCG GTCGGTTTCA TCAGCACGGG AACGGGCAAG
CGGCCGGTGC CGGCGACGCT CCCCATCGCA CTCGACGGCG ATGTGGTGGA CGGCGGCCAA
CTGAAGGCGG ACCCGTCGGT CGCCCTGCCG TCAAAGACCC CGGACAGGAC CCACGTTCTC
AAGCTCACGG GCGGGATGAT GAAGTACGAC TGGGGCATCA ACGGCCGGCG GTTCGACATG
AACGACCCCC TCCGCGACGC GTTCGATATT AGGCGCGGCG AGCGGGTTGA AGTGCAGTTC
GTCAACGAGA CCACCATGTG GCACCCCATG CACCTCCACG GCCACACCTT CCAGGTGGGG
CGCGACGGTG CCCGCAAGGA CACGGTAATC GTCAGGGCGG GTCAGACCGT TACGGTGGTT
TTTGATGCGG ACAACCCGGG CCAGTGGCTG ACGCACTGCC ACAACGCCTA CCACGCCGAG
CGCGGCATGA TGGGTGTGAT CTCCTACGTG GCCTGA
 
Protein sequence
MTAVPRHTGA SPGKPDNPDS SERYEIMPTP MNPRLNRRSF LGLSVSAAAA AALAACSSTP 
PSADGPRRIM PGDPAIAELE AGRAFTGTTV SQNLTAAPIS ASVAGKTVST WGYNNGLVAP
TIRAKAGDRL RVALKNNLPD PTSIHWHGLA LRNDQDGVPH VTQDAVASNA DYSYDFRIAH
PGTYWYHSHV EMQRERGLYG ALIVDDPAEK LVYDRDWVIV LDDWLDGVTG TPDEVLKEVS
GGMGMGGGSM PGMDHGSMGG SMGSTPSPSP GSGTHMLMGS RSDFLGGDAG DVRYPYHLFN
GKAPDQAEEF TAKAGEIIRL RIINAAGDTA YRVGIPGRKL TLTHTDGFPV VHKDVDAVVL
GMGERIDALV TVGDGYSPVL ALPEGKTGQS VGFISTGTGK RPVPATLPIA LDGDVVDGGQ
LKADPSVALP SKTPDRTHVL KLTGGMMKYD WGINGRRFDM NDPLRDAFDI RRGERVEVQF
VNETTMWHPM HLHGHTFQVG RDGARKDTVI VRAGQTVTVV FDADNPGQWL THCHNAYHAE
RGMMGVISYV A