Gene Arth_0544 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0544 
Symbol 
ID4446961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp580071 
End bp583208 
Gene Length3138 bp 
Protein Length1045 aa 
Translation table11 
GC content66% 
IMG OID639688341 
Productprotease domain-containing protein 
Protein accessionYP_830043 
Protein GI116669110 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAATCAA AAGGAACAAG CCTTGTCCGG GTCGGGGGAC TCCGGAAGGC GGCAGCACTG 
GCTGTTGGCC TCCCGCTGCT TCTCTCCTCC GTCGCAGTGG GTCCCGCTAC CGCGGCGCCT
GCCGAACCCA GCGGAGTCGA ACAGGCCAAA AACGTCAACC CGACAGACTA CAAGGACGGC
CGTTACATCG TGGTCCTCGC CGAGAAAGCC GCGGCAGCGT ACGACGGCGG AACGGCCGGC
ATAGCGGCAA CCAAGCCGCA ACAGGGCCGC AAACTTGATT CCGGCAGCCA GAACTACAAG
GCCTACGACG CGCACCTCCG CAAGACCCAG CGGGAGGTCG CGTCCAAGCA GGGCGTCACC
CCGTCCAAGC AGTTCACAGG CGCGCTCAAC GGCTTCGTGG CTGAGCTCAC GGCGGTGCAG
GCAGCCGAGC TTGCCAAGGA CAGCAACGTG CTTGTGGTGG CACCGGACGT GGAGAACGCC
CCGGATTACA CCACCACCGA CTTCCTGAAG CTCACAGGCA GCGACGGCGC CTGGGCAAAG
CAGTTCGGCG GCGAATCCGG AGCCGGCAAG GGCGTGGTGG TCGGTGTGAT CGACTCCGGC
TACGCCCCGG ACAATCCCTT CCTGCAGGGG GATGCAGTGC AGCCGCTCAA GGGCAAGGCA
CAGGTGGGCG TCCCGTATCT CACCGCGGAC GGCAAGATCG CCATGCTCAA GTCGGACGGC
ACGACTTTCC AGGGCGAATG CCAGAAGGGC ATCGAATCAG GGGCATCCTT CGACGGAACC
CTCTGCAACT CCAAGGTGGT CAGCGCCCGC TACTTCGCCG ACTCCTTCCG TCAGTACGTG
ACACCCGAAC ACCGAGCCCC GGAGGAACTG ATCTCCCCGG TGGACGTGGG CAGCCATGGC
ACACACACGG CCACCACCGC CGCCGGCAAC GCCAACGTGG AACAGGTGAT CGACGGCGCC
AGCTTCGGCA AAAGCTCCGG CGTCGCCCCG GCAGCGAAAG TCTCCGTCTA CAAGGTCTGC
TGGGAAGACG ACAATCCCAA CACCGGCGGC TGCTATTCGT CCGCCTCGGT CGAAGCCGTT
GACGCAGCCA TTAAGGACGG CGTGGACGTC CTTAACTACT CCATTTCCGG CAACACCAAC
AGCACGACTG ATCCCGTGGC CCTGGCGTTC CTGAATGCGG CTGCCGCAGG CGTGTTCGTC
TCGGCATCGG CAGGCAACTC CGGCCCCGCC GTCTCCACGG TGAACCACGC CTCGCCGTGG
CTGACCACCG TCGCGGCCTC GACATTCCCC AGCGACCTGC TCGGCACGGT GAAGGTGTCC
GACGGCAGCA TGTTCCGGGG CGCGTCCATC ATGAAATCGG AAGTCAAGGA CGCCGCCGTG
GTGGTGGCCG CCGACGCAGC TGCACCGGAT GCGCCTACTG CCGCGAACCT TTGCGGCCCG
GGAACGCTGG ACGCCGCCAA GGTCACCGGC AAGGTGGTGG TCTGTGACCG CGGTGTCGTG
GACCGGACGG CGAAGAGCCT TGAGGTCCAG GCGAAGGGCG GCGTTGGCAT GATCCTGGTC
AACCTGACCT CCAGCTCCGA GGACGCGGAC AACCATGTCG TGCCCACCGT CCACGTCAAT
GCCCCCAAGA GCCTGGAACT GAAGTCCAAG CTTTCTGCCA AGCCGGCGCT CACGGTGAGC
CTCGTCAAGG GCGATCTCAC CGGCGAGCCA TTGCCGCCGG CACCCCAAGT GGCCGGCTTC
TCCTCTCGGG GCCCGAGCCT GGCCTCCGGC GGGGACCTCC TGAAGCCGGA CATCTCCGCT
CCCGGCGTGA ATGTCCTCGC GGGCGTTTCC ACCATCGGCA ACAACGGCGC ACAGTTCGGC
TTCATGTCCG GAACCTCCAT GGCTGCCCCG CATATTGCGG GATTCGGAGC CCTGGTGCTC
AGCAAGCAGC CCACCTGGTC GCCTGCCATG GTCAAGTCCG CCATGATGAC CACCGCCTAC
CCGCTGGTCA ACGCGGACGG CACTCCCAAC CGGAACCCGT TTGAAGGCGG CGCAGGGCAC
ATTGACGCCA CCCGCGTGCT GGATCCGGGC CTCGTGTACA ACTCGGACAT CAAGCAGTGG
CTGGCCTTCC TGAACGGCCA GGGCGTGGAG ACCGGTGCAC CACAGGCCGG CAGCATTGCC
GCACGCAACC TCAACCTGCC GTCCATTGCC CTGGGCAGCC TCGTGGGCGA GATCCAGGTC
AAGCGCCAGC TCACTGCGCT GGTCCCGGGG AGCTACCGGC CCGCTGTGGA CATGCCCGGT
GTCGGTGTCC ATGTGGAGCC GCAGGTGCTG AACTTCGCCA AGGCGGGGCA GACCCGTGAG
GTCACCATCA CCATCAAGAA CGTCAGCGCA CCGGTGGGCA AGTTCACTAC GGGCACACTG
ACGTGGAAGG GGCCGCGCAC GGTCAGCTCG CCCATTGCGG TCCGTCCGGT GGACGCCCAG
ATTGCGCCGT CGTTCTCCTT CAGCTCCGCG ACCGGCACGG GCAGCGGCAC CCTGGACCTG
GTCTCCGGCT CGGATTCGCC CATCGCGGTG GGTGTTGAAG GGCTTGCACC GCTGTCCGAG
ACGGCCATCA CCAAAACGCC CGGCGCTTAC GCTGCAAAAA ATGACGAACA CAACGCACTC
GTCAAGGTGG AGGTGCCCGC CGGTGCGACT TTCGCGCGGC TGGGCGTCCA GGCCGAATCG
GACGACGTTG ACTGGGACAT GGTGGTTTAC GCGCCCAACG GAAGCGGAGG CCTCGTGGCC
ACCCAGGTGG CAACGGCGTC GACCAGCGAG TTCCTGGACC TGGAATCGCC CCGTGCGGGC
ACCTACTACA TCGTGGCGAA CCTCTACTCC ACCCCGGACA ACGGGCCCGC GTCCGCGGTC
GTCCAGACGG TCTCGTTCCC CGGAAAGCCG GAGACGAAGC TCGCCGTCAA CCCGAATCCG
ATCATTGCCC CCAACGGTAC GGCCACCACG GCGACAGCCA GCTGGACCGG CCTGGCACCG
GGGTCCTACC TGGGCCGGCT GAGCCTGGGC GGAAACGGGA TCAGGACGTG GATCAGCGTC
AAGGTGGGCA CCGGCACGGC GGCTGCCCCG GCCGGTGCCC CGGCGGTCAC CCCCGTCGAT
GCCGTACCGG GGACCTAG
 
Protein sequence
MKSKGTSLVR VGGLRKAAAL AVGLPLLLSS VAVGPATAAP AEPSGVEQAK NVNPTDYKDG 
RYIVVLAEKA AAAYDGGTAG IAATKPQQGR KLDSGSQNYK AYDAHLRKTQ REVASKQGVT
PSKQFTGALN GFVAELTAVQ AAELAKDSNV LVVAPDVENA PDYTTTDFLK LTGSDGAWAK
QFGGESGAGK GVVVGVIDSG YAPDNPFLQG DAVQPLKGKA QVGVPYLTAD GKIAMLKSDG
TTFQGECQKG IESGASFDGT LCNSKVVSAR YFADSFRQYV TPEHRAPEEL ISPVDVGSHG
THTATTAAGN ANVEQVIDGA SFGKSSGVAP AAKVSVYKVC WEDDNPNTGG CYSSASVEAV
DAAIKDGVDV LNYSISGNTN STTDPVALAF LNAAAAGVFV SASAGNSGPA VSTVNHASPW
LTTVAASTFP SDLLGTVKVS DGSMFRGASI MKSEVKDAAV VVAADAAAPD APTAANLCGP
GTLDAAKVTG KVVVCDRGVV DRTAKSLEVQ AKGGVGMILV NLTSSSEDAD NHVVPTVHVN
APKSLELKSK LSAKPALTVS LVKGDLTGEP LPPAPQVAGF SSRGPSLASG GDLLKPDISA
PGVNVLAGVS TIGNNGAQFG FMSGTSMAAP HIAGFGALVL SKQPTWSPAM VKSAMMTTAY
PLVNADGTPN RNPFEGGAGH IDATRVLDPG LVYNSDIKQW LAFLNGQGVE TGAPQAGSIA
ARNLNLPSIA LGSLVGEIQV KRQLTALVPG SYRPAVDMPG VGVHVEPQVL NFAKAGQTRE
VTITIKNVSA PVGKFTTGTL TWKGPRTVSS PIAVRPVDAQ IAPSFSFSSA TGTGSGTLDL
VSGSDSPIAV GVEGLAPLSE TAITKTPGAY AAKNDEHNAL VKVEVPAGAT FARLGVQAES
DDVDWDMVVY APNGSGGLVA TQVATASTSE FLDLESPRAG TYYIVANLYS TPDNGPASAV
VQTVSFPGKP ETKLAVNPNP IIAPNGTATT ATASWTGLAP GSYLGRLSLG GNGIRTWISV
KVGTGTAAAP AGAPAVTPVD AVPGT