Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0544 |
Symbol | |
ID | 4446961 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 580071 |
End bp | 583208 |
Gene Length | 3138 bp |
Protein Length | 1045 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639688341 |
Product | protease domain-containing protein |
Protein accession | YP_830043 |
Protein GI | 116669110 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAATCAA AAGGAACAAG CCTTGTCCGG GTCGGGGGAC TCCGGAAGGC GGCAGCACTG GCTGTTGGCC TCCCGCTGCT TCTCTCCTCC GTCGCAGTGG GTCCCGCTAC CGCGGCGCCT GCCGAACCCA GCGGAGTCGA ACAGGCCAAA AACGTCAACC CGACAGACTA CAAGGACGGC CGTTACATCG TGGTCCTCGC CGAGAAAGCC GCGGCAGCGT ACGACGGCGG AACGGCCGGC ATAGCGGCAA CCAAGCCGCA ACAGGGCCGC AAACTTGATT CCGGCAGCCA GAACTACAAG GCCTACGACG CGCACCTCCG CAAGACCCAG CGGGAGGTCG CGTCCAAGCA GGGCGTCACC CCGTCCAAGC AGTTCACAGG CGCGCTCAAC GGCTTCGTGG CTGAGCTCAC GGCGGTGCAG GCAGCCGAGC TTGCCAAGGA CAGCAACGTG CTTGTGGTGG CACCGGACGT GGAGAACGCC CCGGATTACA CCACCACCGA CTTCCTGAAG CTCACAGGCA GCGACGGCGC CTGGGCAAAG CAGTTCGGCG GCGAATCCGG AGCCGGCAAG GGCGTGGTGG TCGGTGTGAT CGACTCCGGC TACGCCCCGG ACAATCCCTT CCTGCAGGGG GATGCAGTGC AGCCGCTCAA GGGCAAGGCA CAGGTGGGCG TCCCGTATCT CACCGCGGAC GGCAAGATCG CCATGCTCAA GTCGGACGGC ACGACTTTCC AGGGCGAATG CCAGAAGGGC ATCGAATCAG GGGCATCCTT CGACGGAACC CTCTGCAACT CCAAGGTGGT CAGCGCCCGC TACTTCGCCG ACTCCTTCCG TCAGTACGTG ACACCCGAAC ACCGAGCCCC GGAGGAACTG ATCTCCCCGG TGGACGTGGG CAGCCATGGC ACACACACGG CCACCACCGC CGCCGGCAAC GCCAACGTGG AACAGGTGAT CGACGGCGCC AGCTTCGGCA AAAGCTCCGG CGTCGCCCCG GCAGCGAAAG TCTCCGTCTA CAAGGTCTGC TGGGAAGACG ACAATCCCAA CACCGGCGGC TGCTATTCGT CCGCCTCGGT CGAAGCCGTT GACGCAGCCA TTAAGGACGG CGTGGACGTC CTTAACTACT CCATTTCCGG CAACACCAAC AGCACGACTG ATCCCGTGGC CCTGGCGTTC CTGAATGCGG CTGCCGCAGG CGTGTTCGTC TCGGCATCGG CAGGCAACTC CGGCCCCGCC GTCTCCACGG TGAACCACGC CTCGCCGTGG CTGACCACCG TCGCGGCCTC GACATTCCCC AGCGACCTGC TCGGCACGGT GAAGGTGTCC GACGGCAGCA TGTTCCGGGG CGCGTCCATC ATGAAATCGG AAGTCAAGGA CGCCGCCGTG GTGGTGGCCG CCGACGCAGC TGCACCGGAT GCGCCTACTG CCGCGAACCT TTGCGGCCCG GGAACGCTGG ACGCCGCCAA GGTCACCGGC AAGGTGGTGG TCTGTGACCG CGGTGTCGTG GACCGGACGG CGAAGAGCCT TGAGGTCCAG GCGAAGGGCG GCGTTGGCAT GATCCTGGTC AACCTGACCT CCAGCTCCGA GGACGCGGAC AACCATGTCG TGCCCACCGT CCACGTCAAT GCCCCCAAGA GCCTGGAACT GAAGTCCAAG CTTTCTGCCA AGCCGGCGCT CACGGTGAGC CTCGTCAAGG GCGATCTCAC CGGCGAGCCA TTGCCGCCGG CACCCCAAGT GGCCGGCTTC TCCTCTCGGG GCCCGAGCCT GGCCTCCGGC GGGGACCTCC TGAAGCCGGA CATCTCCGCT CCCGGCGTGA ATGTCCTCGC GGGCGTTTCC ACCATCGGCA ACAACGGCGC ACAGTTCGGC TTCATGTCCG GAACCTCCAT GGCTGCCCCG CATATTGCGG GATTCGGAGC CCTGGTGCTC AGCAAGCAGC CCACCTGGTC GCCTGCCATG GTCAAGTCCG CCATGATGAC CACCGCCTAC CCGCTGGTCA ACGCGGACGG CACTCCCAAC CGGAACCCGT TTGAAGGCGG CGCAGGGCAC ATTGACGCCA CCCGCGTGCT GGATCCGGGC CTCGTGTACA ACTCGGACAT CAAGCAGTGG CTGGCCTTCC TGAACGGCCA GGGCGTGGAG ACCGGTGCAC CACAGGCCGG CAGCATTGCC GCACGCAACC TCAACCTGCC GTCCATTGCC CTGGGCAGCC TCGTGGGCGA GATCCAGGTC AAGCGCCAGC TCACTGCGCT GGTCCCGGGG AGCTACCGGC CCGCTGTGGA CATGCCCGGT GTCGGTGTCC ATGTGGAGCC GCAGGTGCTG AACTTCGCCA AGGCGGGGCA GACCCGTGAG GTCACCATCA CCATCAAGAA CGTCAGCGCA CCGGTGGGCA AGTTCACTAC GGGCACACTG ACGTGGAAGG GGCCGCGCAC GGTCAGCTCG CCCATTGCGG TCCGTCCGGT GGACGCCCAG ATTGCGCCGT CGTTCTCCTT CAGCTCCGCG ACCGGCACGG GCAGCGGCAC CCTGGACCTG GTCTCCGGCT CGGATTCGCC CATCGCGGTG GGTGTTGAAG GGCTTGCACC GCTGTCCGAG ACGGCCATCA CCAAAACGCC CGGCGCTTAC GCTGCAAAAA ATGACGAACA CAACGCACTC GTCAAGGTGG AGGTGCCCGC CGGTGCGACT TTCGCGCGGC TGGGCGTCCA GGCCGAATCG GACGACGTTG ACTGGGACAT GGTGGTTTAC GCGCCCAACG GAAGCGGAGG CCTCGTGGCC ACCCAGGTGG CAACGGCGTC GACCAGCGAG TTCCTGGACC TGGAATCGCC CCGTGCGGGC ACCTACTACA TCGTGGCGAA CCTCTACTCC ACCCCGGACA ACGGGCCCGC GTCCGCGGTC GTCCAGACGG TCTCGTTCCC CGGAAAGCCG GAGACGAAGC TCGCCGTCAA CCCGAATCCG ATCATTGCCC CCAACGGTAC GGCCACCACG GCGACAGCCA GCTGGACCGG CCTGGCACCG GGGTCCTACC TGGGCCGGCT GAGCCTGGGC GGAAACGGGA TCAGGACGTG GATCAGCGTC AAGGTGGGCA CCGGCACGGC GGCTGCCCCG GCCGGTGCCC CGGCGGTCAC CCCCGTCGAT GCCGTACCGG GGACCTAG
|
Protein sequence | MKSKGTSLVR VGGLRKAAAL AVGLPLLLSS VAVGPATAAP AEPSGVEQAK NVNPTDYKDG RYIVVLAEKA AAAYDGGTAG IAATKPQQGR KLDSGSQNYK AYDAHLRKTQ REVASKQGVT PSKQFTGALN GFVAELTAVQ AAELAKDSNV LVVAPDVENA PDYTTTDFLK LTGSDGAWAK QFGGESGAGK GVVVGVIDSG YAPDNPFLQG DAVQPLKGKA QVGVPYLTAD GKIAMLKSDG TTFQGECQKG IESGASFDGT LCNSKVVSAR YFADSFRQYV TPEHRAPEEL ISPVDVGSHG THTATTAAGN ANVEQVIDGA SFGKSSGVAP AAKVSVYKVC WEDDNPNTGG CYSSASVEAV DAAIKDGVDV LNYSISGNTN STTDPVALAF LNAAAAGVFV SASAGNSGPA VSTVNHASPW LTTVAASTFP SDLLGTVKVS DGSMFRGASI MKSEVKDAAV VVAADAAAPD APTAANLCGP GTLDAAKVTG KVVVCDRGVV DRTAKSLEVQ AKGGVGMILV NLTSSSEDAD NHVVPTVHVN APKSLELKSK LSAKPALTVS LVKGDLTGEP LPPAPQVAGF SSRGPSLASG GDLLKPDISA PGVNVLAGVS TIGNNGAQFG FMSGTSMAAP HIAGFGALVL SKQPTWSPAM VKSAMMTTAY PLVNADGTPN RNPFEGGAGH IDATRVLDPG LVYNSDIKQW LAFLNGQGVE TGAPQAGSIA ARNLNLPSIA LGSLVGEIQV KRQLTALVPG SYRPAVDMPG VGVHVEPQVL NFAKAGQTRE VTITIKNVSA PVGKFTTGTL TWKGPRTVSS PIAVRPVDAQ IAPSFSFSSA TGTGSGTLDL VSGSDSPIAV GVEGLAPLSE TAITKTPGAY AAKNDEHNAL VKVEVPAGAT FARLGVQAES DDVDWDMVVY APNGSGGLVA TQVATASTSE FLDLESPRAG TYYIVANLYS TPDNGPASAV VQTVSFPGKP ETKLAVNPNP IIAPNGTATT ATASWTGLAP GSYLGRLSLG GNGIRTWISV KVGTGTAAAP AGAPAVTPVD AVPGT
|
| |