Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0548 |
Symbol | |
ID | 4446965 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 585511 |
End bp | 588531 |
Gene Length | 3021 bp |
Protein Length | 1006 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639688345 |
Product | protease domain-containing protein |
Protein accession | YP_830047 |
Protein GI | 116669114 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTTACC CGCCTGTCGG CTCCCGCGAT CACCGCAGAA ATTCAGCCAC CCGCCTCGCC GCCGTCGTCG CTTTGGGAAT TTCCCTGATG ATCGGCCAGG GGCTGGCGGC TCCGGCCTGG GCGGCCGCCG ATCCCGCAGC GGAAGAAACG CCGTCTTTGG ACGCCGGCCG CTACATTGTC ATGCTGAAGG ACAGGCCCCT TGCGGCCTAC ACCGGTGGCG TCGAGGGCAT TCCGGGCACC GCCGCGTCCA ACGGCAGGAA GCTCGACGCC GACAGCGCTG AGTCCCGGCG CTACTCGGCA CACCTGGAGG CAGAGCAGAG CAGGCTGGCC GCGGCGGAAG GCGTGGCCAT CGACGACAGT TACACGCTGG CAGTGAACGG CTTCAGTGCG GAGCTGACCG CGGAGCAGGC AAACGCATTA ACGAAGGACG GAAACGTCCT TGCTGTCGTT AAGGACAGCC AGTACAAGAT CGACTACTCG AGCACCGAGT TCCTGGGCCT GCCGGGTCCC GGGGGAGTCT GGGCCGAACA GTTTGGTGGC GACGCCAACG CGGGCAAAGG CACGGTGGTG GGCGTACTCG ATACGGGCTA CACGCCCGGT AACCCGTTCT TTGCCGGGGA ACAGGTGAAG CCGCTGTCGG GAGCTCCCCA TGTGGGCGAA CCGTACCTGT CGGCCGGAAA CCAGATCACC ATGCTGAAAG CGGACGGAAG CACGTTCGCA GGCGTCTGTC AGGCGGGTGA CCAATTCGCC GGAACTGAAT GCAACAGCAA GGTCATTGGG GCCCGGTACT ACGACGCGGC CTTCAAGTCC GCCGTCCCTC CGGGCCTGCG CTCGCCGAAG GAAACGTACT CGCCCGTGGA CATCAACAAC CACGGTTCCC ATACCGCCAG CACCGCCGCG GGCAACAGCG ACGTCAGCCA GGCTGTCGGC GGCAGGGACT TCGGCAAGGG ATCGGGTGTT GCCCCCGCGG CGAAGCTCGC CATCTACAAG GTCTGCTGGG AGGGTGTCAG CCCTGCAACC ACCGGCTGCT TCGCTTCCAG CGGCGTGGAA GCCATCGAGG ATGCCATCAG GGACGGCGTC GACGTCCTGA GCTATTCCAT CTCCGGAACC AACAATTCGA CGGTCGACCC GGTGTCCATC GCCTTCCTGA ATGCCGCGGC GGCAGGAATC TTCGTGGCTG CGTCGGCCGG CAACTCAGGC CCGGCCGCCA GTACCGTGAA CCATGCTGCT CCCTGGATGA CCAGCGTGGC GGCCTCCACC CACAGCAGCA GCCTCCGCGG CACGGTGGAG TTGTCCAGCG GCGACAAGTT CGCCGGTGCC AGCATCATGT CCACTGAAGT CGCGAATGCG CCCATCGCGC TCGCCGCGGC TGTCAAGACG GCCGACGCCG TCGATGCCAA CGCCGCCCTG TGCGCGCCAG GCACTTTGGA CCCGGCCAAA ACCGCCGGCA AGATCGTGGT CTGCGACCGG GGCGTGGTGG ACCGGACGGC CAAGAGCATG ACCGTGGCCC AGGCAGGCGG CGTAGGCATG GTCCTGGTCA ACCTGACGCC CAACTCGCTG GACGTGGACC TGCACAGCGT TCCCACGGTC CACCTCGACG ACCCCGCCAT CAAGGAGGCG GTCGGGACTG ATGCGGCGTT GACTGCGAGC CTGGTGGCCA CTGACACCAC GGGACTGGAC CCGCCGCCGG TCCCGCAGAT CGCGGGTTTC TCCTCCCGCG GGCCCACCCT TGCGGCCAAC GGTGATCTCC TGAAACCGGA CATTGCAGCT CCGGGGGTAG GCGTGCTGGC GGCTGTCTCG CCTGCCGGGT CCAACGGCCA GAACTTCGGG TTCCTGTCCG GAACGTCCAT GGCAGCGCCG CATATTGCGG GATTCGGGGC ATTGTTGCTG GGCAAGAACC CGTTGTGGTC CGCGGCGACG GTGAAGTCAG CGATGATGAC CACGGCCTAC GATCTCGTGG ACGCGGAAGG ATCCCCGGTC CATGACGTCT TCGCCCAGGG CGCTGGACAG ATCGATCCGG CCCGGATCGC AACGCCCGGA CTGGTGTATG ACGCCGGCCC CAGCGACTGG CTCGGCTTCC TGCAGGGCCT GGGCTATCAG CTGGGAGTTG CCCCGCTGGC TGCCAAGGAC GTCAACCTGC CGTCCATCGC GCTCGGTGGC CTCACCGGCA CCCAAACCGT CACCCGGACG GTGACGGCGT TGACGGCCGG CAGCTACCGT GCAGAGGTCG ATGTTTCCGG GATCACGGCT GAGGTGACGC CGGACGTGCT GACCCTGGCA GAAGGCGAGA AAGCCACGTT CACGGTGCAG TTCACGAACT CCGGTGCAGC CCTCGACGCC TTTGTTGGCG GGTCACTTAC CTGGAGCTCG GACGAGGCCG TGGTCCGTTC GCCGGTTGCC ATCCGTTCGG TGACCGCTGT TGCGCCGGCT GTCGTCAATG CCTCGTCCGC CGGCGGAAGC GGCAGCATCG TCATCCCGGT AACGTCCGGC AGTCCCGAGC CCATCGATGT GACCGTGAAG GGTCTTGCCA AGGCCAGCTC GACGGCGATC AGCCTGGTTC CCGGGCCCTA CACCGGTGTC AAGGACGCAT CGAACGACGT TCAGATCGTG AATGTGCCGG CAGGTTCTTC CCTTGCGAGG TTTGCGGTCA ACTCCGCCAA TCCGGCAGCG GACTTCGATC TTTACGTGGT CTCGCCGGCC GGCCTGCTTT ATCCAGGGGC CACGCCTGCA GCCAACGAGG CAGTCTCCAT TCCCGACCCG GTGGCGGGCG ACTGGAAGGT GACCACCAAC CTGTTCGCCA GCCCCGGCGG CGCGGCGACG GCAGCTTCGG TTGAAGCGAT GGTTCTCGCC GGCGACGCCG GAAACCTCAC GGTCAGTCCG AACCCGCTCG CCATTGAAAA CGGCGCCACC GGTGAGCTCA CTGCCACCTG GACCGGACTT GAGGCCGGCA ACTGGGTGGG CTTGATCAAG TACGGTACGG GACCGTCAAC CCAACTGAAC GTGGCCGTGA CAGCGCCTTG A
|
Protein sequence | MTYPPVGSRD HRRNSATRLA AVVALGISLM IGQGLAAPAW AAADPAAEET PSLDAGRYIV MLKDRPLAAY TGGVEGIPGT AASNGRKLDA DSAESRRYSA HLEAEQSRLA AAEGVAIDDS YTLAVNGFSA ELTAEQANAL TKDGNVLAVV KDSQYKIDYS STEFLGLPGP GGVWAEQFGG DANAGKGTVV GVLDTGYTPG NPFFAGEQVK PLSGAPHVGE PYLSAGNQIT MLKADGSTFA GVCQAGDQFA GTECNSKVIG ARYYDAAFKS AVPPGLRSPK ETYSPVDINN HGSHTASTAA GNSDVSQAVG GRDFGKGSGV APAAKLAIYK VCWEGVSPAT TGCFASSGVE AIEDAIRDGV DVLSYSISGT NNSTVDPVSI AFLNAAAAGI FVAASAGNSG PAASTVNHAA PWMTSVAAST HSSSLRGTVE LSSGDKFAGA SIMSTEVANA PIALAAAVKT ADAVDANAAL CAPGTLDPAK TAGKIVVCDR GVVDRTAKSM TVAQAGGVGM VLVNLTPNSL DVDLHSVPTV HLDDPAIKEA VGTDAALTAS LVATDTTGLD PPPVPQIAGF SSRGPTLAAN GDLLKPDIAA PGVGVLAAVS PAGSNGQNFG FLSGTSMAAP HIAGFGALLL GKNPLWSAAT VKSAMMTTAY DLVDAEGSPV HDVFAQGAGQ IDPARIATPG LVYDAGPSDW LGFLQGLGYQ LGVAPLAAKD VNLPSIALGG LTGTQTVTRT VTALTAGSYR AEVDVSGITA EVTPDVLTLA EGEKATFTVQ FTNSGAALDA FVGGSLTWSS DEAVVRSPVA IRSVTAVAPA VVNASSAGGS GSIVIPVTSG SPEPIDVTVK GLAKASSTAI SLVPGPYTGV KDASNDVQIV NVPAGSSLAR FAVNSANPAA DFDLYVVSPA GLLYPGATPA ANEAVSIPDP VAGDWKVTTN LFASPGGAAT AASVEAMVLA GDAGNLTVSP NPLAIENGAT GELTATWTGL EAGNWVGLIK YGTGPSTQLN VAVTAP
|
| |