Gene Arth_0548 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0548 
Symbol 
ID4446965 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp585511 
End bp588531 
Gene Length3021 bp 
Protein Length1006 aa 
Translation table11 
GC content66% 
IMG OID639688345 
Productprotease domain-containing protein 
Protein accessionYP_830047 
Protein GI116669114 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTTACC CGCCTGTCGG CTCCCGCGAT CACCGCAGAA ATTCAGCCAC CCGCCTCGCC 
GCCGTCGTCG CTTTGGGAAT TTCCCTGATG ATCGGCCAGG GGCTGGCGGC TCCGGCCTGG
GCGGCCGCCG ATCCCGCAGC GGAAGAAACG CCGTCTTTGG ACGCCGGCCG CTACATTGTC
ATGCTGAAGG ACAGGCCCCT TGCGGCCTAC ACCGGTGGCG TCGAGGGCAT TCCGGGCACC
GCCGCGTCCA ACGGCAGGAA GCTCGACGCC GACAGCGCTG AGTCCCGGCG CTACTCGGCA
CACCTGGAGG CAGAGCAGAG CAGGCTGGCC GCGGCGGAAG GCGTGGCCAT CGACGACAGT
TACACGCTGG CAGTGAACGG CTTCAGTGCG GAGCTGACCG CGGAGCAGGC AAACGCATTA
ACGAAGGACG GAAACGTCCT TGCTGTCGTT AAGGACAGCC AGTACAAGAT CGACTACTCG
AGCACCGAGT TCCTGGGCCT GCCGGGTCCC GGGGGAGTCT GGGCCGAACA GTTTGGTGGC
GACGCCAACG CGGGCAAAGG CACGGTGGTG GGCGTACTCG ATACGGGCTA CACGCCCGGT
AACCCGTTCT TTGCCGGGGA ACAGGTGAAG CCGCTGTCGG GAGCTCCCCA TGTGGGCGAA
CCGTACCTGT CGGCCGGAAA CCAGATCACC ATGCTGAAAG CGGACGGAAG CACGTTCGCA
GGCGTCTGTC AGGCGGGTGA CCAATTCGCC GGAACTGAAT GCAACAGCAA GGTCATTGGG
GCCCGGTACT ACGACGCGGC CTTCAAGTCC GCCGTCCCTC CGGGCCTGCG CTCGCCGAAG
GAAACGTACT CGCCCGTGGA CATCAACAAC CACGGTTCCC ATACCGCCAG CACCGCCGCG
GGCAACAGCG ACGTCAGCCA GGCTGTCGGC GGCAGGGACT TCGGCAAGGG ATCGGGTGTT
GCCCCCGCGG CGAAGCTCGC CATCTACAAG GTCTGCTGGG AGGGTGTCAG CCCTGCAACC
ACCGGCTGCT TCGCTTCCAG CGGCGTGGAA GCCATCGAGG ATGCCATCAG GGACGGCGTC
GACGTCCTGA GCTATTCCAT CTCCGGAACC AACAATTCGA CGGTCGACCC GGTGTCCATC
GCCTTCCTGA ATGCCGCGGC GGCAGGAATC TTCGTGGCTG CGTCGGCCGG CAACTCAGGC
CCGGCCGCCA GTACCGTGAA CCATGCTGCT CCCTGGATGA CCAGCGTGGC GGCCTCCACC
CACAGCAGCA GCCTCCGCGG CACGGTGGAG TTGTCCAGCG GCGACAAGTT CGCCGGTGCC
AGCATCATGT CCACTGAAGT CGCGAATGCG CCCATCGCGC TCGCCGCGGC TGTCAAGACG
GCCGACGCCG TCGATGCCAA CGCCGCCCTG TGCGCGCCAG GCACTTTGGA CCCGGCCAAA
ACCGCCGGCA AGATCGTGGT CTGCGACCGG GGCGTGGTGG ACCGGACGGC CAAGAGCATG
ACCGTGGCCC AGGCAGGCGG CGTAGGCATG GTCCTGGTCA ACCTGACGCC CAACTCGCTG
GACGTGGACC TGCACAGCGT TCCCACGGTC CACCTCGACG ACCCCGCCAT CAAGGAGGCG
GTCGGGACTG ATGCGGCGTT GACTGCGAGC CTGGTGGCCA CTGACACCAC GGGACTGGAC
CCGCCGCCGG TCCCGCAGAT CGCGGGTTTC TCCTCCCGCG GGCCCACCCT TGCGGCCAAC
GGTGATCTCC TGAAACCGGA CATTGCAGCT CCGGGGGTAG GCGTGCTGGC GGCTGTCTCG
CCTGCCGGGT CCAACGGCCA GAACTTCGGG TTCCTGTCCG GAACGTCCAT GGCAGCGCCG
CATATTGCGG GATTCGGGGC ATTGTTGCTG GGCAAGAACC CGTTGTGGTC CGCGGCGACG
GTGAAGTCAG CGATGATGAC CACGGCCTAC GATCTCGTGG ACGCGGAAGG ATCCCCGGTC
CATGACGTCT TCGCCCAGGG CGCTGGACAG ATCGATCCGG CCCGGATCGC AACGCCCGGA
CTGGTGTATG ACGCCGGCCC CAGCGACTGG CTCGGCTTCC TGCAGGGCCT GGGCTATCAG
CTGGGAGTTG CCCCGCTGGC TGCCAAGGAC GTCAACCTGC CGTCCATCGC GCTCGGTGGC
CTCACCGGCA CCCAAACCGT CACCCGGACG GTGACGGCGT TGACGGCCGG CAGCTACCGT
GCAGAGGTCG ATGTTTCCGG GATCACGGCT GAGGTGACGC CGGACGTGCT GACCCTGGCA
GAAGGCGAGA AAGCCACGTT CACGGTGCAG TTCACGAACT CCGGTGCAGC CCTCGACGCC
TTTGTTGGCG GGTCACTTAC CTGGAGCTCG GACGAGGCCG TGGTCCGTTC GCCGGTTGCC
ATCCGTTCGG TGACCGCTGT TGCGCCGGCT GTCGTCAATG CCTCGTCCGC CGGCGGAAGC
GGCAGCATCG TCATCCCGGT AACGTCCGGC AGTCCCGAGC CCATCGATGT GACCGTGAAG
GGTCTTGCCA AGGCCAGCTC GACGGCGATC AGCCTGGTTC CCGGGCCCTA CACCGGTGTC
AAGGACGCAT CGAACGACGT TCAGATCGTG AATGTGCCGG CAGGTTCTTC CCTTGCGAGG
TTTGCGGTCA ACTCCGCCAA TCCGGCAGCG GACTTCGATC TTTACGTGGT CTCGCCGGCC
GGCCTGCTTT ATCCAGGGGC CACGCCTGCA GCCAACGAGG CAGTCTCCAT TCCCGACCCG
GTGGCGGGCG ACTGGAAGGT GACCACCAAC CTGTTCGCCA GCCCCGGCGG CGCGGCGACG
GCAGCTTCGG TTGAAGCGAT GGTTCTCGCC GGCGACGCCG GAAACCTCAC GGTCAGTCCG
AACCCGCTCG CCATTGAAAA CGGCGCCACC GGTGAGCTCA CTGCCACCTG GACCGGACTT
GAGGCCGGCA ACTGGGTGGG CTTGATCAAG TACGGTACGG GACCGTCAAC CCAACTGAAC
GTGGCCGTGA CAGCGCCTTG A
 
Protein sequence
MTYPPVGSRD HRRNSATRLA AVVALGISLM IGQGLAAPAW AAADPAAEET PSLDAGRYIV 
MLKDRPLAAY TGGVEGIPGT AASNGRKLDA DSAESRRYSA HLEAEQSRLA AAEGVAIDDS
YTLAVNGFSA ELTAEQANAL TKDGNVLAVV KDSQYKIDYS STEFLGLPGP GGVWAEQFGG
DANAGKGTVV GVLDTGYTPG NPFFAGEQVK PLSGAPHVGE PYLSAGNQIT MLKADGSTFA
GVCQAGDQFA GTECNSKVIG ARYYDAAFKS AVPPGLRSPK ETYSPVDINN HGSHTASTAA
GNSDVSQAVG GRDFGKGSGV APAAKLAIYK VCWEGVSPAT TGCFASSGVE AIEDAIRDGV
DVLSYSISGT NNSTVDPVSI AFLNAAAAGI FVAASAGNSG PAASTVNHAA PWMTSVAAST
HSSSLRGTVE LSSGDKFAGA SIMSTEVANA PIALAAAVKT ADAVDANAAL CAPGTLDPAK
TAGKIVVCDR GVVDRTAKSM TVAQAGGVGM VLVNLTPNSL DVDLHSVPTV HLDDPAIKEA
VGTDAALTAS LVATDTTGLD PPPVPQIAGF SSRGPTLAAN GDLLKPDIAA PGVGVLAAVS
PAGSNGQNFG FLSGTSMAAP HIAGFGALLL GKNPLWSAAT VKSAMMTTAY DLVDAEGSPV
HDVFAQGAGQ IDPARIATPG LVYDAGPSDW LGFLQGLGYQ LGVAPLAAKD VNLPSIALGG
LTGTQTVTRT VTALTAGSYR AEVDVSGITA EVTPDVLTLA EGEKATFTVQ FTNSGAALDA
FVGGSLTWSS DEAVVRSPVA IRSVTAVAPA VVNASSAGGS GSIVIPVTSG SPEPIDVTVK
GLAKASSTAI SLVPGPYTGV KDASNDVQIV NVPAGSSLAR FAVNSANPAA DFDLYVVSPA
GLLYPGATPA ANEAVSIPDP VAGDWKVTTN LFASPGGAAT AASVEAMVLA GDAGNLTVSP
NPLAIENGAT GELTATWTGL EAGNWVGLIK YGTGPSTQLN VAVTAP