Gene Arth_0151 details


Gene Information

Locus tag: Arth_0151
Symbol:
ID: 4447379
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 155456
End bp: 157525
Gene Length: 2070 bp
Protein Length: 689 aa
Translation table: 11
GC content: 66%
IMG OID: 639687946
Product: Mername-AA223 peptidase
Protein accession: YP_829652
Protein GI: 116668719
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0465] ATP-dependent Zn proteases
TIGRFAM ID: [TIGR01241] ATP-dependent metalloprotease FtsH
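
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
start_bp = 155456
end_bp = 157525

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon
# is a stop codon, which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under those assumptions the computed values agree with the record (2070 bp and 689 aa).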


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGCTA AGAGTTTCTT CAAGGGCCCG GGCATCTGGA TCGTTGTTGT GGTCGGAATG 
CTCCTGCTGG CCTTTGCCAC GCTCGCCCCC GGAGGCGCAA CCCGGATCGA CACAAAACCG
GGCCTCGAGC TCCTTGCGGA CAAGGGCAAG GTGGAGCAGG CCAAGATCTT TGACGCGGAG
AACCGCGTAG ACCTTGTCCT CAAGGACAAC CTGGTCATCG ACGGTCAGGA CAAGGGCAAG
AACGTCCAGT TCTTCTTTGT CAACGCCCGG GCCGCGGACG TGGTCAAGGC CGTCACCGAC
GCCGACCCCT CGAGCGGCTA CACCGACCAG CCCATCGAAA ACAACTGGTT CTCCGGGCTG
TTCTCCCTCC TGATTCCCGT GCTGCTGCTG GGCGTCCTCT TCTGGTTCCT GCTCTCCCGC
ATGCAGGGCG GCGGGTCCAA GATCATGCAG TTCGGCAAGT CCAAGGCCAA GCTGGTCAAC
AAGGACATGC CCCAGGTGAC GTTCAGCGAC GTCGCCGGCG CTGACGAGGC TGTGGAGGAA
CTCCAGGAAA TCAAGGAATT CCTCCAGGAA CCGGCCAAAT TCCAGGCCGT GGGGGCCAAG
ATCCCCAAGG GCGTGCTGCT CTACGGCCCT CCGGGTACCG GTAAGACCCT GCTGGCCCGC
GCCGTAGCCG GTGAAGCGGG AGTGCCGTTC TTCTCCATCT CGGGCTCCGA CTTCGTTGAA
ATGTTCGTCG GCGTGGGTGC GTCCCGTGTC CGCGACCTCT TCGAGCAGGC CAAAGCCAGC
TCACCCGCCA TCATCTTCGT GGACGAAATC GACGCCGTTG GCCGTCACCG CGGCGCCGGC
ATCGGCGGCG GCAACGACGA ACGCGAGCAG ACCCTCAACC AGCTGCTGGT TGAAATGGAC
GGCTTCGACG TCAAGACCAA CGTCATCCTG ATCGCCGCCA CCAACCGCCC CGACGTGCTG
GACCCGGCCC TGCTGCGCCC CGGCCGCTTC GACCGCCAGA TCACCGTTGA AGCCCCGGAC
CTGGTAGGCC GCGACCAGAT CCTGCAGGTC CACGCGAAGG GCAAGCCGAT GGCTCCCGGG
GTCGACCTCA AGGCCGTTGC CAAGAAGACC CCCGGCTACA CCGGCGCCGA CCTCGCCAAC
GTCCTCAATG AAGCTGCCCT GCTGACGGCG CGCTCCAATG CGAACCTGAT CGACGACCGC
GCCCTGGACG AGGCCATCGA CCGCGTCATG GCCGGCCCGC AGAAGCGCAG CCGCGTCATG
AAGGAGCACG AGCGCAAGAT CACGGCCTAC CACGAAGGCG GACACGCCCT GGTGGCTGCC
GCACTGCGGA ACTCGGCACC GGTCACCAAG ATCACCATCC TGCCCCGCGG CCGCGCCCTG
GGCTACACCA TGGTGGTTCC GGAGAACGAC AAGTATTCCG TCACCCGCAA CGAACTGCTG
GACCAGATGG CGTACGCCAT GGGCGGCCGC GTGGCCGAGG AAATCGTCTT CCACGATCCG
TCCACCGGCG CCTCCAACGA CATCGAGAAG GCCACGGCGA CGGCCCGCAA GATGGTCACG
GAATTCGGCA TGAGTGAACG AGTCGGCGCA GTACGCCTCG GCCAGGGCGG CGGCGAGCCG
TTCCTGGGCC GCGATGCCGG CCACGAGCGC AACTACTCCG ACCAGATCGC CTACATCGTG
GATGAGGAAG TGCGCCGGTT GATCGACCAG GCCCACGACG AGGCCTACGC GATCCTCACC
GAGAACCGCG ACATCCTCGA CTCCCTGGCC CTGGAACTGC TGGAGCGGGA GACCCTCAAC
CAGGCCGAGA TTGCGTACGT CTTCCGGGAC ATCCGCAAGC GCGACTTCCG CGAGGTGTGG
CTCTCCAAGG AAACCCGCCC GGTGCAGAGC GCCGGCCCCG TCGAGTCCCG GCACGAGCGG
GCCGAACGTG AAGCCCAGGA AGAAGCCAAG GAAGCCCGCC TGGAGGAGCC GCTGGACGCT
CGGCCGCCGC ACCCGCAGGG CGTGGCAGGG CAGGAGACCT TCGGCGGCGG TGTTACGGAC
GTCAGCACCG ACGGACCGCA GCACGGCTAA
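
The gene sequence is laid out in space-separated blocks of 10 bases, so it has to be joined before any per-base statistic such as GC content is computed. The following is a minimal sketch; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first row of the sequence is used as input, so the result is the GC fraction of those 60 bases and not the 66% reported for the full 2070 bp gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence above (60 bases), used only as sample input.
first_row = "ATGAAAGCTA AGAGTTTCTT CAAGGGCCCG GGCATCTGGA TCGTTGTTGT GGTCGGAATG"

seq = clean_sequence(first_row)
print(len(seq), "bases, GC fraction", gc_fraction(seq))
```

Passing the complete sequence through the same two functions would give the gene-wide value to compare with the GC content field.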
 
Protein sequence
MKAKSFFKGP GIWIVVVVGM LLLAFATLAP GGATRIDTKP GLELLADKGK VEQAKIFDAE 
NRVDLVLKDN LVIDGQDKGK NVQFFFVNAR AADVVKAVTD ADPSSGYTDQ PIENNWFSGL
FSLLIPVLLL GVLFWFLLSR MQGGGSKIMQ FGKSKAKLVN KDMPQVTFSD VAGADEAVEE
LQEIKEFLQE PAKFQAVGAK IPKGVLLYGP PGTGKTLLAR AVAGEAGVPF FSISGSDFVE
MFVGVGASRV RDLFEQAKAS SPAIIFVDEI DAVGRHRGAG IGGGNDEREQ TLNQLLVEMD
GFDVKTNVIL IAATNRPDVL DPALLRPGRF DRQITVEAPD LVGRDQILQV HAKGKPMAPG
VDLKAVAKKT PGYTGADLAN VLNEAALLTA RSNANLIDDR ALDEAIDRVM AGPQKRSRVM
KEHERKITAY HEGGHALVAA ALRNSAPVTK ITILPRGRAL GYTMVVPEND KYSVTRNELL
DQMAYAMGGR VAEEIVFHDP STGASNDIEK ATATARKMVT EFGMSERVGA VRLGQGGGEP
FLGRDAGHER NYSDQIAYIV DEEVRRLIDQ AHDEAYAILT ENRDILDSLA LELLERETLN
QAEIAYVFRD IRKRDFREVW LSKETRPVQS AGPVESRHER AEREAQEEAK EARLEEPLDA
RPPHPQGVAG QETFGGGVTD VSTDGPQHG