Gene Arth_0342 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0342 
Symbol 
ID4447171 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp360239 
End bp363400 
Gene Length3162 bp 
Protein Length1053 aa 
Translation table11 
GC content68% 
IMG OID639688138 
Productbeta-phosphoglucomutase family hydrolase 
Protein accessionYP_829843 
Protein GI116668910 
COG category[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG0637] Predicted phosphatase/phosphohexomutase
[COG1554] Trehalose and maltose hydrolases (possible phosphorylases) 
TIGRFAM ID[TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
[TIGR02009] beta-phosphoglucomutase family hydrolase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGAGT CGCCAACCGC CGCCGCCGCG CTGGCCCCGG TTGACGCCGT GGTTTTCGAT 
CTTGACGGTG TCGTCACGGA CACAGCGGAC CTCCACGCGG CGGCATGGAA GGAATTGTTC
GACGACGTCC TCCAGGACCC CCGGATTCCG CCGACCGCAC GGCGCGATCC TTTCACGGAT
GCCGACTACC TGCGCTATGT GGACGGCCGC ACCCGGGAGG ACGGAGCGGC GTCGTTCCTG
CACTCGCGAG GGGTGGATCT GCCGGCTGGC GGGCCCGCGG ACGGCCCCGC GGAGTGGACG
GCGGTGGGGC TGGGGGCGCG CAAGAACGGG ATCTTCGAAA GGCTGCTGAG GCTCCGGAGC
GTCCCTGTTT TCCCCGGAAC ACTGGCGCTG CTCGAACGCC TCAAGGCCGG GAAGGTCCCG
GTGGCTCTCG CTACCGCCAG CCGGAATGCC CGCGCGGTGC TGGCATCCGC CGGCCTCGAA
GACGGTTTCG ACGTCGTTGT CGACGGGAAC ACCGCGGCGC AGCTGGGACT GGCCGGCAAG
CCGGACCCTG CGCTGTTCCT CCACGCCATC GGCGAACTCG GGGTGGCACC GGAACGGGCG
GTGGTCATCG AGGATGCCGT GGCCGGCGTG GCAGCGGGCC GACGCGGCGG CTTCGGGCTG
GTGGTGGGGA TCGACCGGGC CGGACAGCGG GCCGAGCTGG AAGCCGCGGG CGCCGATTTC
GTCCTGGACG ACGTCAGTGA ACTGGACCTT GGCCTGGTCA TCGCGGATCC CTGGCAGCTG
GTCTATGAGG GATTCGACCC CGCCCACGAG GGCCACCGGG AGGCCCTGAC CACCCTGGGC
AACGGCTATA TGGCAACCCG GGGGGCGGCC CCCGAACACC GCAAGGACGA CGTGCACTAC
CCGGGAACCT ACCTGGCCGG CGTCTACAAC CGGCTCACCA GCGTGATCCA GGGGCAGGAG
ACCGAGGACG AGCACATGGT GAACATGCCG GACTGGCTGC CGCTGGACCT GTGCGTGGAG
GGCGGAAAAT GGTGGTCCGA AGGTGGACTG CGGCTCCGCA GCGAGCGGCG CACGCTGGAC
CTGAAGCGAG CGCTGCTGAC GCGGGAAGCT GTGCTGGAGG ACGACGCCGG GAGGCAGCTT
GATGTGGTCC AGCGTCGGCT GGTCTCCATG GCAGAACCGC ACCTCGCCGC CCTCGAGATG
ACGGTGACAG CCGTCGGCTG GGGTGGCCAG GTGAGTATCC GTAGTGGCTG CGACACGGAC
ATCACCAACT CGAACGTGGC GGTGGAAGCG CTCCTGTCCA ACCGGCACCT GACAGATGTG
ACGGTTTCGG GGGCCGATGA CGGGCCCGGA GCCGCCACCC AGGTTGTCCT CGTCCAGACC
ACCCAGAGCA GGATCGGGGT GGCGGTGGCC ATCCGGACAG ACGTCCCGGC CAGCCTGCAT
CCGGCCAGGC CCGAGGAGCT GGGAGGACTG TTCGTGCACC GCTTTGACGT GGAGCTGCGG
GACGGCGAGC CGGCAACTGC CACCAAGACC GTGGCCGTGG CGACCTCCCG CGACCACGCC
ATTTCCTCGC CCCGGACAGC GGCCGTTGAT GTCCTCAACC GCTCCGGGGG AGGTTTCGAG
GTGCTGCTGG CCAGCCACGA GGCGGCGTGG CGGGGACTGC TCACGCCCTT CATCATCGAG
GTCGATGACT CTACCGAGTC ACAGCTGATC CTGAACCTGC ATGTCTTCCA CCTGCTCCAG
ACCATTTCGG CGCACACGGC GGAGCTGGAC GCCGGTGTCC CCGCCCGCGG CCTGCACGGC
GAGGGCTACC GGGGCCACAT TTTCTGGGAC GAACTGTTTG TCCTTCCGCT CCTGACGTCC
CGGCTTCCGG CCGTGACGCG GGCCCTCCTT GACTACCGGT GGCGGCGGCT GGGAACGGCC
CGGGATGCGG CCAGGGCGGC CGGCTTCGCA GGTGCCATGA TTCCGTGGCA AAGCGGGAGC
GACGGCAGGG AGGAGACGCC CCGGCTCCTG TTCAACTCCC GTTCCGGCCG GTGGGTGCCG
GACTACTCCC ACCTGCAGCG CCACTCCGGG CTGACGGTCG CGTACAACGC CTGGCAGTAT
TTCGAAGCGA CCCAGGACCG CGCCTGGCTG ACCCACCACG GTGCGGAGAT CATCGTGGAG
GTTGCCCGCC TTTTCGCGTC GATGGCGGAG TACGATCCCG CCGCAGACAG GTTCCATATC
CGCGCGGTGG TGGGTCCGGA CGAGTACCAC ACGGGCTACC CGGACAACCC CGGCGGCGGC
CTGGATGACA ACGCCTACAC CAATGTCATG GCGGCCTGGG TATGCGACCA GGCGGTATGG
ATCATGAGTT CCGTGCGGGG CTTCGACATG GACGACTTCC GGGAAAGGCT GCGGGTCACG
GACAACGAAA TTGACGGCTG GGCGCGGCTC GGGCGCAGGG TGTTTGTTCC CTTCCACGCC
GACGGCATCA TCAGCCAATT CGCCGGGTAT GAGAATCTCA AGGAGTTGGA CTGGGAGCAC
TACCGCCGCA CCTACCGGAA CGTCCAGCGG CTGGACCTCA TCCTCGAAGC GGAGGGCAGC
AGCACCAACC ACTACCGGCT CGCCAAACAG GCGGACGCCC TGATGCTGCT CTACGTCCTC
GGCGAAGACC AGCTCACCAC CTTCCTGGAC CGGCTCGGGT ACACGGTGAC GGCGGAGCAG
ATCGCGAGGA CCGTCGACTT CTACCTTGCC CGGACGGCGC ACGGGTCAAC GTTGAGCAGG
GTGGCGCACG CCTCCGTCCT GGCGCAGCTT GATCCGGAAC GGGCGTGGGA CACGTTCCGC
GAAGCCCTCG ACGCGGACCT TGACGACACC CAGGGCGGCA CCACCCGGGC GGGCATCCAC
CTGGGCGCCA TGGCCGGTTC CATCGACGTC ATCCAGCGCA GCTTCGCCGG CCTGCGCATT
ACGAGGGACG CCCTGGATTT CTCTCCCCGC CTGCCCGCGG AACTCGGCAG GGTTACTTTC
AATGTGCGCT ACCGGGACCA GCTGCTGGCG GTGCACCTGG AGAAAGGCCG CCTGCTGGTT
TCCGCGGACC CGGGCGACGC TTCGCCGGTG CTGGTCCGGC TTGGGACCCA ACACGTCCTG
CTGCATGCCG GGCAGGACCA CGAATTCCGG CTGGCCACCT GA
 
Protein sequence
MTESPTAAAA LAPVDAVVFD LDGVVTDTAD LHAAAWKELF DDVLQDPRIP PTARRDPFTD 
ADYLRYVDGR TREDGAASFL HSRGVDLPAG GPADGPAEWT AVGLGARKNG IFERLLRLRS
VPVFPGTLAL LERLKAGKVP VALATASRNA RAVLASAGLE DGFDVVVDGN TAAQLGLAGK
PDPALFLHAI GELGVAPERA VVIEDAVAGV AAGRRGGFGL VVGIDRAGQR AELEAAGADF
VLDDVSELDL GLVIADPWQL VYEGFDPAHE GHREALTTLG NGYMATRGAA PEHRKDDVHY
PGTYLAGVYN RLTSVIQGQE TEDEHMVNMP DWLPLDLCVE GGKWWSEGGL RLRSERRTLD
LKRALLTREA VLEDDAGRQL DVVQRRLVSM AEPHLAALEM TVTAVGWGGQ VSIRSGCDTD
ITNSNVAVEA LLSNRHLTDV TVSGADDGPG AATQVVLVQT TQSRIGVAVA IRTDVPASLH
PARPEELGGL FVHRFDVELR DGEPATATKT VAVATSRDHA ISSPRTAAVD VLNRSGGGFE
VLLASHEAAW RGLLTPFIIE VDDSTESQLI LNLHVFHLLQ TISAHTAELD AGVPARGLHG
EGYRGHIFWD ELFVLPLLTS RLPAVTRALL DYRWRRLGTA RDAARAAGFA GAMIPWQSGS
DGREETPRLL FNSRSGRWVP DYSHLQRHSG LTVAYNAWQY FEATQDRAWL THHGAEIIVE
VARLFASMAE YDPAADRFHI RAVVGPDEYH TGYPDNPGGG LDDNAYTNVM AAWVCDQAVW
IMSSVRGFDM DDFRERLRVT DNEIDGWARL GRRVFVPFHA DGIISQFAGY ENLKELDWEH
YRRTYRNVQR LDLILEAEGS STNHYRLAKQ ADALMLLYVL GEDQLTTFLD RLGYTVTAEQ
IARTVDFYLA RTAHGSTLSR VAHASVLAQL DPERAWDTFR EALDADLDDT QGGTTRAGIH
LGAMAGSIDV IQRSFAGLRI TRDALDFSPR LPAELGRVTF NVRYRDQLLA VHLEKGRLLV
SADPGDASPV LVRLGTQHVL LHAGQDHEFR LAT