Gene Arth_1230 details

Gene Information

Locus tag: Arth_1230
Symbol:
ID: 4446259
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 1349496
End bp: 1350944
Gene Length: 1449 bp
Protein Length: 482 aa
Translation table: 11
GC content: 66%
IMG OID: 639689038
Product: polysaccharide deacetylase
Protein accession: YP_830724
Protein GI: 116669791
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0726] Predicted xylanase/chitin deacetylase
TIGRFAM ID:
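
The Start bp, End bp, Gene Length, and Protein Length fields are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon that is not counted in the protein. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1349496
end_bp = 1350944
reported_gene_length = 1449     # bp
reported_protein_length = 482   # aa

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1449 0 482
```

Under these assumptions the computed values agree with the reported 1449 bp and 482 aa.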


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCAGA GGCATGAACT GCCGGGGGAA CAGGGCGGTC CGCCCGGTTC CGGTTTGATG 
CGGCAGCGGA AACGGACAAC GCTCTGGGCA GTCTCATGGC TGGCCGGGCT GGCACTCCTG
ACGGGATCCG CCGCCCTGGT ACCCAAGGCC GGCCCGGGGC CGGCCCGGGA TGCGCAGGGG
CCCCTGAAGA ATTTTGCGGC ACCCGTGGAC ACGAGCCTCG CGCTGACCAC CGTCACGCTG
ACGTTCGACG GCGGCAGGGC CAGCCAGCTG GCCGCCGCGG AGACCCTTCG AAGCCACGGG
CTACGGGGGA CGTTCTTCGT CAACTCGGGT TTCATGGGGG CCAAGGACTA CATGACGGTG
GAGGATCTGC ACAAGCTGGC CGCGGACGGC AACGAGATTG GCGGGCACAC CGCAACCCTT
GCAGACTTGA CGGCACTTGA ACCTGCGGAA GCCACCCGAC AGGTCTGCAA TGACCGGACC
AACCTCACCG ACTGGGGCTT CAAGGTCACA TCTTTCTCCT ACCCCTTTGC CGCAAAGTCG
CCAGAGGCGG AGGCGATGGT GGCCGGCTGC GGCTACAACA GCGCGCGCAG CCAGGGTGAC
CTCCGCAGCA AGCTGGGGTG CGCCGATTGC GCCGTGTCCG AAACCGTCCG GCCGGCGGAC
CCGTTCAGCA CGAGGTCGAC GCCGGAGATC GGGTCCGCGT GGACCATCGC GGACCTCCAG
CAGTCAGTCA TGGACGCTGA AACCACCGGA GGGTGGCTGC AGCTGAGCTT CTTCGACATC
GATGACAGCG GGAGTCCCCG GTCTGTCAGC CCTGCACTCT TCAACGACTT CGTCTCGTGG
CTGGTGACCC GCACAGACGA AGGAACCACC GCCATCCGCA CCGTGCACGA TGTGATTGGT
GGCGGGGCAA AACCCGCAGT TCCGGGACCT GTTGCTCCGC CTGCGCCGCC CGGCACCAAC
GCGCTCCGCA ACCCGGGACT GGAAACAGCC GGAAGGTACG GCCTTCCCGA GTGCTGGCAG
GTTTCCTCCT ACGGCGAGAA TTCACATGTC CTGAGCACGC TGACTCCCGG GCACTCGGGC
ACTATTGCCC GGCGCCTCGA CGTCACGGGC TACACATCCG GGGACGCAAA GCTGCTCCCA
GTCATGGACC TGGGGGCGTG CGCACCGAGC GTCGTCGCCG GCCACAGCTA TACGCTGCGC
GCATGGTACG CCTCCACTTC ATCAACGCAG TTCGAGTTGT ATTACCGCAA CAAGGTTGGC
ACCTGGACCT ATTGGACAGC CAGCCCATGG TTTCCCGCCA GCGCTGCCTA CCGGCAGGCT
GAGTGGACTG CACCGCCGGT GCCGGCAGAC GCCGTCGGCA TCAGTTTTGG CTTGAATCTG
TTCAGCGACG GGGAGCTGGC TACGGATGAC TACGAGATGT TCGACACGGG GGCGCCGCCC
GCCCCGTAG
 
Protein sequence
MSQRHELPGE QGGPPGSGLM RQRKRTTLWA VSWLAGLALL TGSAALVPKA GPGPARDAQG 
PLKNFAAPVD TSLALTTVTL TFDGGRASQL AAAETLRSHG LRGTFFVNSG FMGAKDYMTV
EDLHKLAADG NEIGGHTATL ADLTALEPAE ATRQVCNDRT NLTDWGFKVT SFSYPFAAKS
PEAEAMVAGC GYNSARSQGD LRSKLGCADC AVSETVRPAD PFSTRSTPEI GSAWTIADLQ
QSVMDAETTG GWLQLSFFDI DDSGSPRSVS PALFNDFVSW LVTRTDEGTT AIRTVHDVIG
GGAKPAVPGP VAPPAPPGTN ALRNPGLETA GRYGLPECWQ VSSYGENSHV LSTLTPGHSG
TIARRLDVTG YTSGDAKLLP VMDLGACAPS VVAGHSYTLR AWYASTSSTQ FELYYRNKVG
TWTYWTASPW FPASAAYRQA EWTAPPVPAD AVGISFGLNL FSDGELATDD YEMFDTGAPP
AP
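
The protein sequence corresponds to the gene sequence read codon by codon with translation table 11. The sketch below illustrates this in plain Python on the first 30 and last 9 bases of the gene sequence shown above. It assumes the sequence is given 5' to 3' on the coding strand, and it relies on table 11 sharing its codon-to-amino-acid assignments with the standard code (the two differ in permitted start codons). The helper names `clean`, `translate`, and `gc_content` are hypothetical and are not part of any database tool.

```python
from itertools import product

# Codon table in NCBI order (T, C, A, G). Translation table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()

def translate(seq):
    """Translate DNA codon by codon; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

def gc_content(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

first_30 = "ATGAGCCAGA GGCATGAACT GCCGGGGGAA"  # start of the gene sequence
last_9 = "GCCCCGTAG"                           # end of the gene sequence

print(translate(first_30))  # MSQRHELPGE
print(translate(last_9))    # AP*
```

The two fragments translate to the first ten and last two residues of the listed protein, followed by the stop codon. The same functions can be applied to the full gene sequence to compare the result with the listed protein and the reported GC content.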