Gene Arth_1227 details

Gene Information

Locus tag: Arth_1227
Symbol:
ID: 4446290
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 1344858
End bp: 1346831
Gene Length: 1974 bp
Protein Length: 657 aa
Translation table: 11
GC content: 63%
IMG OID: 639689035
Product: polysaccharide deacetylase
Protein accession: YP_830721
Protein GI: 116669788
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0726] Predicted xylanase/chitin deacetylase
TIGRFAM ID:
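
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1344858, 1346831)
print(length, protein_length(length))  # prints "1974 657"
```

Both values agree with the Gene Length and Protein Length fields of this record.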


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.18341
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGCTTC TGCAACGCTA CACTGCGGCC TCGGAGCCCC GGCACCGGGC TGAACGGCCT 
CCCGGCCGAC GCAAGGCCCG CTGGCTGGCG GCCTCCACAA CGGTGGCGCT CGCTGCCTCC
CTTCTCGGAC TCACGGCTCC CGCCGCCAAC GCTGTTGGCG AAACTGTCGT GACGCTGACG
TTCGACGACG CCAACGTTGA CCAGGTGGTT GCCGCCGACA AGCTGGTCTC GGCCGGCATG
CGCGGGACGT TCTTCCTGCC CTCGGGGTTC ATGAACCAAA CGAACTATAT GACGACGGCC
CAGGCACTGG CGTTGCAGGC CGCCGGACAC GAGATTGGCG GGCACAGTGT GACGCACGCC
GATCTGGCCG CCGTAGGAAC TGACGAGCTC GCACGCCAGA TCTGCAATGA CCGGGCCACT
CTGATGAGTT ATGGCCTGAA CGTGGAAAAC TTCGCATATC CTTTTGCCTC GGCAAGTCCG
GCCGCAGAAG CTGAGGTCGG AACGTGCGGC TATAACAGCG CCCGCGGCTT GGGGGACATC
CGGAGCAAGG TTGCGGGTTC GGAAACCTTC CCGTTTGCTG AATCGTTGAC GCCCGCTAAC
CTCTTTTACA CCGGGGCACC GGACCAGCTG GACGAAACAT GGACCTTGGC GGATATCCAG
AGCCTTGTCA CGCAGGCCCA GGCCAATGGC GGGGGTTGGG TCCAGCTGAC GTTCCACCAC
GTCGGTACCG GCATCCTGGT TGGTTCACCT GTTAAGGATC CGCTGACCGT CAGCACGGAC
ACCTTCAACC AGTTCATCGA CTGGCTGGCC GGCCAGAGGG ACACCAACAC CACCAACCCG
GTCCAGGTTA AGACCGTGAA GGAAGTCATC GGCGGCACTC TCAAGGCAGC GCCCGCCGTT
GTTCCGCCAC CGGCACCGCA GACCACCGGC AACCTGGTCA AGAACCCGAG CCTGGAGACC
GCGGGCCTTA ACGGATCACT TCTCCCGCGG TGCTGGCAGG CCGGCGGCTA CGGCAACAAC
ACCCGGACAT TCTCCACAGT GACCCCGGGA CACGCTGGTA CGGCCACTGC GTCGCAGCAG
ATGGTGATCT CGGCCTACAC CGACGGCGAC GGCAAGCTGT TGCCCACCTT GGACACGGGT
GAGTGCGCGC CCAGTGCAAC TCCGGGGCAC ACGTACCTTG CCAAGGCCTG GTATAAATCG
ACCGGGGGGA ACACCCAGTT CAATCTGCAC TACCGGACGG CGTCCGGAAC ATGGACGTAC
TGGACGGACA GCACTCTCTA CCTCCCGACA AATAACGTTT GGACCCAGCT GGCCTTTACG
ACGCCCCCGG TTCCGGCCGG TGCAACGGCT ATCAGTTTCG GCTTGAACAT GATCGGCGTC
GGGACCCTGG TCACTGACGA CTACGAACTG TACGACACTG CAGGCATCAA GACATTCAGC
GATGTTGCCA CCAGCAACCA GTTCTACAAT GAGATCAGCT GGCTCTCCAA CAACCTGATC
ACCCAGGGTT ACGCGGATGG TACCTTCCGG CCTCTGGCAT CGATCAACCG GGACGCGATG
GCAGCGTTCC TGTACCGGTT GGCGGGCAGC CCCGCAGTGC CGGCCAACGC TCCGACGTTC
ACCGACGTAG GTCCTACCAA CCAGTTCTAC AACGAGATCC GCTGGCTGGC GGCCCAAGGC
ATCACCACCG GCTACACGGA CGGAACCTAC CGCCCGTTGG ACCCTGTCAA CCGCGACGCG
ATGGCGGCGT TCCTTTACCG CTACAACGGC AAGCCGGCTG TGCCGACCAC CGCCCCCACG
TTCCCTGACG TCACTACCGG CAACCAGTTC TACAACGAGA TCCGCTGGCT GGCAGCGACC
GGGATCACCA CAGGCTACCC GGATGGCACA TTCCGTCCGG TTCAGCCGAT CAGCAGGGAC
GCCATGGCAG CGTTCGTCTA CCGCTACAAC CTGAACTTCC CGAAGGGAAT GTAA
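
The gene sequence is listed in space-separated blocks of ten bases, so it has to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the listing is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Join a whitespace-blocked sequence listing into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing, used here as sample input only.
first_line = "ATGACGCTTC TGCAACGCTA CACTGCGGCC TCGGAGCCCC GGCACCGGGC TGAACGGCCT"

seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Pasting the full listing in place of `first_line` should give a length equal to the reported gene length of 1974 bp, and the resulting GC fraction can be compared with the reported GC content of 63%.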
 
Protein sequence
MTLLQRYTAA SEPRHRAERP PGRRKARWLA ASTTVALAAS LLGLTAPAAN AVGETVVTLT 
FDDANVDQVV AADKLVSAGM RGTFFLPSGF MNQTNYMTTA QALALQAAGH EIGGHSVTHA
DLAAVGTDEL ARQICNDRAT LMSYGLNVEN FAYPFASASP AAEAEVGTCG YNSARGLGDI
RSKVAGSETF PFAESLTPAN LFYTGAPDQL DETWTLADIQ SLVTQAQANG GGWVQLTFHH
VGTGILVGSP VKDPLTVSTD TFNQFIDWLA GQRDTNTTNP VQVKTVKEVI GGTLKAAPAV
VPPPAPQTTG NLVKNPSLET AGLNGSLLPR CWQAGGYGNN TRTFSTVTPG HAGTATASQQ
MVISAYTDGD GKLLPTLDTG ECAPSATPGH TYLAKAWYKS TGGNTQFNLH YRTASGTWTY
WTDSTLYLPT NNVWTQLAFT TPPVPAGATA ISFGLNMIGV GTLVTDDYEL YDTAGIKTFS
DVATSNQFYN EISWLSNNLI TQGYADGTFR PLASINRDAM AAFLYRLAGS PAVPANAPTF
TDVGPTNQFY NEIRWLAAQG ITTGYTDGTY RPLDPVNRDA MAAFLYRYNG KPAVPTTAPT
FPDVTTGNQF YNEIRWLAAT GITTGYPDGT FRPVQPISRD AMAAFVYRYN LNFPKGM