Gene Arth_2337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2337 
Symbol 
ID4445102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2626341 
End bp2627843 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content66% 
IMG OID639690146 
Productpolysaccharide deacetylase 
Protein accessionYP_831817 
Protein GI116670884 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0726] Predicted xylanase/chitin deacetylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00797389 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTTGGT CGAATCAGCC TGTCCGCAAG CCCGCGCGTT GCGTGGCGGA TCCCCCTGAA 
CCATTGGCCG AAGGCCGGCA CGGCAGGCGC TGGGTTGCAG TCCTGGGTGC CCTGCTGCTT
GCTGTCAGCA CAGTGGCGGG CGTGGCCGGT ACTTCGAAGG CTGCGGCGCC AACCATCGTC
AGCCTGACGT TCGACGACGG CCTCGGCAGC CAGTTGGCCG CGGCGCAGGA GCTGAAAGCC
CACGGACTGG TGGGAACTTT CTTCATCACC ACGTCCTTCG TGGGCCAGTC CGGGTTCCTC
ACCCAGGCAA ACCTGAACAC TCTCGTGGCC GACGGCAATG AGATCGGCGG CCACAGCGTG
ACCCACCCGG ACATGACAAC GCTCAGCGCA GCGGCCGCCA GCGCGGAGGC CTGCAACAGC
AAGTCGACCC TCGAGGCCTG GGGCTTCACC GTCCGGAACT TCGCCTACCC CTTCGCGGCG
GTGAATCAGA CAGCCCAAAA CGCCGTCAGC GGCTGCGGGT ACAGCAGTTC CCGTGGCCTC
GGCGACATCC GTTCTCCCGC CAGCTGCGCC GACTGCCCTG TCGCCGAGAC GCTCCCGCCG
CAGGAACCCA TGGTCACCAA GGCGCCGGAC CAGGTGGCCG CCACATGGAC CCTGGCGGAC
CTGCAGGCGA CAGTCACCAA CGCCGAGACA ACCGGCGGCT GGCTGCAGCT GACGTTCCAC
GAGATTGCGA ACGGAACGGA TCCGTCGCTG TCCATCAGCC CCGCGCTGTT CAAAGAGTTC
GTCACGTGGC TGGCCGCGCG GACAGCAAAC GGTACGACGT CGGTGCGTAC AGTGGCGCAG
GCATTGGGCC AGTCCCCCGT GACGTCACCG CCTCCCTCGC CTTCACCGTC ACCATCGCCG
ACCCCGTCAC CCTCGCCGGC GGGCTCGTTT ACGGATGTTC CCGCGAACTC GCAGTTCTAC
ACGGAGATCA GCTGGCTCGC CTCGCAAGGG ATTTCCACTG GCTGGGTTGA AGCCAACGGC
ACCAGCACGT ATCGCCCGGC GCTCGCCGTC AACCGGGATG CCATGGCTGC CTTCATGTAC
CGACTCGCCG GGAGTCCGGC GTACACACCG CCGGGCACCT CCCCGTTTAT CGATGTGACG
CCGCAGACGC AGTTCTATAA GGAAATCGCC TGGCTCGCTT CGAAGGGCAT TTCCACCGGC
TGGGACGAAG GAAACGGCGC CAAGTCGTAC CGGCCGCTGC AGACCGTGAA CCGTGATGCG
ATGGCAGCCT TCATGTACCG CTTTGCCGGC AGCCCTGCCT ACAACGCCCC GGGCGCCTCC
CTCTTCACTG ACGTGGTCCC GCAGACGCAG TTCTACAAGG AAATCAACTG GCTCGCGTCG
ATGAACATCT CCACCGGCTG GGTCGAAGGG AACGGCACCA GGACCTTCCG TCCCGTCCAG
TCCGTGAACA GGGACGCCAT GGCGGCCTTC ATGTACCGCT ACAACAACGC CTTCCCTGCG
TAA
 
Protein sequence
MRWSNQPVRK PARCVADPPE PLAEGRHGRR WVAVLGALLL AVSTVAGVAG TSKAAAPTIV 
SLTFDDGLGS QLAAAQELKA HGLVGTFFIT TSFVGQSGFL TQANLNTLVA DGNEIGGHSV
THPDMTTLSA AAASAEACNS KSTLEAWGFT VRNFAYPFAA VNQTAQNAVS GCGYSSSRGL
GDIRSPASCA DCPVAETLPP QEPMVTKAPD QVAATWTLAD LQATVTNAET TGGWLQLTFH
EIANGTDPSL SISPALFKEF VTWLAARTAN GTTSVRTVAQ ALGQSPVTSP PPSPSPSPSP
TPSPSPAGSF TDVPANSQFY TEISWLASQG ISTGWVEANG TSTYRPALAV NRDAMAAFMY
RLAGSPAYTP PGTSPFIDVT PQTQFYKEIA WLASKGISTG WDEGNGAKSY RPLQTVNRDA
MAAFMYRFAG SPAYNAPGAS LFTDVVPQTQ FYKEINWLAS MNISTGWVEG NGTRTFRPVQ
SVNRDAMAAF MYRYNNAFPA