Gene Namu_3200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3200 
Symbol 
ID8448814 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3524503 
End bp3526287 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content66% 
IMG OID645042280 
Productpolysaccharide deacetylase 
Protein accessionYP_003202521 
Protein GI258653365 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0726] Predicted xylanase/chitin deacetylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000473135 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00151321 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCGCAGC GTCAACCGCA GTTCCACCAC CGGGCCGCCG GGTTCGTCAT CGCCCTCGGC 
TTCCTGATCG GCGCGGCCAT CCTGATCGCG CCCATCGCGA ATGCCGCCGA CACCACGGTC
GACGATGCGG TGACCAGTGG CGCCAACTAC TTCACCTACA CCGGCTCGTC CTGGACCAGC
TGCGGCGGCT GCAACAGCAC CGCCACCAAC AACAGCTACC GCTACGCCTA CACCACCGGC
AACAAAGCCG TCCTGACCTT CACCGGCACC CAAGCCACCA TCTACGGCTT CAAAGAACCC
CCCGGCGGCA TCGGCAGTTT CGCCGTCGAC AACGGCCCCA CCACCGACAT CGACTTCTAC
GCCGCCACCC AAACCCTCAC CGCCGTCTAC ACCACCCCCA CCCTCACCAA CACCACCCAC
ACCATCACCA TCACCGTCAC CGGCCGCAAG ACCAACGGCG CCTCCCCCAC CATCAACATC
GACAAGGCCG TCATCGCAAC AACAACCGGC GCCGGTACCA CCACCCCCAC CAGCACCACC
ACCCCCACCA GCACGGCCAC CACCACCCGA ACCACCACCC CCACCAGCAC CACCACCCAG
CCGACGGTCA CCCCGCCGAC CACTCCCCCG GCGACCGGGG TGACCGTTGA TGATGCGGTG
ACCAGTGGCG CCAACTACTT CACCTACACC GGCTCGTCCT GGACCAGCTG CGGCGGCTGC
AACAGCACCG CCACGAACAA CAGCTACCGC TACGCCTACA CCACCGGCAA CAAAGCCGTC
CTGACCTTCA CCGGCACCCA AGCCACCATC TACGGCTTCA AAGAACCCCC CGGCGGCATC
GGCAGTTTCG CCGTCGACAA CGGCCCCACC ACCGACATCG ACTTCTACGC CGCCACCCAA
ACCCTCACCG CCGTCTACAC CACCCCCACC CTCACCAACA CCACCCACAC CATCACCATC
ACCGTCACCG GCCGCAAGAC CAACGGCGCC TCCCCCACCA TCAACATCGA CAAGGCCGTC
ATCGCAACAA CAACCGGCAC CGGTACCCCC ACCCCGACCA GCACCACCAC CCCCACCAGC
ACGGGCACCA CCACCCGAAC CACCACCCCC ACCACCACCC AGCCGGGCGG GACGGGCATC
GCCAGCATCA CCTTCGATGA CGGCACGATC GGCCAATACA CCTATGCGCG GCCACTTCTC
GTGCAACGCT CGCTTCCGGC GACCTTCTTC ATCATCTCGG ACGCGCTCGG CTGGACAGGC
ACCAACATGA ACGCGACCCA GGTCCGGCAA CTCGTCGCCG ACGGCGACGA GATCGGCAAT
CACACCCGTG ACCACACGAA CCTGGCCACG CTGTCGGCGA GTCAGGTCAG CGCGGAGTTC
ACCCATTCAC AAACGGTCAT CGCCAACCAG ATCGGGGTGA CCCCCACGAC GTGCGCCTAT
CCGTACGGCA GTCACAACTC CACCGTCGAT TCGGTTGCCG GGAATTTCTT TCGTGGGTGC
CGGGAAACGG GTGGAGGATT GAACACGTCG GGCTCGTTGC GGCCGTACGC GCTCACCAAT
TACTACGTCG GCCAGACCAC CACGGCCGCC GACATCCGCA ACGCCGCCGA ACAGGCGAAG
GCGCAGAATG CCTGGGTCAT CTTCACCTAT CACGGCGTCG ACCCGAGCGG GACCGGCTCG
GAGGACGTCA CCCCGACGAA CCTGGCCGCG CAACTGGACG CCCTCCGGTC CACCGGGATC
CCCGTCGTCA CGGTGAGCGC GGCACTGTCG GCCTATGGCC GCTGA
 
Protein sequence
MSQRQPQFHH RAAGFVIALG FLIGAAILIA PIANAADTTV DDAVTSGANY FTYTGSSWTS 
CGGCNSTATN NSYRYAYTTG NKAVLTFTGT QATIYGFKEP PGGIGSFAVD NGPTTDIDFY
AATQTLTAVY TTPTLTNTTH TITITVTGRK TNGASPTINI DKAVIATTTG AGTTTPTSTT
TPTSTATTTR TTTPTSTTTQ PTVTPPTTPP ATGVTVDDAV TSGANYFTYT GSSWTSCGGC
NSTATNNSYR YAYTTGNKAV LTFTGTQATI YGFKEPPGGI GSFAVDNGPT TDIDFYAATQ
TLTAVYTTPT LTNTTHTITI TVTGRKTNGA SPTINIDKAV IATTTGTGTP TPTSTTTPTS
TGTTTRTTTP TTTQPGGTGI ASITFDDGTI GQYTYARPLL VQRSLPATFF IISDALGWTG
TNMNATQVRQ LVADGDEIGN HTRDHTNLAT LSASQVSAEF THSQTVIANQ IGVTPTTCAY
PYGSHNSTVD SVAGNFFRGC RETGGGLNTS GSLRPYALTN YYVGQTTTAA DIRNAAEQAK
AQNAWVIFTY HGVDPSGTGS EDVTPTNLAA QLDALRSTGI PVVTVSAALS AYGR