Gene Namu_3374 details


Gene Information

Locus tag: Namu_3374
Symbol:
ID: 8448989
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 3712750
End bp: 3714234
Gene Length: 1485 bp
Protein Length: 494 aa
Translation table: 11
GC content: 72%
IMG OID: 645042451
Product: glycosidase PH1107-related protein
Protein accession: YP_003202691
Protein GI: 258653535
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2152] Predicted glycosylase
TIGRFAM ID:
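
The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3712750
end_bp = 3714234
protein_length_aa = 494

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: every codon encodes one residue except the final stop codon.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(gene_length_bp)           # 1485
print(expected_protein_length)  # 494
```

Both results agree with the listed Gene Length (1485 bp) and Protein Length (494 aa).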


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0000228446
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00648646
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAAACAGG TTCTCGCGCA TCGGCTGTCC GTCCGGCTCG AGCCGGATCC GGCGCGGGTG 
GTGGCCCGCC TGTTCCTGCC CGGCGAGCAG ACCCAGCACG AACGCTCGCG GGTCGGGGGC
ATCGCCGACC GGGTGCTGGC CCTTCCCGAG GCGGACGTGC AGGTGCTGGC CGAGCAGGTG
CTGCGGGACT TCTCCGCGCG GCACCGTGAC CTGCCGGCCA TCCTGGACGG GCACGCCGTG
ATCATCGCCT CCCGGATCGG GGACACCCCG GTCTCGGCGG CGCGCATGCT GGTGCTGGGG
GCCAGCGTCA CCGCCGAGTA CGCCACCGAG TCCGCCGCGC TGTGCAACCC GAGCGCCGTG
CCGCACCCCG ATCAGTCCGG CCTGTTGGAC GGGCAGCTGC GGGTGGCGGT GAGCCTGCGG
GCCATCGGGG AGGGACACAT CTCCGCGATC GCCTTCGGCA CCGCCGTCAT CGGGCCCGGT
GCGCAGTGGG AGTTCCAGGA CCGGGAACGA CCGCTGGTCA CCGGCCGCAG CTCGCCGGCC
CAGTGGCGCA ACAGCCAGTT GGCCGCGGTG CTGGCCGACC ACGGCGTCGT GGACACCCTG
GGCGCGACCC TGCTGCACGA GCTGCCCGAC CTGTTCGACG TCGTCGATCT GGAGCGGGTG
CTGGCCCATG CCCCCGGCGA CCTGCTGGCC CGCCTAGGTG GACCGGCCAC CATCGACCTG
GTCCGCCGGG TGGTGTCCTC GGCCTACCGG GTCGAGTTCG ACGCCGACAC TGCGCTGGCC
CAACGGATCC TGCAGCCCAA TGCGGCCGAG GAGAGCAACG GACTGGAGGA CGCGCGGTTC
ACCCGCTTCG TCGACCCGGA CGGGGTGGTG GAGTACCGGG CGACCTACAC CGCCTACGAC
GGCCACCAGA TCGCCCCGCG GCTGCTGATC AGCTCGGATC TGCGCGAGTT CAACGCCTAC
CGGCTGGCTG GGTCGGCCGC CCGGAACAAG GGCATGGCCC TGTTCCCGCG GCTGGTCGGC
GGGCGGCACC TGGCGTTGTG CCGCACCGAC GGCGAGAACA TCAGCCTGGC CTACTCCACG
GACGGATTCC GCTGGTCCGA GCCGACCCTG CTGTACGGGC CGAGCCGGGC CTGGGAAGTG
GTGCAGGTGG GCAACTGCGG CCCGCCGGTC GAGACCGAGC GCGGCTGGCT GGTGCTCACC
CACGGGGTGG GTCCGATGCG CACCTACGCG ATCGGCGCCA TCCTGCTCGA CCTGGACGAC
CCGTCCCGGG TGATCGGCTC GCTGCGCCAC CCCCTGCTGG AGCCGATTGA CGGCGAGCGG
GACGGGTACG TGCCCAACGT GGTCTACTCC TGCGGCCCGG TCCGGCACGA CGGCCGGCTG
TGGGTACCGT TCGGCATCGA CGACGCGCGG ATCGGCGTCG CCTGGCTCGA TCTGGACGAG
CTGCTCGACG AACTCCTTGA CGGTGGGGTC ATCTCCGCGC TCTGA
 
Protein sequence
MKQVLAHRLS VRLEPDPARV VARLFLPGEQ TQHERSRVGG IADRVLALPE ADVQVLAEQV 
LRDFSARHRD LPAILDGHAV IIASRIGDTP VSAARMLVLG ASVTAEYATE SAALCNPSAV
PHPDQSGLLD GQLRVAVSLR AIGEGHISAI AFGTAVIGPG AQWEFQDRER PLVTGRSSPA
QWRNSQLAAV LADHGVVDTL GATLLHELPD LFDVVDLERV LAHAPGDLLA RLGGPATIDL
VRRVVSSAYR VEFDADTALA QRILQPNAAE ESNGLEDARF TRFVDPDGVV EYRATYTAYD
GHQIAPRLLI SSDLREFNAY RLAGSAARNK GMALFPRLVG GRHLALCRTD GENISLAYST
DGFRWSEPTL LYGPSRAWEV VQVGNCGPPV ETERGWLVLT HGVGPMRTYA IGAILLDLDD
PSRVIGSLRH PLLEPIDGER DGYVPNVVYS CGPVRHDGRL WVPFGIDDAR IGVAWLDLDE
LLDELLDGGV ISAL
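
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first 60 nucleotides with translation table 11 (the bacterial, archaeal and plant plastid code). This is a minimal sketch, not the database's own pipeline. The helper `translate_cds` and the constant names are hypothetical. The codon table is built from the compact amino-acid string for table 11, and the start-codon set is an assumption based on the initiators commonly listed for that table. The gene begins with GTG, which encodes valine internally but is read as methionine when it serves as the initiator.

```python
from itertools import product

# Codon order follows the compact layout: first base varies slowest, T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiator codons commonly listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODON_TABLE[c] for c in codons]
    # An initiator codon is read as methionine regardless of its usual meaning.
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues)


# First line of the gene sequence, copied from the record.
first_line = (
    "GTGAAACAGG TTCTCGCGCA TCGGCTGTCC GTCCGGCTCG AGCCGGATCC GGCGCGGGTG"
)
print(translate_cds(first_line))  # MKQVLAHRLSVRLEPDPARV
```

The output matches the first 20 residues of the protein sequence. The final codon of the gene, TGA, translates to `*` (stop) under the same table.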