Gene Namu_1771 details

Gene Information

Locus tag: Namu_1771
Symbol:
ID: 8447373
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 1940433
End bp: 1942676
Gene Length: 2244 bp
Protein Length: 747 aa
Translation table: 11
GC content: 69%
IMG OID: 645040897
Product: glycoside hydrolase clan GH-D
Protein accession: YP_003201150
Protein GI: 258651994
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3345] Alpha-galactosidase
TIGRFAM ID:
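
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS is a single uninterrupted reading frame ending in one stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1940433
end_bp = 1942676

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: one uninterrupted reading frame whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 2244 747
```

Under those assumptions the result matches the Gene Length (2244 bp) and Protein Length (747 aa) fields.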


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.145856
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.079989
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTTCAC CCGCCACGAC TTCCACCGTC AATGAGGCCG CGAGGCCGGC CGACCTCGGA 
TTGAACCACC CCGGTTCGGC GGTCCGGCAC CTGCGAGCCG CCGGGGTCAG CGTGGTGATC
GACTGCCGCG GGCCGCGTCT ACCGCGCATC GTGCACTGGG GCGCCGACCT GGGCGATCTC
GACCGGACCA CGCTGGACAA CCTGGTTCGA GCCGATGTGC AGGCCGTGGT CAGCAATGTT
CCCGACCAAC CGATCAGCCC TGGACTGCTC GCCGAACACT CGACCGGCTG GTTGGGACTG
CCCGGCCTGC TCGGACACCG TGCCGGCCAT CAGTGGTCGA CCCTGTTCGA CGTGACGACC
GTCAGCTCGA GCAACGACCG GGACGGAACC CGACGCGTCG TCGTCGACGC AGCGGATCCG
GACGCCGGCC TGCGCCTCGT GCTGACCCTG GAGCTGTTGC CGGCCGGGGT GCTTCGCACC
CGAGCCACCC TGACGTCGAT CGCCGACCAG GCCGGCGCGC CCTACACCGT CGACGGGCTG
GCCCTGTCGC TTCCGGTCCC GACCCGTGCC ACCGAACTGC TCGATTTCAC CGGCCGGCAC
CTGCGGGAAC GGCATCCGCA GCGCCGCCGG TTCGATGTCG GCAGTTGGGT CCGCGACAAC
CGCCGCGGCC GCACCGGAGC CGACGCCACG ATGCTGTTGG CCGCTGGCAC GCCCGGTTTC
GGCTTCGGCC ACGGTGAGGT CTGGTCGGTG CACACCGCCT GGAGCGGCAA TCACCGCACG
ATCGCCGAGC GCAGTCCGAA CGGACACGCC GTGCTGTCCG GCGGGGAACT GTTGCTGCCT
GGCGAGATCA CCCTGGGGCC GAGCGAGTCG TACACCACGC CATGGATCTA CGGTTCCTAC
GGCGGTGCCG GCCTGAACGC CGTGGCCGAC CGCTTCCACG CCTTCCTGCG GGGGCGGCCG
CAGCATCCCC GCAGCCCGCG TCCGGTCATC CTGAACACCT GGGAATCGGT GTACTTCGAC
ATGGACTTGC CGACGCTGAT CGCGCTGGCC GAGGCCGGCG CCGAGGTGGG TGTCGAACGG
TACGTGCTGG ACGATGGCTG GTTCACCGGC CGCCGCGACG ACACCGCCGG GCTGGGCGAC
TGGCAGGTCG ACCGCGACGT CTGGCCCAAT GGCCTCAAGC CCTTGGTGGA TCGGGTCACC
AAGCTCGGCA TGCAGTTCGG CATCTGGATT GAACCCGAGA TGATCAATCC CGACTCCAAT
CTCGCCCGCG CGCACCCGGA ATGGATGCTG TCCACCGGGC ACCGGCTGCC CATCGAATCA
CGGCACCAGC AGGTGCTGGA TCTGGCCAAT CCGGGCGCGT TCGACTACAT CCTGGGCAGC
CTGGACGACC TGCTCAAGAA GCACGACATC AGTTACCTCA AGTGGGACCA CAACCGGGAT
CTGATCGACG CCGGGCACAG TCCGGACGGT CAGCCGGCCG TGCACGACCA GACCCTGGCC
GTGTACCGGC TGATTGACGA ACTGCGCCGC CGCCACCCCG GGGTCGAGAT CGAGTCATGC
TCCTCCGGCG GCGCCCGAGT CGATCTGGAG GTGCTGCAGC GCACCGATCG GGTTTGGACC
AGCGACTGCA TCGACGCCCT GGAACGGCAG ACGATCCAGC GCTACACCGG GCTGCTGGTG
CCGCCGGAGA TGCTCGGCGC CCACATCGGC ACCGGCCAGG CGCACACCAC CGGACGCCGG
CACAACCTGT CCTTCCGGGC CGGCACCGCC CTGTTCGGCC ACATGGGAAT CGAGGCCAAC
CTGACCAGCA TGTCGGCCGC CGAGCGAGCT GAACTGGGCG AATGGGTTGC CCTGCACAAG
AAGCTGCGGC CGCTCCTGCA CACCGGCCGG GTGGTCCGTT TCGACGACGT CGACCCCTCG
CTGATGGTGC AGGGTGTCTA CGCGGCCGAC CTTTCCCAAG CCGTGATCAG CATCGCCGCG
ATCGCGACCG CCGACAGCGC CCCGATCGGC CGGCTCGTCA TTCCCGGCCT GGATCCGGAC
GCGCCCTATC ACCTGGAGCT GCTGCCGCCC GGCGATGTCA TCCAGGGCGA GGACGCCGAG
CGCCGCAACG GCAACAACAA GCACCTGCCA CCGTGGCTCG TCACCGGGAC CGACCTGACC
GGCGCGGCGC TGACCTACGC CGGGGTGCAG CTGCCCGATC TGCTTCCCGA GCAGCTGCTG
CTGCTGCGCG TCACCCGGGT ATGA
 
Protein sequence
MTSPATTSTV NEAARPADLG LNHPGSAVRH LRAAGVSVVI DCRGPRLPRI VHWGADLGDL 
DRTTLDNLVR ADVQAVVSNV PDQPISPGLL AEHSTGWLGL PGLLGHRAGH QWSTLFDVTT
VSSSNDRDGT RRVVVDAADP DAGLRLVLTL ELLPAGVLRT RATLTSIADQ AGAPYTVDGL
ALSLPVPTRA TELLDFTGRH LRERHPQRRR FDVGSWVRDN RRGRTGADAT MLLAAGTPGF
GFGHGEVWSV HTAWSGNHRT IAERSPNGHA VLSGGELLLP GEITLGPSES YTTPWIYGSY
GGAGLNAVAD RFHAFLRGRP QHPRSPRPVI LNTWESVYFD MDLPTLIALA EAGAEVGVER
YVLDDGWFTG RRDDTAGLGD WQVDRDVWPN GLKPLVDRVT KLGMQFGIWI EPEMINPDSN
LARAHPEWML STGHRLPIES RHQQVLDLAN PGAFDYILGS LDDLLKKHDI SYLKWDHNRD
LIDAGHSPDG QPAVHDQTLA VYRLIDELRR RHPGVEIESC SSGGARVDLE VLQRTDRVWT
SDCIDALERQ TIQRYTGLLV PPEMLGAHIG TGQAHTTGRR HNLSFRAGTA LFGHMGIEAN
LTSMSAAERA ELGEWVALHK KLRPLLHTGR VVRFDDVDPS LMVQGVYAAD LSQAVISIAA
IATADSAPIG RLVIPGLDPD APYHLELLPP GDVIQGEDAE RRNGNNKHLP PWLVTGTDLT
GAALTYAGVQ LPDLLPEQLL LLRVTRV
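
The sequences above are printed in groups of 10 characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup together with a GC-percentage calculation. The function names `clean_sequence` and `gc_percent` are hypothetical helpers introduced here for illustration, and the demo input is only the first two groups of the gene sequence, not the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Join a space- and newline-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First two 10-base groups of the gene sequence, used only as a short demo input.
demo = clean_sequence("ATGACTTCAC CCGCCACGAC")
print(len(demo), gc_percent(demo))
```

Passing the full gene sequence block to `clean_sequence` and then to `gc_percent` would allow a comparison with the GC content field in the Gene Information section, and `len()` of the cleaned string can be compared with the Gene Length field.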