Gene Namu_1770 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1770 
Symbol 
ID8447372 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1938889 
End bp1940436 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content70% 
IMG OID645040896 
Productglycoside hydrolase family 31 
Protein accessionYP_003201149 
Protein GI258651993 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.134766 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.116734 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTCCG CCGAGGAGCC AGCGGGCCGG ACCATGCCGC TGCTGGATGG CGAGTTGTGG 
TGGGGCGGCG CGGTCGCCGA CGGGACCGTC ATGCCGTTCG GTGTCGGCTC CCGGCACCAC
CGGGACCTGT CGACCAACGC GGGGTTTGTC GGTGATCCCG CCGCGGGAGC GAACCAGTCT
GCGCCATTGC TGGTCTCGAG CCGGGGGCGG TACGTGTGGT CGGCTCAGGC GTTCGCGTTC
GGCTTCGCCG ACGGTCAACT TGCCGTTTCC GGCACCGATG TCGTGGTGGG CGAGGGTGAT
ACCCCAACGC TGGCGGGGGC TTTCCGCGCC GCCCGGGTGA ACTTCCCCGC GCTGGGCCGG
GCCCCGGCGG CCCCGCTATT CGCCGGGCCG CAGTACAACA CGTGGATGGA GCTGCCGTAC
CGGCCCACCC AGGACGGTGT GCTGGCCTAC GTCCGGGGGC TGCTGGACGC CGGGTTCCCG
CCCGGGGTGG TGATGATCGA CGATCGCTGG AGCGTCGACT ACGGAGTCTG GCGCTTCGAT
CCGGCCGCGT TCCCGGACCC GTCCGGCATG ATCTCGACCC TGCATGATTG GGGCTGCCCG
GTGATGCTGT GGGTGGTGCC CTTCATCAGC CCGGATAGTG CGACGTTCCG GGACCTGGCC
GGCCGGGGGC TGCTCATTCG CCGACCGCAC GGTGAGATCG CCGTCCGGCA GTGGTGGAAC
GGGTACAGCG CGATGCTCGA CCTGACCACA CCCGACGCGA TCGCCTGGTT CACCGGCGAG
CTGGACGAAC TCCGCGAGCG GTATGGGGTG GACGGCTTCA AGTTCGACGC GGGCGACCTG
CGCGACTACC GGCTCGACGA CGTGACGGCG AAAAGTGCGA CCCCCACCGA GCTCTGCGAA
GCCTGGGCGC GCGTCGGGCT GCGGTACTCG TTCAACGAAT ACCGCGCCGG CTGGAAGATG
GGTGGCTCCC CACTCGCGCA ACGTCTGCAC GACAAACCGC CGACCTGGGA CGGCCACGGG
CTGGCCTCGC TCATCCCCGA GTCGATCGCC CAAGGTCTGA TCGGCCACCC GTTCGTCTGC
CCGGACATGA TCGGCGGCGG CGACCTGGCC GCCGCCGCGG CCGGCGTCGA TCAGGAACTG
TTCGTCCGCT ACGCCCAGCT CGCCGCACTG CATCCGATGA TGCAGTTCTC TCTGGCCCCG
CACCGGGTGC TGGACGCCGA TCATCTGATG GCGGTGCGAC AGGCCGTCGA CCTGCGCCAA
ACGCTGCTGG CCGAGCTGAC CGCAATGGTC CACGACGCCG CCCGCACCGG TGAGCCCATC
CTGCGGTCGC TGGCCTACGA CGATCCCGAC GACCCCGGCA CCACCGACCA GTACACCCTC
GGCGGCGACA TCCTGGTCGC GCCGGTTCTG GAGCCTGGTG CGACGACCCG GCGGGTCCGA
TTTCCCGCCG GGTGCTGGGT GGCCCCGGAC CGAGCCCGAT TCGATGGTCC GGACGTGCGG
TCCATCCCCG TCACGCTGAC CTCAGTTCCC TGGTACCGGC GCGCATGA
 
Protein sequence
MTSAEEPAGR TMPLLDGELW WGGAVADGTV MPFGVGSRHH RDLSTNAGFV GDPAAGANQS 
APLLVSSRGR YVWSAQAFAF GFADGQLAVS GTDVVVGEGD TPTLAGAFRA ARVNFPALGR
APAAPLFAGP QYNTWMELPY RPTQDGVLAY VRGLLDAGFP PGVVMIDDRW SVDYGVWRFD
PAAFPDPSGM ISTLHDWGCP VMLWVVPFIS PDSATFRDLA GRGLLIRRPH GEIAVRQWWN
GYSAMLDLTT PDAIAWFTGE LDELRERYGV DGFKFDAGDL RDYRLDDVTA KSATPTELCE
AWARVGLRYS FNEYRAGWKM GGSPLAQRLH DKPPTWDGHG LASLIPESIA QGLIGHPFVC
PDMIGGGDLA AAAAGVDQEL FVRYAQLAAL HPMMQFSLAP HRVLDADHLM AVRQAVDLRQ
TLLAELTAMV HDAARTGEPI LRSLAYDDPD DPGTTDQYTL GGDILVAPVL EPGATTRRVR
FPAGCWVAPD RARFDGPDVR SIPVTLTSVP WYRRA