Gene Namu_1712 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1712 
Symbol 
ID8447314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1874680 
End bp1876281 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content73% 
IMG OID645040838 
Productglycoside hydrolase family 31 
Protein accessionYP_003201091 
Protein GI258651935 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.000051017 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0363963 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGC GCACCGCCCG CCCGCCGGCC CTGTCCCTTG GAGTGCTGTC CCTGGAGCTG 
CCGGCCGGGG AACGCTGGTG GGGCGGCGCC GTGCCGGACG GCGGCGTCAT GCCGTTCGGC
GATCGGCCGC ACCACCGCGA TCTGGCGGTG AACGCGGGCC TGCTCGACGA CCCGACCGGC
GGGGCCAACC AGTCCGCTCC CCTGCTGCTG TCCAACCGCG GTCGTTACGT CTGGTCCGAC
CGGCCGTTCG CCTTCACCGT GACCGGCGGC CGGCTGCAGG TGCAGGGCCC CGATCTGGTG
GTCGGACAGG GCGCGGAGCC GACCCTGGCC GCGGCCTTCC GGGCGGCGTC GGCGGCCCAC
TTCCCGCCCG CCGGCCGGGC CCCGGCCCGG GCGATGTTCA CCGGTCCGCA GTACAACACC
TGGATCGAGA TGCCGTTCGC GCCCACCCAG CGCAAGGTGC TCGACTATGT CCGCGGCATG
CTCGATGCCG GACTGCCGCC CGGTCCGGTG ATGATCGACG ACCTGTGGGC GCAGGACTAC
GGCAGCTGGC AGTTCGACCG GAGCGCCTTC CCCGACCCCA CCGCGATGGT TGCCCAGCTG
CACGACTGGG GCTGCCCGGT GCTGCTCTGG GTGGTCCCGT TCATCAGCCC GGACAGCCGC
GCCTTCCGGG CGGTGCGGCC GCGCGGCCTG CTCATCCGGC GGCGGAGCGG CCGGATCGCC
ATCCGCGAGT GGTGGAACGG GTTCAGCGCG ATGCTCGACC TGACCAACCC GGCGGCCGTG
GACTGGTTGT GCGGGGAGCT GGACACGCTG CGCGACGAGC ACGGCATCGA CGGGTTCAAG
TTCGACGCGG GCGACGTGCG CGACTACCGC AGCGACGACG TCACCCTCGG TGGAGCCGAG
CCGGTCGACC TGTGCCAGGC CTGGGCCGAA CTGGGACAGC GCTACGCCTA CAACGAGTTC
CGGGCCTGCT GGCGCTCCGG CGGTGCCCCG CTGGCCCAGC GGCTGCACGA CAAGCCGGCC
ACCTGGGACG AGCGGGGGCT GGGCTCGCTC ATCCCCGAGT CGATCGCCCA GGGCCTGATC
GGCCATCCCT TCAGCTGCCC GGACATGATC GGCGGCGGCG ATCTGTGGGG GGCCGAGGAC
GAGGTCGACC AGGAGCTGTT CGTCCGCTAC GCCCAGGTCG CCGCGCTGCA CCCGATGATG
CAGTTCTCCC GGGCCCCCCA GCGCGTGCTC GACGCCGAGC ACCTGGCCAT CGTCCGGGCC
GCCGTCGCCC TGCGCGAGCG GCTGCTGCCC GACCTGCTGG CCCTGGTCGA CCACGCGGCC
CGCACCGGCG AGCCGATCCT GCGGTCACTG GCCTGGCTGG ATCCGGACGA TCCCGGTCCG
GCCGATCAGT TCCTGCTCGG TTCCGACCTG CTCGTCGCCC CGGTGCTCGA ACCCGCGTCG
CCGCATTCGG TTCCGACCCG CGGTCAGACC CGCAGCATCC GGCTGCCGCC GGGTCGTTGG
CAGGCGCAAT GGGACGGCGC CGATCGCACC GTGCTGGCCG GGCCGACCCG GATCACCGTG
CCGGTGGACC TGAGCACGCT GCCCTGGTTC CGCCGGGTCT GA
 
Protein sequence
MTERTARPPA LSLGVLSLEL PAGERWWGGA VPDGGVMPFG DRPHHRDLAV NAGLLDDPTG 
GANQSAPLLL SNRGRYVWSD RPFAFTVTGG RLQVQGPDLV VGQGAEPTLA AAFRAASAAH
FPPAGRAPAR AMFTGPQYNT WIEMPFAPTQ RKVLDYVRGM LDAGLPPGPV MIDDLWAQDY
GSWQFDRSAF PDPTAMVAQL HDWGCPVLLW VVPFISPDSR AFRAVRPRGL LIRRRSGRIA
IREWWNGFSA MLDLTNPAAV DWLCGELDTL RDEHGIDGFK FDAGDVRDYR SDDVTLGGAE
PVDLCQAWAE LGQRYAYNEF RACWRSGGAP LAQRLHDKPA TWDERGLGSL IPESIAQGLI
GHPFSCPDMI GGGDLWGAED EVDQELFVRY AQVAALHPMM QFSRAPQRVL DAEHLAIVRA
AVALRERLLP DLLALVDHAA RTGEPILRSL AWLDPDDPGP ADQFLLGSDL LVAPVLEPAS
PHSVPTRGQT RSIRLPPGRW QAQWDGADRT VLAGPTRITV PVDLSTLPWF RRV