Gene Namu_0187 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_0187 
Symbol 
ID8445767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp209480 
End bp211807 
Gene Length2328 bp 
Protein Length775 aa 
Translation table11 
GC content70% 
IMG OID645039334 
Productalpha-xylosidase YicI 
Protein accessionYP_003199609 
Protein GI258650453 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones50 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATTCA GCGACGGCTA CTGGATGGTG CGCGAGGGCT TCCAGGTCCG CACCGCCCGC 
CAGGTCCGCG ACGTCGAGGC GACCCCCACC GAGCTGTCGG TGCTGGCCGC GACCCGGACG
ATCAACAGCC GGGGCGACAC CCTGAACACC CCGACGGCGA CCATCACCCT GTCCACCCCG
GCCGAGGGGG TGATCCGGGT GCGCATCGAA CACCACGCCG GCGCGCGGGA CCCGGGCCCG
GAGTTCGCCC TGCCGGGTGC CACCGAGCGC CGGCCGCAGA TCCGGATCGA GGCCGATCAC
GCGAGCCTGG ACGCGGGCGG CCTGACCGCC CGGATCCGGC GCGACGGTCC GCTGCGGCTG
GACTTCGAGG CCGGCGGCCG GGTGCTCACC GGCGCCGACG CGACCAGTGT CGGCCTGGCC
ACCGGACCCG ACGGCACGCC CTACGTATTC GCCCAGTTGG GCCTGGACGT CGGCGAGGTG
GTCTACGGGC TCGGTGAACG CTTCGGCCCG TTGGCCAAGA ACGGCCAGAG CGTGGACATC
TGGAACGCCG ACGGGGGAAC CAGCAGCGAG CAGGCCTACA AGAACGTGCC GTTCTTCTGG
ACGAACCGGG GTTACGGCGT CCTGGTCAAC CACCCCGAGC TGGTCTCGTT CGAGATCGGG
TCCGAAGTGG TGTCCCGTAC CCAGTTCTCG GTCACCGGCG AGCATCTGGA GTACCTGGTC
ATCCTCGGTC CGACGCCCAA GGAGATCCTG CGCCGCTACA CCGCGCTGAC CGGCCGCGCC
CCCCGAGTCC CGGAGTGGAC CTTCGGCCTG TGGCTGAGCA CCTCGTTCAC CACCGCCTAC
GACGAGCGGA CCGTGAGCTC GTTCATCGAC GGCATGGCCG AGCGCGAGAT CCCGCTGAGC
GTCTTTCACT TCGACTGCTT CTGGATGCGC GAGTTCCGCT GGTGCGACTT CGAGTGGGAC
CCGCGAACCT TCCCCGAACC GGAGGCGATG CTGGCCCGGC TCAAGGCCCG CGGGCTGCGG
GTCTGCGTGT GGATCAACCC GTACATCGCG CAACGGTCGG CCCTGTTCGA GGAGGGTCGG
GCCAAGGGGT ACCTGGTCAC CCGGGCCGAC GGCAGCCTGT GGCAGTGGGA CCTGTGGCAG
GCCGGCATGG CCCTGGTCGA CTTCACCAAC CCGGCCGCCG CGCAGTGGTA TGCCGGGCAT
CTGGAGCGGC TGCTCGATCA GGGGGTGGAC GCGTTCAAGA CCGACTTCGG CGAGCGCATC
CCCACCGATG TGGTCTGGTT CGACGGCTCG GACCCGGACC GGATGCACAA CTACTACACC
CACCTGTACA ACCGCACCGT CTTCGAATTG CTGGAGCGGC GTCGGGGCCG GGGCGAGGCG
GTCCTGTTTG CCCGGTCGGC CACCGTCGGC GGCCAGCAGT ACCCGGTGCA CTGGGGTGGG
GACTGCGATT CCACCTATGC CTCGATGGCC GAGACGCTGC GCGGTGGTCT GTCCCTGGCC
GCCTCCGGGT TCGGCTACTG GTCGCACGAC ATCGGTGGGT TCGAGGGCAC CCCCGACGCC
GGGGTATTCA AGCGGTGGCT GGCGTTCGGG CTGCTGTCCA GCCACAGCCG GCTGCACGGC
TCGGATTCCT ACCGGGTGCC CTGGGCGTTC GACGAGGAGG CCGTCCAGGT CGCGCGCCGG
TTCACCCGGC TGAAGATGAC GCTAATGCCG TACCTGCTCG GCGCGGCCCG CCAGGTCACC
GAGGAGGGAA CCCCGATGAT CCGGCCGATG GTCATGGAGT TCCCCGACGA CCCGGCCACC
GAGTACCTGA GCACCCAGTA CATGCTCGGC GACGCGTTGC TGGTCGCCCC CGTGTTCCAC
CCCGACGGCG ACGTGCGCTA CTACGTGCCC GCGGGTACCT GGACCGGTCT GCTGGACGGG
CGCACCGTGG TCGGCCCGCG CTGGGTGCAC GAGCGGCACG GCTTCGACAG CCTTCCCCTG
CTGGTCCGGC CGGGTTCGGT GATTCCGATC GGCGCGCGGT CCGACGGTCC GGAGTACGAC
TACGCCGACG GCGTCGCGCT GCACCTGTTC GACCCGGCGG CGCTGACCGA CCGCACCGTC
CGGGTGCCCA CCGCCGGCGG GGCCGCGGTG GAGTTCCGGA TCCGCCGGGA GGGTCGCCAG
CTGACCGTCT TCGGCCCGGA TCCGGCCGAA CACTCGTGGT CGGTGGTGTG TGCGGCGGCC
GAAAAGGCCG ATGGAAGCGC TTCCACAAGG GCAGCGGAGT CGGCTAGCGT CTGCATGACG
CTGCCCGGCC CGGTGGCCGA GCCACTGCCG CCTCGGAAGG ATTCCTGA
 
Protein sequence
MRFSDGYWMV REGFQVRTAR QVRDVEATPT ELSVLAATRT INSRGDTLNT PTATITLSTP 
AEGVIRVRIE HHAGARDPGP EFALPGATER RPQIRIEADH ASLDAGGLTA RIRRDGPLRL
DFEAGGRVLT GADATSVGLA TGPDGTPYVF AQLGLDVGEV VYGLGERFGP LAKNGQSVDI
WNADGGTSSE QAYKNVPFFW TNRGYGVLVN HPELVSFEIG SEVVSRTQFS VTGEHLEYLV
ILGPTPKEIL RRYTALTGRA PRVPEWTFGL WLSTSFTTAY DERTVSSFID GMAEREIPLS
VFHFDCFWMR EFRWCDFEWD PRTFPEPEAM LARLKARGLR VCVWINPYIA QRSALFEEGR
AKGYLVTRAD GSLWQWDLWQ AGMALVDFTN PAAAQWYAGH LERLLDQGVD AFKTDFGERI
PTDVVWFDGS DPDRMHNYYT HLYNRTVFEL LERRRGRGEA VLFARSATVG GQQYPVHWGG
DCDSTYASMA ETLRGGLSLA ASGFGYWSHD IGGFEGTPDA GVFKRWLAFG LLSSHSRLHG
SDSYRVPWAF DEEAVQVARR FTRLKMTLMP YLLGAARQVT EEGTPMIRPM VMEFPDDPAT
EYLSTQYMLG DALLVAPVFH PDGDVRYYVP AGTWTGLLDG RTVVGPRWVH ERHGFDSLPL
LVRPGSVIPI GARSDGPEYD YADGVALHLF DPAALTDRTV RVPTAGGAAV EFRIRREGRQ
LTVFGPDPAE HSWSVVCAAA EKADGSASTR AAESASVCMT LPGPVAEPLP PRKDS