Gene Namu_5152 details


Gene Information

Locus tag: Namu_5152
Symbol:
ID: 8450783
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 5740907
End bp: 5742355
Gene Length: 1449 bp
Protein Length: 482 aa
Translation table: 11
GC content: 67%
IMG OID: 645044186
Product: extracellular solute-binding protein family 1
Protein accession: YP_003204410
Protein GI: 258655254
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
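
The coordinate and length fields in this record can be cross-checked against each other. The Python sketch below does that arithmetic. It assumes that the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5740907
end_bp = 5742355
gene_length_bp = 1449
protein_length_aa = 482

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not
# counted in the protein length.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1


def record_is_consistent():
    """Return True if coordinates, gene length and protein length agree."""
    return (
        span_bp == gene_length_bp
        and span_bp % 3 == 0
        and expected_protein_length == protein_length_aa
    )


print(span_bp, codon_count, expected_protein_length, record_is_consistent())
```

Under those assumptions the span is 1449 bp, which is 483 codons, leaving 482 residues once the stop codon is excluded.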


Plasmid Coverage information

Num covering plasmid clones: 50
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGGTCA ACCGAGTGGC CCACAAGGCG ACGAGGAAGC GACTGGCGGC CACGGTGGCC 
GGCGTCGGTG CGATTGCGCT GGTGCTGTCA GCCTGCGGCA GCAGTTCATC AAGCAGTGGC
ACGACCAGCG CGGCCGCCAC GTCCGCGGCC GCGAGCAGCG CGGCGGGTGG CGGCGGTTCC
GCCTCCGCCG GCCCGGTCTC GGGTGACAGC GGCTGGTGCG ACACGGTCAA GTCCAAGTAC
GGCGACCTGA CCGGCAAGAC GGTCGGCATC TACACCACCA TCACCGGCAC CGAGGCCGAC
GCCTACAAGA AGTCCTATCA GCTGTTCACC GAGTGCACCG GGGCCACCGT GGCCTACGAA
GACTCGAAGG ACTTCGAGGC CCAGGTGCTG GTGCGGATCG ACTCCGGCAA CGCGCCGGAC
ATCGCGATCT TCCCGCAGCC GGGTCTGCTG TCCCAGATCG TGAACGACAA GGGAGCGGTC
AAGCCGCTGC CGGAGGACAC CGAGGCTTTC GCCAAGCAGT ACTTCCCCGA CGACTGGATG
AACTACGGCG TCGTCAACGA CATCCCCTTC GGTCTGCCGA ACAACGCCGA CTTCAAGTCG
CTGGTCTGGT ACAACCCGAA GGTCTGGGCG GAGAAGGGCT GGACGGTCCC GACCACCTGG
GACGAGCTGA CCGCGCTGCA GGACAAGATC GCCGCCTCCG GCACCAAGCC CTGGTGTATG
GGTATCGGTT CCGGTGAGGC CACCGGTTGG TACATGACCG ACTGGCTGGA GGAGTACGTC
CTGCGGATGT CCGGGCCGGA CGTCTACGAC CAGTGGGTGA CCAACGAGGT CAAGTTCACC
GATCCGCAGA TCGCGGACGC GCTGGCCGGG GTGGGCGAGA TCGCCAAGAA CCCGCAGATG
GTGAACTCCG GTTTCGGTGA CGTGCAGTCG ATCGCCTCGA CCCAGTTCTC CGACCCGGCG
GCCAAGGTCC TCTCGGGCGA GTGCCCGATG TGGCGCTTCG CGGCCAACGG TGACGCGTTC
TTCCCGGCGG GCACCAAGTT CGGCGAGGAC GGCGACGTCA ACGCCTTCTA CTTCCCGCCG
ATGAACGACA AGTTCGGGCA GACCGTCCTT GGTGGTGGCA CCTTCTACGC CGCCTTCCAG
GATCGTCCCG AGGTCGACGC GTTCATGTAC TTCGCGGCCA GCCCCGAGTA CGCGAACGAG
CGGGCCAAGG CCGGCAGCTA CATCTCGGCC AATAAGGGTC TGGACGGCGC GAACGCGTCG
ACCCCGATCC TGCAGACCGC GCTGAAGAAG CTGCAGGACC CGGAGACGAC CTTCCGGTTC
GACGCCTCCG ACCTGATGCC CGCCCAGGTG GGTTCGGCGG CCGAGTGGAA GCAGTTCACC
GCCTGGATCA CCGGGCAGGA CGACGCGACC ACCCTGGCCA ACATTCAAGC GGCCTGGCCG
TCGAGCTGA
 
Protein sequence
MEVNRVAHKA TRKRLAATVA GVGAIALVLS ACGSSSSSSG TTSAAATSAA ASSAAGGGGS 
ASAGPVSGDS GWCDTVKSKY GDLTGKTVGI YTTITGTEAD AYKKSYQLFT ECTGATVAYE
DSKDFEAQVL VRIDSGNAPD IAIFPQPGLL SQIVNDKGAV KPLPEDTEAF AKQYFPDDWM
NYGVVNDIPF GLPNNADFKS LVWYNPKVWA EKGWTVPTTW DELTALQDKI AASGTKPWCM
GIGSGEATGW YMTDWLEEYV LRMSGPDVYD QWVTNEVKFT DPQIADALAG VGEIAKNPQM
VNSGFGDVQS IASTQFSDPA AKVLSGECPM WRFAANGDAF FPAGTKFGED GDVNAFYFPP
MNDKFGQTVL GGGTFYAAFQ DRPEVDAFMY FAASPEYANE RAKAGSYISA NKGLDGANAS
TPILQTALKK LQDPETTFRF DASDLMPAQV GSAAEWKQFT AWITGQDDAT TLANIQAAWP
SS
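
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal Python sketch. It is not the database's own pipeline. It assumes the codon assignments of translation table 11, which match the standard genetic code for amino acid assignments, and it ignores alternative start codons. The function names `clean`, `translate` and `gc_fraction` are hypothetical helpers introduced here. Only the first three and the last printed blocks of the gene sequence are used as input.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first, second, third base).
# Assumption: amino acid assignments follow the standard code, which
# translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to print 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a nucleotide string codon by codon; '*' marks a stop."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # drop any incomplete trailing codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide string."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three printed blocks and the final block of the gene sequence.
head = "ATGGAGGTCA ACCGAGTGGC CCACAAGGCG"
tail = "TCGAGCTGA"

print(translate(head))
print(translate(tail))
```

The head fragment translates to the first ten residues of the protein sequence, and the tail fragment to the final two residues followed by the stop marker. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.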