Gene Namu_3847 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3847 
Symbol 
ID8449466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4218328 
End bp4219593 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content68% 
IMG OID645042896 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003203132 
Protein GI258653976 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.00585532 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.205792 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGTTGG CGTTCTCCCT GGCCGCCTGC GGCAAGGGAG CGAGCAGCAG CAGCGACGCG 
CAGACCGACG CCAGCGGCAC GACCACGTTG AAGATGTGGA CGCACAACGC GGGCAACGAC
ACGGAACTCG CCGCGATCAA CCAGGTCGTC GCGGACTACA ACGCCAGCCA GAGCAACTAC
AAGGTCGAGG TCCAGGCGTT CCCGCAGGAC TCCTACAACA CCTCGGTGAC GGCGGCGGCC
GCCTCCAAGA GCCTGCCCTG CATCCTGGAC GTGGACGGCC CGAACGTGCC GAACTGGGCC
TGGGCCGGGT ACCTGGCCCC GCTGGACGGC CTGGACGAGC GGATCGCCCA GTTCCTGCCC
AGCGTGGTCG GCAGCTTCGA CGGCAAGAAT TACGCCGTCG GCTACTACGA CGTGGCGCTG
ATCATGCAGG CCCGGACGTC GGCCTTGCAG GAGAACGGGA TCCGCATCCC GACCATCGAC
CAGCCCTGGA CCGAGGACGA GTTCGCGGCC GCGCTGGCCG CGATCAAGGC CAGCGGCAAG
TACGAGAACA CGCTGGACCT GCAGACCGGC AACACCGGTG AGTGGTGGCC GTACGCGTAC
TCGCCGATGC TGCAGAGCTT CGGGGGCGAC CTGATCAACC GGGACGGCTA CACCAGCGCC
GACGGGGTGC TCAACGGTCC GGCCGCGGTG CAGTGGGCCA CCTGGTTCCG CTCGCTGGCC
ACCGACGGCT ACATGCCGCT CAAGTCGGGC GCCGATCCGG CCCAGGACTT CCTCAACGGC
AAGACCGCGA TCCTGTACAA CGGCTCGTGG GGCGCCGAAC CCGCGCGGGC GTCGGCCATC
GCCGACGACG TCTCCTTCCT GCCGGCGGTC AATCTCGGCC AGGGAGCCAA GATCGGCGGC
GGATCCTGGC AGTGGGCGGT CAGTTCCGGC TGCCCGTCGA CCGAGGGCGC GCTGGACTAC
ATGAAGTTCG CGCTGCAGGA CAAGTACGTC GCCGCGGTGT CCAAGGCGAC CGGGACGATC
CCGGCCACCG ACGCCGCCGC GGCCATGGTG CCCGGCTACG AACCGGGTGG GGACAACGAC
ATCTTCCGTC AGTACTCCAA GGAGTTCGCC CTGATCCGGC CGGCGACCCC GGGCTACCCG
TTCATCGCGA CCACCTTCAC CAAGACCGCC CAGGACATCC TCAACGGCGC CGACCCGCAG
GAAGCGCTGA ACCAGGCGGT CGCCGACATC GACGCGAACC AGCAGTCCAA CAACAACTTC
CAGTAG
 
Protein sequence
MALAFSLAAC GKGASSSSDA QTDASGTTTL KMWTHNAGND TELAAINQVV ADYNASQSNY 
KVEVQAFPQD SYNTSVTAAA ASKSLPCILD VDGPNVPNWA WAGYLAPLDG LDERIAQFLP
SVVGSFDGKN YAVGYYDVAL IMQARTSALQ ENGIRIPTID QPWTEDEFAA ALAAIKASGK
YENTLDLQTG NTGEWWPYAY SPMLQSFGGD LINRDGYTSA DGVLNGPAAV QWATWFRSLA
TDGYMPLKSG ADPAQDFLNG KTAILYNGSW GAEPARASAI ADDVSFLPAV NLGQGAKIGG
GSWQWAVSSG CPSTEGALDY MKFALQDKYV AAVSKATGTI PATDAAAAMV PGYEPGGDND
IFRQYSKEFA LIRPATPGYP FIATTFTKTA QDILNGADPQ EALNQAVADI DANQQSNNNF
Q