Gene Namu_0514 details

Gene Information

Locus tag: Namu_0514
Symbol:
ID: 8446097
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 572390
End bp: 573670
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 69%
IMG OID: 645039650
Product: extracellular solute-binding protein family 1
Protein accession: YP_003199922
Protein GI: 258650766
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
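
The length fields in this record agree with its coordinates if Start bp and End bp are read as 1-based and inclusive. That convention is an assumption, since the record does not state it. A minimal Python sketch of the arithmetic follows; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(572390, 573670)
print(length)                  # 1281
print(protein_length(length))  # 426
```

Both results match the Gene Length and Protein Length fields above.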


Plasmid Coverage information

Num covering plasmid clones: 50
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCCGAA AGAAAGCAAT CACGGCCGGT TTGCTGGCCG GGCTGGTCGT GCTCAGCGCC 
TGCGGACGGA GCAGCGACAC CGCGGGTGCG GCCGGGACCT CCGCGCCCGC CGCCAGCATC
TCCGCCGGCC CGGCCACCGG CAAGCTGACC ATGTGGGCGC AGGGCGCCGA AGGCCAGGAT
CTGCCCGCCC TGCTCGACGA GTTCGAGGCC GCCAACCCCG GCGTCACCGT CGACGTCACC
GCGATCCCCT GGGACGCGGC GCACAACAAG TACCAGACGG CCATCGCCGG CGGTCAGACG
CCGGACATCG CACAGATGGG CACCACCTGG ATGGGCGACT TCGCCGACGC GTTCGATCCG
ACCCCCGCCG AGCTCACCGA CGCCGGGTTC TTCCCCGGTT CGGTCAACTC GACCGAGGTC
GACGGCACCG CAGTCGGTGT GCCCTGGTAC GTCGACACCC GGGTCGTCTT CTACCGCAAG
GACCTGGCCG AGAAGGCCGG GTACACCACC TTTCCGACCA ACTACGACGA CTTCAAGGCG
ATGGCCAAGG CCCTGCAGGA CAAGGCCGGC GCGCAGTGGG GCATCCAGCT CCTGGCCGGT
GGCACGGATT CCTTCCAGAG CACCCTGCCG TTCGGCTGGT CGGCCGGCGC CTCGCTGATG
GACAGTGGCA ATGACGCCTG GACCCTGGAT TCCCCGCAGT GGGTCGATGC GCTGACCTAC
TACCAGAGCT TCTTCACCGA GGGCATCGCC AACCCGGCGC CGAACATGGG GGCCGGCGCC
GCGGAATCGG CGTTCGTCGA CGGGTCCGCG CCGATGATGA TCTCCGGTCC CTACGAGATC
GGCAATCTGG AGAAGGCCGG CGGGGCCGAC TTCACCGACA AGTACGCCGT GGCCACGCTG
CCCAAGGACA AGTCCGCCAC CTCCTTCGTC GGCGGCTCCA ACCTGGTGGT CTTCAAGGAC
AGCCCCAACC GGGACGCCGC CTGGAAGCTC GTGCAGTGGC TCTCACAGCC CGAGGTCCAG
GTGAAGTGGT ACCAGGCCAC CGGTGACCTG CCCTCGGTGC AGAGCGCCTG GCAGGAGGGC
GTGCTCGCCG ACGACCCGAT GCTCTCGGTG TTCGGCGACC AGCTCAAGGA CACCAATTCC
CCGCCGGCGG TCCCGACCTG GACCCAGGTC AGCGCCGCCG CCGACAGCCA GGTCGAGCAG
ATCGTCAAGG CCGGCAAGGA TCCCGCGCAG GCCCTGCAGG AACTGCAGTC GCAGGCCGCC
TCGATCGGTA TCGGTCGCTG A
 
Protein sequence
MFRKKAITAG LLAGLVVLSA CGRSSDTAGA AGTSAPAASI SAGPATGKLT MWAQGAEGQD 
LPALLDEFEA ANPGVTVDVT AIPWDAAHNK YQTAIAGGQT PDIAQMGTTW MGDFADAFDP
TPAELTDAGF FPGSVNSTEV DGTAVGVPWY VDTRVVFYRK DLAEKAGYTT FPTNYDDFKA
MAKALQDKAG AQWGIQLLAG GTDSFQSTLP FGWSAGASLM DSGNDAWTLD SPQWVDALTY
YQSFFTEGIA NPAPNMGAGA AESAFVDGSA PMMISGPYEI GNLEKAGGAD FTDKYAVATL
PKDKSATSFV GGSNLVVFKD SPNRDAAWKL VQWLSQPEVQ VKWYQATGDL PSVQSAWQEG
VLADDPMLSV FGDQLKDTNS PPAVPTWTQV SAAADSQVEQ IVKAGKDPAQ ALQELQSQAA
SIGIGR
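
The sequences above are printed in space-separated groups of 10 characters, so they need cleaning before any analysis. The sketch below, with hypothetical helper names, strips the whitespace and computes a GC fraction. It is shown on the first line of the gene sequence only. Applying it to the full sequence would allow a comparison with the reported 1281 bp and 69% GC content, which is not verified here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in grouped blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string; 0.0 for an empty string."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence as it appears in the record.
first_line = "ATGTTCCGAA AGAAAGCAAT CACGGCCGGT TTGCTGGCCG GGCTGGTCGT GCTCAGCGCC"
dna = clean_sequence(first_line)
print(len(dna))  # 60
print(f"{gc_fraction(dna):.0%}")
```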