Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_0514 |
Symbol | |
ID | 8446097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 572390 |
End bp | 573670 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645039650 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003199922 |
Protein GI | 258650766 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 50 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCCGAA AGAAAGCAAT CACGGCCGGT TTGCTGGCCG GGCTGGTCGT GCTCAGCGCC TGCGGACGGA GCAGCGACAC CGCGGGTGCG GCCGGGACCT CCGCGCCCGC CGCCAGCATC TCCGCCGGCC CGGCCACCGG CAAGCTGACC ATGTGGGCGC AGGGCGCCGA AGGCCAGGAT CTGCCCGCCC TGCTCGACGA GTTCGAGGCC GCCAACCCCG GCGTCACCGT CGACGTCACC GCGATCCCCT GGGACGCGGC GCACAACAAG TACCAGACGG CCATCGCCGG CGGTCAGACG CCGGACATCG CACAGATGGG CACCACCTGG ATGGGCGACT TCGCCGACGC GTTCGATCCG ACCCCCGCCG AGCTCACCGA CGCCGGGTTC TTCCCCGGTT CGGTCAACTC GACCGAGGTC GACGGCACCG CAGTCGGTGT GCCCTGGTAC GTCGACACCC GGGTCGTCTT CTACCGCAAG GACCTGGCCG AGAAGGCCGG GTACACCACC TTTCCGACCA ACTACGACGA CTTCAAGGCG ATGGCCAAGG CCCTGCAGGA CAAGGCCGGC GCGCAGTGGG GCATCCAGCT CCTGGCCGGT GGCACGGATT CCTTCCAGAG CACCCTGCCG TTCGGCTGGT CGGCCGGCGC CTCGCTGATG GACAGTGGCA ATGACGCCTG GACCCTGGAT TCCCCGCAGT GGGTCGATGC GCTGACCTAC TACCAGAGCT TCTTCACCGA GGGCATCGCC AACCCGGCGC CGAACATGGG GGCCGGCGCC GCGGAATCGG CGTTCGTCGA CGGGTCCGCG CCGATGATGA TCTCCGGTCC CTACGAGATC GGCAATCTGG AGAAGGCCGG CGGGGCCGAC TTCACCGACA AGTACGCCGT GGCCACGCTG CCCAAGGACA AGTCCGCCAC CTCCTTCGTC GGCGGCTCCA ACCTGGTGGT CTTCAAGGAC AGCCCCAACC GGGACGCCGC CTGGAAGCTC GTGCAGTGGC TCTCACAGCC CGAGGTCCAG GTGAAGTGGT ACCAGGCCAC CGGTGACCTG CCCTCGGTGC AGAGCGCCTG GCAGGAGGGC GTGCTCGCCG ACGACCCGAT GCTCTCGGTG TTCGGCGACC AGCTCAAGGA CACCAATTCC CCGCCGGCGG TCCCGACCTG GACCCAGGTC AGCGCCGCCG CCGACAGCCA GGTCGAGCAG ATCGTCAAGG CCGGCAAGGA TCCCGCGCAG GCCCTGCAGG AACTGCAGTC GCAGGCCGCC TCGATCGGTA TCGGTCGCTG A
|
Protein sequence | MFRKKAITAG LLAGLVVLSA CGRSSDTAGA AGTSAPAASI SAGPATGKLT MWAQGAEGQD LPALLDEFEA ANPGVTVDVT AIPWDAAHNK YQTAIAGGQT PDIAQMGTTW MGDFADAFDP TPAELTDAGF FPGSVNSTEV DGTAVGVPWY VDTRVVFYRK DLAEKAGYTT FPTNYDDFKA MAKALQDKAG AQWGIQLLAG GTDSFQSTLP FGWSAGASLM DSGNDAWTLD SPQWVDALTY YQSFFTEGIA NPAPNMGAGA AESAFVDGSA PMMISGPYEI GNLEKAGGAD FTDKYAVATL PKDKSATSFV GGSNLVVFKD SPNRDAAWKL VQWLSQPEVQ VKWYQATGDL PSVQSAWQEG VLADDPMLSV FGDQLKDTNS PPAVPTWTQV SAAADSQVEQ IVKAGKDPAQ ALQELQSQAA SIGIGR
|
| |