Gene Information |
Locus tag | Namu_1590 |
Symbol | |
ID | 8447188 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 1751500 |
End bp | 1752348 |
Gene Length | 849 bp |
Protein Length | 282 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645040717 |
Product | extracellular solute-binding protein family 3 |
Protein accession | YP_003200974 |
Protein GI | 258651818 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
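
The coordinate and length fields above are consistent with each other. The following minimal sketch shows the arithmetic, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single untranslated stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1751500, 1752348)
print(length)                  # 849
print(protein_length(length))  # 282
```

Both results match the Gene Length (849 bp) and Protein Length (282 aa) fields.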
Plasmid Coverage Information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.000563013 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.528565 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCATT CCCGTCGTGG CCTGGTCGGG GCCGTGCTGG CCCTGACCCT GGTCGGGTTG GCCGCCTGCA GTTCGGGCTC CGGCTCGGTC GGCACGCCCA GCACCGTCTC CGGCGCCCAG GAGGGCGGCA AGCTCACCAT CGGCATCTCG TTCGACCAGC CCGGGCTGGG GTTCAAGGAC GGTGAGACCT ACCGGGGCTT CGACGTCGAC ACGGCGACCT ACGTGGCCGC CGCGCTCGGC GTGCCGAGCC AGAACATCAC CTGGGTGCAG GCCGATCCGA GCGAGCGGGA GAAGCTGCTG GAAAGCGGTG ACGTGGATCT GGTGTTCTCC AGCTACTCGA TCACCGATCA GCGCAAGCAG GTGGTCGACT TCGCCGGCCC GTACTTCGTC GCCCATCAGG ACCTGCTGGT CCGGCGCAAC GAAACCGACA TCACCGGGCC GGACACGTTG GACGGTCGGG TGCTGTGCTC GGTGACCGGG ACGACGTCCT CGCAGTACAT CAAGGACAAC TACCTGGGCC GGATCACCCT GACCGAGTAC CCGCGGTTCT CCGACTGCGT GGCCGCCCTG GCCAACAGCG AGGTGGACGC GGTGAGCACC GACGACGTCA TCCTGGCCGG GTTCGCCGCG CAGGATCAGT ACAAGGGCAA GCTCAAGCTG GTGGGCAACG GGTTCACCGA CGAGCGGTAC GGCGTCGGCA TCCCCAAGGG TGACGACGCG CGCGTCGCCC AGGTCAACCA GGCGCTGGCC CAGTACATCG CGGACGGCTC CTGGCGGGCG TCGCTGGACG CCACCGTGGG CCCGTCCGGG TACGCGATCC CGGACCCGCC GACGCCCGGG TCCGCCTGA
|
Protein sequence | MRHSRRGLVG AVLALTLVGL AACSSGSGSV GTPSTVSGAQ EGGKLTIGIS FDQPGLGFKD GETYRGFDVD TATYVAAALG VPSQNITWVQ ADPSEREKLL ESGDVDLVFS SYSITDQRKQ VVDFAGPYFV AHQDLLVRRN ETDITGPDTL DGRVLCSVTG TTSSQYIKDN YLGRITLTEY PRFSDCVAAL ANSEVDAVST DDVILAGFAA QDQYKGKLKL VGNGFTDERY GVGIPKGDDA RVAQVNQALA QYIADGSWRA SLDATVGPSG YAIPDPPTPG SA
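
The gene and protein sequences above are related by translation table 11, and the GC content field can be recomputed from the gene sequence. The sketch below is one way to do this in plain Python, under stated assumptions: the sequences are stored as space-separated 10-character blocks as shown in this record, and the codon-to-amino-acid assignments of table 11 match those of the standard code (the two tables differ only in permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `gc_content`, and `translate` are illustrative.

```python
# Standard NCBI base and amino acid ordering. Table 11 is assumed to share
# these codon assignments with the standard code (table 1).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks and last nine bases of the gene sequence in this record.
prefix = "ATGCGCCATT CCCGTCGTGG CCTGGTCGGG"
suffix = "TCCGCCTGA"
print(translate(prefix))  # MRHSRRGLVG
print(translate(suffix))  # SA
print(f"{gc_content(prefix):.0%}")
```

The translated prefix and suffix agree with the first and last residues of the protein sequence shown above. Passing the full gene sequence to `gc_content` gives a value to compare against the GC content field.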