Gene Information |
Locus tag | Noca_1937 |
Symbol | |
ID | 4599842 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 2066833 |
End bp | 2067768 |
Gene Length | 936 bp |
Protein Length | 311 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639776535 |
Product | periplasmic solute binding protein |
Protein accession | YP_923134 |
Protein GI | 119716169 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.456944 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTCTCA TGAAGAGACT TGCCGCCACC ATCCCCGTCA CGCTCGCGCT CACCGGGCTC ACTCTCACCG GGTGCGCGGC GTTCACCGAC GACGGCGGCG GCCCCGCCTC CGCTGCCGGC AAGGGGGCCG GCGGGGTCCG GGTCGCGGCG GCGTTCTACC CGCTCGCGTA CGTCGCCGAC CGGGTCGGCG GTGACCACGT CGACGTCACG AACCTCACCG CACCGGGCGG CGAGCCGCAC GACCTCGAGC CGTCGGTCGC GGTGACCGCC GAGGTCGCCG GGGCCGGGCT CGTCGTCTAC GAGCGCGGCT TCCAGCCGGC GGTCGACGCC GCGGTGGACG AGAACGCGGC CGGCGACGTC CTGGACGCGG CCGGCGTCGT CGACCTGGTG CCGTTCCGCG AGCACGGAGT CGACTCCGAC GAGACCGACC CGCACTTCTG GCTCGACCCG CTGCTGCTCG CCGACGTCGC CGACGCCGTC GCCGACCGGC TCGAACAGGC CGACCCGGAC CACGCCGCGG ACTACCGCGC GAACGCCGCC GACCTGCGCG GCGACCTCGA GGGACTCGAC CAGGAGTACG CCGACGGCCT CGCGAGCTGC ACCCGCACCA CCGTCGTCGT CAGCCACGAC GCGTTCGGCT ACCTGCAGCG CTACGGCGTC GAGATGGAGG CGATCCTCGG CCTCTCCCCC GAGGCCGAGC CGACCCCGGC CGACCTGGCC CGGCTGCAGG CCCTGATCCG CGCGGACGGG GTCACCACCG TCTTCTCCGA GTCGATCGTC AGCGCCAAGG CGGCCGAGGC CCTCGCCCGG GAGACCGGCG CGGAGAGCGA CGTGCTCGAC CCGATCGAGG GGCTGACCGA CCGCACCGCC GACGAGGACT ACCTTTCCCT CATGCGCGCC AACCTCGCCG CCCTCGAGAA GGCGAACGAC TGTTGA
|
Protein sequence | MILMKRLAAT IPVTLALTGL TLTGCAAFTD DGGGPASAAG KGAGGVRVAA AFYPLAYVAD RVGGDHVDVT NLTAPGGEPH DLEPSVAVTA EVAGAGLVVY ERGFQPAVDA AVDENAAGDV LDAAGVVDLV PFREHGVDSD ETDPHFWLDP LLLADVADAV ADRLEQADPD HAADYRANAA DLRGDLEGLD QEYADGLASC TRTTVVVSHD AFGYLQRYGV EMEAILGLSP EAEPTPADLA RLQALIRADG VTTVFSESIV SAKAAEALAR ETGAESDVLD PIEGLTDRTA DEDYLSLMRA NLAALEKAND C
|
| |
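The numeric fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the source database. It assumes the start and end coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Both assumptions agree with the reported values of 936 bp and 311 aa. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among A/C/G/T characters; spaces are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    return sum(b in "GC" for b in bases) / len(bases)


# Values copied from the record above.
start_bp, end_bp = 2066833, 2067768
length = gene_length(start_bp, end_bp)
print(length)                           # 936
print(expected_protein_length(length))  # 311
```

Passing the gene sequence field to `gc_fraction` (the space-separated blocks of ten can be pasted as-is) gives a value to compare against the reported GC content.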