Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_0041 |
Symbol | |
ID | 4600092 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 46643 |
End bp | 48025 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639774656 |
Product | extracellular solute-binding protein |
Protein accession | YP_921278 |
Protein GI | 119714313 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACTAC GGAAGCAGCA CACCAAGGTC TCGGGAGTCA CCAGCGGCGC CTGGTCGTCC GCCGCCGCCT CGCCGTTCTC GCGGCGCGGA CTGCTCAAGG CCGGCGGGCT CGCTGCCGGT GCCGTCGCCG CCGCCCCGCT GCTCTCGGCC TGCGGCCGCG GGTCCGGCAG CGGCTCCTCG GGCAGCAGCT ACAACATGTG GGTCATGTCG GACACCGTGC CGATCATGAA GCACTTCGTG GCCAAGTACC GCGAGAGCGA GGACAGCAAG TTCAACGCGA ACATCACCGA GATCCCCTCG GGCATGTCGC TGCGCGCCAA GATCATCTCC GCCTCGGCTG CCGGCCAGCT GCCCGAGCTG CTCGACGTCT CGATGAACTA CAGCTCGGAC TTCGCCACCT ACAACCTGTT CGAGCCGCTC GACTCGCTGA TCTCGTCCGA CCTGGCGTCG AAGTACTCTC TCTACGACCG CGTCTGGAAG TGGGCCGACA CCGCGAGCAT CCCGGGCTAC GAGGGCGACC AGCAGGTCTT CGGGATCCCC TACGGTGTCT CCGTGTTCGT CCCGACGTAC CGGGCCGACC TCTTCCGGGA GGCCGGTGTG GAGTTCCCGA CGACCTGGGA GGAGCTGGTC ACCGTCGGAC AGGCGCTGAC CCAGGCGCCC CAGCGCTACG CGCTGTCGGT GCCGACCTCG GGCGACCTGA TGGACGAGTT CCACCCCTTC CTGATGCAGG CCGGCGCCCA GTACGTCAAC GACGACCTCA CCGAAGCCTT CCCGAACCGG GAGGCCGCCT ACGACGGTTT CGAGTTCTAT CGCGACCTCT CGGCGCGCTA CAAGATCGCG CCGAAGGAGG CGCCGGACCG CTTCGCCGGC GACCCCGTGC AGCGCCTGAC CTCCGGCCAG GTGGCGGTGA CGACGCTCTC GGTGCTCAGC GTCAACGCGA TGCGCAACGA GGCAACCGAC CTCGAGTTCG GCCCCGACAA GGACTGGTAC ATCTCGAAGT TCTGGTCGGG CTCGGGCGGC CCGGGCGGCT ACTTCAACGC CAACTGCATG CACATCCGCA AGGGCGTCAA GAACGTCGAC GGTGCCATCG GCTTCATGGA GTGGATGCTC GAGCCGGAGC AGCAGGCGGA GATGTACAAG TCGTTCAACC GCCCCCCGAT GAACACGACG GTGTGGGACG GCGAGCTGGG CGATGACCCG GAGTTCCAGA TCTACCGCGA GTCGATCGAG CTCAGCGAGC GCCAGGGCGG GTTCCGCGGG TGGAAGCTGG CCGAGTTCAC CATCGACCGA GGAGTCGAGC GTGTCGTCAT CGACGGCGAG GACGTCAAGT CCGTCGTCGA CTCGACCGCG ACCGACATGA TCCAAGCCCT GCAGAACGCC TGA
|
Protein sequence | MALRKQHTKV SGVTSGAWSS AAASPFSRRG LLKAGGLAAG AVAAAPLLSA CGRGSGSGSS GSSYNMWVMS DTVPIMKHFV AKYRESEDSK FNANITEIPS GMSLRAKIIS ASAAGQLPEL LDVSMNYSSD FATYNLFEPL DSLISSDLAS KYSLYDRVWK WADTASIPGY EGDQQVFGIP YGVSVFVPTY RADLFREAGV EFPTTWEELV TVGQALTQAP QRYALSVPTS GDLMDEFHPF LMQAGAQYVN DDLTEAFPNR EAAYDGFEFY RDLSARYKIA PKEAPDRFAG DPVQRLTSGQ VAVTTLSVLS VNAMRNEATD LEFGPDKDWY ISKFWSGSGG PGGYFNANCM HIRKGVKNVD GAIGFMEWML EPEQQAEMYK SFNRPPMNTT VWDGELGDDP EFQIYRESIE LSERQGGFRG WKLAEFTIDR GVERVVIDGE DVKSVVDSTA TDMIQALQNA
|
| |