Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_0869 |
Symbol | |
ID | 4599876 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 907548 |
End bp | 908882 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639775470 |
Product | extracellular solute-binding protein |
Protein accession | YP_922079 |
Protein GI | 119715114 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.509231 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCGCA CATCACCCGT CTCGGGCTCG CTCGCCGGGA TCACCGCCCC CCGGCCGACG CGTCGTGGCC TGCTCGTGGG CGGAGGTGCG CTCGGCCTGT CCGGCCTGCT GGTCGCGTGC GGGGTCGGCG GTGGCGGCGG CGAGAAGGGC GGTGGTGGCA GCGGCAGCGG CAGCATCCGC GCCCTGTTCA TGCAGCAGGC GGGCTACTCC GAGGACAACA TCAAGGAGAT GACGGCGGCC TTCATGAAGG CCAACACCGA CATCAAGGTC ACCGCCGACT TCGTCTCCTA CGAGGCGCTG CACGACAAGA TCGTGGCGGC CGCGCCGGCG GGCACCTATG ACGTCGTGCT GATCGACGTG ATCTGGCCGG CCGAGTTCGG CACCAAGAAC ATCGTCGCCG ACGTCACCGA CCGGTGGCCC GACGAGTGGA AGCAGCAGAT GCTCGGCGGC GCGGTCGCGA CGCCGCAGTA CGACGGCAAG TTCTACGGGG TCCCGTGGAT CCTGGACACC AAGTATCTCT TCTACAACAC CGCCCAGCTC GAGAAGGCGA AGGTCGACGC CGGCGAGCTC GACACCTGGG ACGGCGTCCT CAGCGCGGCC CGCGCGCTCA AGCAGAGCGG TGTCCAGTAC CCGCTGATCT GGTCCTGGCA GCAGGCGGAG GCCTTGATCT GCGACTACAC CCAGCTCCTC GGTGCCTTCG GCGGAACCTT CCTCGACGAC GCGGGCCAGC CCGCGTTCAA CCAGGGAGGC GGCGTCGCTG CGCTGGAGTT CATGCGGCAG AGCATCGTCG ACGGGCTCAC CAACCCCGCC TCGACGCAGT CGCTCGAGGA GGACGTGCGG CGCGTGTTCT CCTCCGGTCA GGCCAGCATC GCCCTGAACT GGACCTACAT GTACGGCCTC GCCAACGACC CCAAGGAGAG CCAGATCCCC GGCGACGTCG CGGTGCTGCA GACCCCGAGC GGCCCGGTCG GCCGCCCCGG CGTGAACGGC AGCATGGCGC TCTCCCTCTC CGCGACCAGT GAGAACCAGG ATGCCGGCTG GAAGTACATC GAGTACCTCA CCAGCCAGCC GGTCCAGGAC AAGTACGCCC TCAGCTCGCT GCCCGTGTGG TCGTCGTCGT ACGACGACCC CAAGGTCGTC GACACGAACC CCGCCGTCGT GCCGCAGGCC AAGAAGCAGC TCGGCGACAT GATCCTGCGG CCCCAGGTCG CCAGCTACAA CGCGATGTCC CAGGTGCTCC AGGCCGAGAT CCAGAAGGCC CTGCTCGGTG ACAAGGAGCC GCAGCAGGCG CTGGACGACG CAGCCTCCCA GGCGGCCGAC CTGCTGGAGT CCTGA
|
Protein sequence | MKRTSPVSGS LAGITAPRPT RRGLLVGGGA LGLSGLLVAC GVGGGGGEKG GGGSGSGSIR ALFMQQAGYS EDNIKEMTAA FMKANTDIKV TADFVSYEAL HDKIVAAAPA GTYDVVLIDV IWPAEFGTKN IVADVTDRWP DEWKQQMLGG AVATPQYDGK FYGVPWILDT KYLFYNTAQL EKAKVDAGEL DTWDGVLSAA RALKQSGVQY PLIWSWQQAE ALICDYTQLL GAFGGTFLDD AGQPAFNQGG GVAALEFMRQ SIVDGLTNPA STQSLEEDVR RVFSSGQASI ALNWTYMYGL ANDPKESQIP GDVAVLQTPS GPVGRPGVNG SMALSLSATS ENQDAGWKYI EYLTSQPVQD KYALSSLPVW SSSYDDPKVV DTNPAVVPQA KKQLGDMILR PQVASYNAMS QVLQAEIQKA LLGDKEPQQA LDDAASQAAD LLES
|
| |