Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_1420 |
Symbol | |
ID | 4597327 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 1504169 |
End bp | 1505443 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639776018 |
Product | extracellular solute-binding protein |
Protein accession | YP_922621 |
Protein GI | 119715656 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.979516 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCGTT TCCCGTTTGG TCGCCGTGGG GGGCGGACGC GCCTGATCGC GGCGGCTGCG CTGGCGGTCG CCAGCGTGGC CACCGTCGCA GGATGCGGCG CCGGCTCCGG AGCCAGTGCC GGTGACGAGG TCAAGCTGCG CTTCCTGTTC CCGGAGTACA GCGCGAAGAC CGCCCCGCTG ATGCAGGCGA TGGTCGACGA CTTCAACCAG GCCAACGACG GCAAGATCGA GGTGAGCCTC GAGATCGCCC CCTGGGACAA GATGCACGAC AAGCTCGCGG TCTCCATGGG TTCGGGGCAG GCGCCCGACG TGTTCGGCTA TGCCACCCGC TGGATCTCCG AGTTCGCGGG CCTCGACCAG CTCGCACCCC TCGACGAGCA CCTCAGCGAG GACTTCAAGA GCACGTTCAA CCAGAAGGTG CTCGAGGCCG GCACGTACGA CGGGAAGACC TACGGCCTTC CGGCCGCCGT CTCGGCGCGG CTGCTGTTCT ACCGCGCGGA CGTGTTCGAG GAGGCCGGCC TGCAGGCGCC CCAGACCTGG GACGACCTGA TGGAGGCCGC GACGACCACC GGGCAGCCGC CGGAGCGCTA CGGCCTCGGC GTACCCGCCA GCGGGATCGA GGTCGACACG TTCTTCAACT ACTTCTTGTA CAACAACGGG GGGGACATCC TCGACGAGAA CGGCAAGTCG ATGCTCAGTG CCCCGGAGAG CGTCGAGGCC CTGCAGTACC TCACCGACCT GGTCAAGGCC GGGGGCAGTG AGCCGAAGCC CACCGGCTTC ACCCGCGAGC AGGTCATCGA GAACTTCAAG GCCGGCCAGC TGTCCATGTA TCCCACCGGT CCGTGGCTCA ACGCCATGAT CGAGGCCGAC AACCCGGACC TCGAGTACTC CGCGGTGCCG TTCCCGACCA ACGACGGCAA GCCACAGCAG ACCGTCTCCG TGACCGACTC GCTCGGTCTG TCGGCGAACA CCGAGCACCC CGACGAGGCG TGGAAGTTCG TCGAGTTCAT GTACCAGACG AAGTACCGCC AGGCCTTCGA CGAGGGCGAG GGCATGCTCC CGGAGCTGAT CGCCGTGAGC CAGTCGGACT ACTTCCAGAG CCCCGAGTAC AAGCCGTTCG TCGACGCCCT CGACACTGCC AAGTTCCAGC CGCAGCACCC CAAGTTCGAG CAGATCCAGC AGATCGAGAC CGTCGCCGTG CAGAAGGCGC TCAGCGGCCA GGCCACGCCC CAGGAGGCGC TGGACGAGGC CACCGAGCAG ATCGACAAGC TCTGA
|
Protein sequence | MMRFPFGRRG GRTRLIAAAA LAVASVATVA GCGAGSGASA GDEVKLRFLF PEYSAKTAPL MQAMVDDFNQ ANDGKIEVSL EIAPWDKMHD KLAVSMGSGQ APDVFGYATR WISEFAGLDQ LAPLDEHLSE DFKSTFNQKV LEAGTYDGKT YGLPAAVSAR LLFYRADVFE EAGLQAPQTW DDLMEAATTT GQPPERYGLG VPASGIEVDT FFNYFLYNNG GDILDENGKS MLSAPESVEA LQYLTDLVKA GGSEPKPTGF TREQVIENFK AGQLSMYPTG PWLNAMIEAD NPDLEYSAVP FPTNDGKPQQ TVSVTDSLGL SANTEHPDEA WKFVEFMYQT KYRQAFDEGE GMLPELIAVS QSDYFQSPEY KPFVDALDTA KFQPQHPKFE QIQQIETVAV QKALSGQATP QEALDEATEQ IDKL
|
| |