Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_0433 |
Symbol | |
ID | 4597745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 462413 |
End bp | 463492 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639775047 |
Product | ABC transporter periplasmic-binding protein |
Protein accession | YP_921662 |
Protein GI | 119714697 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4143] ABC-type thiamine transport system, periplasmic component |
TIGRFAM ID | [TIGR01254] ABC transporter periplasmic binding protein, thiB subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.127437 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGACCCA TCCATGCCCT GGCGCCACTG GCACCGCTCG TCGCGCTCGC CCTGGCGGCG ACCGGCTGCT CGACCATCGG AAACGGCTCG TCCGGCGACG AGTCGACGGC CAGCCCGGGC GGCGAGCCCG GGCGGGTCGT GCTGGTCACC CACGAGTCGT TCTCGCTGCC CAAGCCGCTG ATCAAGCAGT TCGAGCAGCA GTCCGGCTAC GACCTGGTGG TCCGCGCGTC CGGCGACGCC GGGATCCTCA CCAACAAGCT GGTGCTGACC AAGGGCGACC CGCTCGGCGA CGTGGCGTTC GGCATCGACA ACACCTTCGC CTCGCGGGCG CTCGACGAGG GCGTGTTCGC GCCGTACGAC GCTCCGCTGC CGGACGGCGC CGACCAGTAC CGGTTGCCGG GTGACGACGA GCACGACCTC ACGCCGATCG ACAACGCCAG CGTGTGCGTG AACGTCGACG ACACCTGGTT CGCCGACCAC CATCTCGCCC CGCCGAAGAC CCTGGACGAC CTGGCCGCCC CGGAGTACCA GGGCCTGTTC GTCACCCCGG GCGCGGCCAC CAGCTCGCCC GGCCTGGCGT TCCTGCTGAG CACGATCGCG GCGTACGGCG AGGACGGCTG GCAGGACTAC TGGGCGCGGC TGATGGAGAA CGGGACCAAG CTGACGGCCG GCTGGTCCGA CGCCTACGAG GTCGACTTCA CCCAGGGCGG CGGCCAGGGT GACCGGCCGA TCGTGCTGTC CTACGACTCC TCGCCGGCGT TCACCGTCGC CGACGGGGAG TCGTCGACGA GCGCCCTGCT CGACACCTGC TTCCGCCAGG TGGAGTACGC CGGCGTGCTG GACGGTGCCG CGAACCCCGC CGGCGCCGAG CAGCTGGTCG ACTTCCTGCT CTCGCCGGAG GTGCAGGCCG CACTGCCCGA GAGCATGTAC GTGTTCCCGG TCGACTCCTC GGTCCAGCTG CCGAAGGAGT GGGCCCGGTT CGCGAAGCAG CCGAGCGACC CGTTCTCGGT GGACCCCGCG TCGATCGACG AGCACCGCGA CGAGTGGCTG CGCGAGTGGA CCGACGTGAC CTCTCGATGA
|
Protein sequence | MRPIHALAPL APLVALALAA TGCSTIGNGS SGDESTASPG GEPGRVVLVT HESFSLPKPL IKQFEQQSGY DLVVRASGDA GILTNKLVLT KGDPLGDVAF GIDNTFASRA LDEGVFAPYD APLPDGADQY RLPGDDEHDL TPIDNASVCV NVDDTWFADH HLAPPKTLDD LAAPEYQGLF VTPGAATSSP GLAFLLSTIA AYGEDGWQDY WARLMENGTK LTAGWSDAYE VDFTQGGGQG DRPIVLSYDS SPAFTVADGE SSTSALLDTC FRQVEYAGVL DGAANPAGAE QLVDFLLSPE VQAALPESMY VFPVDSSVQL PKEWARFAKQ PSDPFSVDPA SIDEHRDEWL REWTDVTSR
|
| |