Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_2022 |
Symbol | |
ID | 4598644 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 2167558 |
End bp | 2168559 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639776626 |
Product | extracellular solute-binding protein |
Protein accession | YP_923219 |
Protein GI | 119716254 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0434097 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCGCG TACCGGCCCT CCTCGCCCTG TCCCTCGCAG CCACGCTGCT CGCCACCGGC TGCGGCAGCG ACAGCGACGC CCTGGTCATC TACAACGCCC AGCACGAGGA GCTGATGACC GACGTCGCCA AGGCCTTCAC CGACGAGACC GGCATCGACG TCGAGCTGCG CAACGGCAAG GACCTGGAGA TGTCCGCCCA GCTGGTCGCC GAGGGCAAGG CCTCGCCCGC CGACGTGTTC CTGACCGAGA ACTCCCCCGC CATGGCGCAG GTCGAGAACG CCGGCCTGTT CACCGAGCTC CCGCAGGACG CGGTCGCCCC GATCCCGGCG ATGTACCGGC CACGCAGCGG GCTGTGGACC GGCTTCGTGG CCCGCTCGAC CGTGCTCGTC TACAACACCG ACCAGGTCTC CGCGGACGAG CTGCCCGACT CGATCCTCGA CCTCGCCGAC CCCGAGTGGA AGGGCCGGAT CTCCTTCTCC CCCACCGGCG CGGACTTCCA GGCGATCGTC GCCGCGGTCC TCGACCTCGA GGGCGAGCAG AAGACCCGCG CCTGGCTGGA GGGCATCAAG GCCAACGGCA CCGTGTACGA CGGCAACAAC GTCGTCCTCG AGTCGGTCAA CTCCGGCGAG TCCGAGGTCG GGATCATCTA CCACTACTAC TGGTACCGCG ACCAGGCCGA GTCGGGCGAC GTCTCCGACC ACAGCGCCCT GTACTTCTTC GGCCACCAGG ACCCCGGCGC GTTCGTGAGC GTCTCCGGCG CCGGCATCCT CGCCTCCAGC GACCACCAGG CGGACGCGGA GAAGTTCGTG TCCTACCTGA CCAGCACCGC CGGCCAGCAG GTGCTCGCCG ACAGCTACGC GCTGGAGTAC CCGCTCAACC CCGACGTCCA GCTCGACCCA CCGGTCAAGC CGTTCGCCGA GCTCGATCCG CCCCAGGTCA ACGTCTCGGA CCTCGACGGC AAGGCCGTGG TGGACCTGAT GACCGAGGTC GGGTTCCTCT GA
|
Protein sequence | MKRVPALLAL SLAATLLATG CGSDSDALVI YNAQHEELMT DVAKAFTDET GIDVELRNGK DLEMSAQLVA EGKASPADVF LTENSPAMAQ VENAGLFTEL PQDAVAPIPA MYRPRSGLWT GFVARSTVLV YNTDQVSADE LPDSILDLAD PEWKGRISFS PTGADFQAIV AAVLDLEGEQ KTRAWLEGIK ANGTVYDGNN VVLESVNSGE SEVGIIYHYY WYRDQAESGD VSDHSALYFF GHQDPGAFVS VSGAGILASS DHQADAEKFV SYLTSTAGQQ VLADSYALEY PLNPDVQLDP PVKPFAELDP PQVNVSDLDG KAVVDLMTEV GFL
|
| |