Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_3914 |
Symbol | |
ID | 4598049 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 4119473 |
End bp | 4120846 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639778520 |
Product | extracellular solute-binding protein |
Protein accession | YP_925099 |
Protein GI | 119718134 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.313486 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGATCAC GCAAGACGCG CCGGGGGATG GCAGTCGCTG CCGCTCTCGT GAGCGCAGGA CTCGTTCTCG CCGGCTGTGG CGGGAGCGAC GACGGTGGTA GTGGCGAGGC CAGCGGCAAG TCGCCGGGCG AGGGCAAGGC CGAGTGCGAG CAGCTCACGC AGTTCGGTGA CCTGACCGGC AAGGACGTCA CGGTCTACAC CTCGATCGTG GCGCCCGAGG ACAAGCCCCA CATCGACTCC TGGAAGGTCT TCGAGGACTG CACCGGCGCC GATGTGAAGT ACGAGGGCTC GAAGGAGTTC GAGACCCAGC TGCAGGTGCG CGTCCAGTCG GGCAACCCGC CGGACATCGC GTACGTCCCG CAGCCCGGCC TGCTCCAGAC CCTGGTCGGC ACCGGCAAGG TCGTCGAGGC CCCCGACACG GTCTCGGCCA ACGTCGACAA GTGGTTCGGT GAGGACTGGC GCTCGTACGG CAGCGTGGAC GGCAAGCTGT ACGCCGCCCC GCTGGGCGCG AACGTGAAGT CCTTCGTGTG GTACTCCCCC AAGATGTTCG CCGAGAACGG CTGGGAGATC CCGACGACGT GGGACGACAT GCTCGCCCTG TCCGACACGA TCACCGCGAC CGGCATCAAG CCGTGGTGCG CGGGCATCGA GTCCGGCGAG GCCACCGGCT GGCCGGCCAC CGACTGGCTC GAGGACGTGC TGCTCCGCTC GGTCGGTCCG GACGTCTACG ACCAGTGGGT CGCCCACGAG ATCCCCTTCA ACGACCCCGC GGTCGTCGAG AGCCTCGACA ACGTCGGCGC GATCCTGAAG AACGACAAGT ACGTCAACGG CGGCATCGGT GACGTCAGCT CGATCGCCAC GACCGCGTTC CAGGACGGCG GCCTGCCGAT CCTCGACGGC AAGTGCGCCC TGCACCGCCA GGCGAGCTTC TACGCCGCCA ACTGGCCCGA GGGCACCGAC GTCTCGGAGA ACGGCGACGT GTTCGCGTTC TACCTGCCGG CCATGGGCGA CGAGTTCGGC AACCCGGTCC TCGGCGGCGG CGAGTTCGTC GCAGCGTTCT CGGACGCGAT CGAGGTCCAG GCCTTCCAGA CCTACCTGTC CAGCGACCAG TGGGCCAACG AGAAGGCCAA GGCCACCCCG AACGGCGGCT GGGTCAGCGC GAACAAGGGC CTGGACATCG CCAACCTGGC GAGCCCGGTC GACAAGCTCT CCGGCGAGAT CCTGCAGGAC CCGGACGCGG TCTTCCGCTT CGACGGGTCC GACATGATGC CGGGTGAGGT CGGCGCTGGT TCGTTCTGGA AGGAAATGAC CAACTGGATC ACCGGCGAGA GCACCCAGGA CGCGCTCGAC AAGATCGAGG CCTCCTGGCC GTGA
|
Protein sequence | MRSRKTRRGM AVAAALVSAG LVLAGCGGSD DGGSGEASGK SPGEGKAECE QLTQFGDLTG KDVTVYTSIV APEDKPHIDS WKVFEDCTGA DVKYEGSKEF ETQLQVRVQS GNPPDIAYVP QPGLLQTLVG TGKVVEAPDT VSANVDKWFG EDWRSYGSVD GKLYAAPLGA NVKSFVWYSP KMFAENGWEI PTTWDDMLAL SDTITATGIK PWCAGIESGE ATGWPATDWL EDVLLRSVGP DVYDQWVAHE IPFNDPAVVE SLDNVGAILK NDKYVNGGIG DVSSIATTAF QDGGLPILDG KCALHRQASF YAANWPEGTD VSENGDVFAF YLPAMGDEFG NPVLGGGEFV AAFSDAIEVQ AFQTYLSSDQ WANEKAKATP NGGWVSANKG LDIANLASPV DKLSGEILQD PDAVFRFDGS DMMPGEVGAG SFWKEMTNWI TGESTQDALD KIEASWP
|
| |