Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_3683 |
Symbol | |
ID | 4597600 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 3905693 |
End bp | 3906991 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639778291 |
Product | extracellular solute-binding protein |
Protein accession | YP_924870 |
Protein GI | 119717905 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAATC GAACTGTGCG CCGCACGGTG GCTGTCACAG CTGCCGTCGC GGCGCTGTCC CTGACGGCGG CATGCTCTGG GGGCAAGGGT GCGCCGGGGT CCGAGACCCC CAGCGGTGAC GGAGACGTGA GCTCGAACGT CGAAGGGACC GTCCGGGTCC TCATGGAGGG CGTGCCCGAC ACCGACATCG TCGAGGGGAT GATCGGCGAG TTCAACGAGC AGTACCCCAA CGTGAAGGTC CAGATCGAGA CCGCCGTCTA CGACCAGATG CGTGACAAGT ACGTGGCGTC CTTCACCGCA CCTGAGTCGT CCTACGATCT CGCGATCATC GACAACCCCT GGATGGGCGA CTTCGCGAAG GCCGGATTCC TGACCCCACT GGACTCGTAC ATCGAGTCGA CGTCGGGTTA CGACTACGAG GACTTCGCCG AACCTCTGCG CCAGATCAAC GAGGTCGACG GCAAGACCTA CGGTATTCCG TTCTACAACT ACGGCTTGGG GCTGATCTAC CGCACCGATC TCCTCTCGGC AGCGCCCTCC ACACTGGATG AACTCGTTGC TGCCGCCCAG GAGAACACCA CTGACACTCG GGCTGGCATT GCGATGCAGC CCCAGCGCGG CTACAAGGCA TTTGAGGAAT GGGCCAACTT CCTCTTTGCC GCTGGTGGCT CCATCTACGA CGACGAAGGA AATCTCAGCC TGGACACCCC CGAGGCGAAG GAGGCCCTCG AGACCTACAT CGAGCTCTAC GAGACCGCGG CTCCCGCCAG CAGCCTCAAC TGGGCCTTCG ACGAGGCGCT GCGATCCGTG AGCAGCGACA AGGCGGCCAT GATGGTCTCC TACAACTGGA TGCTCCCCAC CCTCAACGCT GACGACTCGC CCGCCGGCGA TCTCGCTGGC AAGTTCGCTT TGGCGACCAT GCCGGGCGGC AAGCAAGTCC TCGGTTCATG GAGTTGGGCC ATCCCCTCCA ACAGTGAGAC GGACGATGCC GACTGGGCGT TCATCTCTTG GCTGACCTCT GCCGACGGTG AGAAGCAGCG AGTGGAGGCC GGTGGCGCAC CCGTCCGGCA GAGCGTCCTG ACTGATCCGC AGGTGGCGGC CCAGGGCTTC GGTGCTGACT ACTACGCCAC TGTCGGTGAC ATCCTCGCCA ACTCGGCCCC CCTGTGCCAG GGCGCCAACT GCGACGAGAT GATCCAGGCG GTCGGAACCG AGCTCAGCGC CGCAGTCTCC GGACAGAAGA GCGTGGCAGA CGCCCTCTCT GCGGCCCAGG AGCAGGCGAC TCGGATCCAG TCCAGCTGA
|
Protein sequence | MKNRTVRRTV AVTAAVAALS LTAACSGGKG APGSETPSGD GDVSSNVEGT VRVLMEGVPD TDIVEGMIGE FNEQYPNVKV QIETAVYDQM RDKYVASFTA PESSYDLAII DNPWMGDFAK AGFLTPLDSY IESTSGYDYE DFAEPLRQIN EVDGKTYGIP FYNYGLGLIY RTDLLSAAPS TLDELVAAAQ ENTTDTRAGI AMQPQRGYKA FEEWANFLFA AGGSIYDDEG NLSLDTPEAK EALETYIELY ETAAPASSLN WAFDEALRSV SSDKAAMMVS YNWMLPTLNA DDSPAGDLAG KFALATMPGG KQVLGSWSWA IPSNSETDDA DWAFISWLTS ADGEKQRVEA GGAPVRQSVL TDPQVAAQGF GADYYATVGD ILANSAPLCQ GANCDEMIQA VGTELSAAVS GQKSVADALS AAQEQATRIQ SS
|
| |