Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_3229 |
Symbol | |
ID | 4599167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 3430271 |
End bp | 3431497 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639777835 |
Product | extracellular solute-binding protein |
Protein accession | YP_924418 |
Protein GI | 119717453 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0687] Spermidine/putrescine-binding periplasmic protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.712082 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGACT CAGCGAGGTT GGCGGCCCTG GCGCGGGCCG GTGGCGGCGT TCCCCCGTGG GCGATGTCCG GGATGGGGCG GCGGTCCTTC CTCCGCGGAG CCTCGCTGAC GGCGCTCGCG GTGGGGGCGC CGGGGCTGCT CTCGGCCTGC GGCACCGAGG GCGCGAAGGT CGACGCCGGC TCCTGCACGA GCACCGACCT CAGCGCTGAT GAGAAGACGA TCACGTTCGC CAACTGGATC GGTTACATCG ACCCCGTCAA GAAGCCGGAC TCCACGCTCT CGAAGTTCCA GCAGCAGACC GGCATCACCG TCGACTACAA GAACGGCGAC GTCAACGACA ACGAGCAGTT CTTCGCCAAG GTGTCGCCCC AGCTGCAGGA CTGCCGGCCC ACGGACCGTG ATGTCTTCGT CGTGACCGAC TACATGGCCG CGCGGATGAT CGAGCTCGGC TGGATCCAGA AGCTCGACCA CGCCAACCTG CCCAACGTCG ACGCGAACCT GATCGACTCC CTGAAGTCGC CCAGCTGGGA CCCGAACCGC GACTACAGCG TGCCGTGGCA GAGCGGCATG ACCGGCATCT GCTACAACGC CGAGCTCACC GACCCCGTCT CGAGCTTCGA GGAGCTGCTC ACCCGCCCGG ACCTCAAGGG CAAGATCGAC CTGCTCAGCG AGATGCGGGA CACGATGCTG TTCATGCTGC TCCTGAACGG CAGCAACCCG GCGGACTTCA CCGACGACGA GTTCTCCGCC GCGATCGACA GTCTCCAGGG CTACGTCGAC AGCGGCCAGG TGCGCAGGTT CACCGGCAAC GACTACGTCG ACGACATGAA GTCGGGCGAC ATCGTCGCCT GCGAGGCGTG GAGCGGCGAC GTCATCAACC TGCTCGGCGG CGGGAAGTTC AAGTACGTCC CGCCCAGTGA GGGCTTCGCG ATCTGGACCG ACAACATGCT GGTGCCGAAC AAGGCGGCGC ACAAGTCGAA CGTCGAGGAG CTGATGAACT ACTACTACGA CCCGGTCAAC GCCGCGAAGC TCGCTGCCTG GAACTACTAC CTCTGCCCGG TCAAGGGTGC CCAGCAGGAG ATCGCGCAGT TCGACAAGTC CGCAGCCAAG AGCGACTTCA TCTTCCCCGA TGCCAAGACC ATGGAGTCGG GCCACCAGTT CATGCCGCTG AGTGACACCC AGGAGCGCGA CTACCAGCGC CGGTTCAACG AGGTGATGGG TGGCTGA
|
Protein sequence | MSDSARLAAL ARAGGGVPPW AMSGMGRRSF LRGASLTALA VGAPGLLSAC GTEGAKVDAG SCTSTDLSAD EKTITFANWI GYIDPVKKPD STLSKFQQQT GITVDYKNGD VNDNEQFFAK VSPQLQDCRP TDRDVFVVTD YMAARMIELG WIQKLDHANL PNVDANLIDS LKSPSWDPNR DYSVPWQSGM TGICYNAELT DPVSSFEELL TRPDLKGKID LLSEMRDTML FMLLLNGSNP ADFTDDEFSA AIDSLQGYVD SGQVRRFTGN DYVDDMKSGD IVACEAWSGD VINLLGGGKF KYVPPSEGFA IWTDNMLVPN KAAHKSNVEE LMNYYYDPVN AAKLAAWNYY LCPVKGAQQE IAQFDKSAAK SDFIFPDAKT MESGHQFMPL SDTQERDYQR RFNEVMGG
|
| |