Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0218 |
Symbol | |
ID | 9244052 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 274799 |
End bp | 275800 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | periplasmic binding protein |
Protein accession | YP_003678174 |
Protein GI | 297559200 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.311188 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGGACCG CACGCCACCT TTCCCCCGTC CTTCTCGGCG CCGTGCTCGC CCTGACCGCC TGCGGTGCAC CGGACGGCGG CGAGCCCGAG GCCGAGCCCG CCACCGAGGC CGCCTCCGTC CCCTGCGGGC CCCCGGAGCC CGACCCCGAG CGCCTCTCCG TGGGCGACGG CACCTTCCCC GTCACCGTCA CCGACGCCAC CGGCGAGGTC ACCGTCGAGG AGGCCCCCGA GCGGATCGTC TCCCTGTCGG CCAGCCACAC CGAGATGCTC TTCGCCATCG GCGCGGGCGA CCGGGTCGAG GCCGCCGACG AGTACTCCGA CTACCCGGAG GAGGCCCCCA CCACCTCGCT GAGCGGCTTC GAACCCAGCG TGGAGGCCAT CACCGAGTAC GACCCCGACC TGGTCCTGCT CGCCCGCAGC GCCGAGGCGA CCGTCGCCCA GCTGGAGGAC GTCGGCATCC CGTCGCTGGT CCTGGACGCC GCCCAGGACC TGGAGGACAC CTACGCCCAG ATCCGCATGC TCGGCGACGT CACCGGCCAC ACCGAGGAGG CCGACGCCGA GGCCGCGCGG GTCGAGGACG AGTTCAACGC GATCGTCGAG GGCGTGTGCG AGGAGACCGG CGACGCCGGC CTGTCCTTCT ACCAGGAGCT CGACGAGACC TCCTACTCCG CCACCTCCGA CACCTTCGTC GGCCAGATCT ACGCGTCCTT CGGCCTGGTC AACATCGCCG ACGAGGCCGA CGCCGACGGG GCCTCGGGCG GCTACCCTCA GCTGTCCCAG GAGTACGTCG TGGAGCAGAA CCCGGACCTG ATCTTCCTGT CCTACGGGGA CGAGTCGACG GTCGCCGACG TGGCCGGGCG CCCCGCCTTC GACACCGTCA CCGCGGTGCG GAACGACGCC GTCTACCTGC TCGACGCCGA CATCGCCTCC CGCTGGGGGC CGCGCGTGGT CGAGTTCGCC GAGCTGGTCG GCCGGGCCGT CACCGAGAAC GCGGCGGACT GA
|
Protein sequence | MRTARHLSPV LLGAVLALTA CGAPDGGEPE AEPATEAASV PCGPPEPDPE RLSVGDGTFP VTVTDATGEV TVEEAPERIV SLSASHTEML FAIGAGDRVE AADEYSDYPE EAPTTSLSGF EPSVEAITEY DPDLVLLARS AEATVAQLED VGIPSLVLDA AQDLEDTYAQ IRMLGDVTGH TEEADAEAAR VEDEFNAIVE GVCEETGDAG LSFYQELDET SYSATSDTFV GQIYASFGLV NIADEADADG ASGGYPQLSQ EYVVEQNPDL IFLSYGDEST VADVAGRPAF DTVTAVRNDA VYLLDADIAS RWGPRVVEFA ELVGRAVTEN AAD
|
| |