Gene Information |
Locus tag | Ndas_0558 |
Symbol | |
ID | 9244399 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 689980 |
End bp | 691314 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003678511 |
Protein GI | 297559537 |
COG category | |
COG ID | |
TIGRFAM ID | |
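
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention, but it is the one that reproduces the listed 1335 bp) and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of amino acids for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Ndas_0558
length_bp = gene_length(689980, 691314)
print(length_bp, protein_length(length_bp))
```

With the values in this record, the two functions return the listed Gene Length and Protein Length.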
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGCTGC ACACTCACTG GGGTGACCTC CCCCGCAGCG AAGGGAACGT GATGAGAGAT TTCGATCGGC GCGCATTTCT CGCGTTGACC GGCATGGGCG CGGTGGGGGC CTCCCTGCTC CTGTCCGGCT GCGGCGGCAG CGGCTCGGGG AACGGCCTCC GGTACGCCTG GTGGGGCAAC ACCGTCCGCC AGCAGAACTA CACCGAGGCC CTGGAGGCGT TCCAGGAGGC CAACCCCGAC ATCACGGTCG AGCCGGAGTT CGCCGAGTAC ACCGCCTTCC AGGAGCGCAT GACCACGCAG ATGGCGGCCC GCAACGTCGC CGACGTCTTC TGGATCGCCT CCCCCCAGGT CCTGACCTAC AAGGCGAACG GGCTGTACCG CAGGCTCGAC GACATCCCCA CCCTCGACCT CTCCGACTAC AGCGCCGAGG ACATCGAGTC CTTCTCCCTG GGCGGTGAGC TCCTGAGCAT GCCGCACGGG GTCTTCGTCC CCGTGGTGCG CCACAACGAG ACCTTCCTCG AAGAGGAGGG CGCGCGGATG CCCGGGGAGG ACTGGACCTG GGACGACCTC GCGGAGTTCC TCATCGACTA CAGCGCGAAC AACGCGCAGG GGCGTCGGGG CGCGACCTAC ACACCCGACC AGGACATGGC CTTCGAGGCG TGGCTGCGCC AGCGCGGCCA GGACCTGTGG ACCGAGGACG GGAACGTCGG CTTCGACGAG GAGGCCCTCG GCGACTGGTT CGAGTGGTGG CGCGTCCTCC TCGACGAGGG GGCGGTCCTG AGCCTGGGCG AGCAGGAGGG CATGCAGCCG GACTTCTCCG CGGTCGGCGA CCGGGTCCTG CTCAACTTCG GCAGTTCCAA CCACATCATC GACGAAGCCG CCATGTTCCC CGACTGGAAG TACCGTCTGC GGTCCGTGCC GGTGGGCGCG GACGCCGCCG ACGGCCACCG CTTCCTCTAC TACCCCCGGC TGGCCGTCTA CCAGGGCATC GACGACGCCA ACGTCGAGGC GGCGGGCAAG CTCATCGACT TCAACGTCAA CAACGTCGAG TTCCTGCGCA CCGTCGGCCT CACCATGGGC GCGCCGCCCA ACCCCCGCCT CCTCGTCGAG GCCTACGACT TCGCCTCCGA CGACGAGAAG GAGATGCTGG CCGTGGTGGA GGCGGACCGC GCCGAGCCGC AGCGCCCCCG CTACGAGGCC CCGCCCGGGA CGGGGACCTG GCGCGAGGCC ATGTCACGGG CCTCGGAGAA CGTCGCCCTG GGCAACGCCG GGGTCGCCCA GGTGACCGAG GAGCTCATCG CCGAGATCCG CTCCGGCATC GACCGGGGAG CGTAG
|
Protein sequence | MKLHTHWGDL PRSEGNVMRD FDRRAFLALT GMGAVGASLL LSGCGGSGSG NGLRYAWWGN TVRQQNYTEA LEAFQEANPD ITVEPEFAEY TAFQERMTTQ MAARNVADVF WIASPQVLTY KANGLYRRLD DIPTLDLSDY SAEDIESFSL GGELLSMPHG VFVPVVRHNE TFLEEEGARM PGEDWTWDDL AEFLIDYSAN NAQGRRGATY TPDQDMAFEA WLRQRGQDLW TEDGNVGFDE EALGDWFEWW RVLLDEGAVL SLGEQEGMQP DFSAVGDRVL LNFGSSNHII DEAAMFPDWK YRLRSVPVGA DAADGHRFLY YPRLAVYQGI DDANVEAAGK LIDFNVNNVE FLRTVGLTMG APPNPRLLVE AYDFASDDEK EMLAVVEADR AEPQRPRYEA PPGTGTWREA MSRASENVAL GNAGVAQVTE ELIAEIRSGI DRGA
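
The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG is one of the permitted alternative start codons and is read as methionine at the initiation position. The sketch below shows, under stated assumptions, how the space-grouped sequence text could be cleaned, its GC fraction computed, and its leading codons translated. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical; the codon table is built from the standard NCBI amino acid string, which table 11 shares with table 1.

```python
from itertools import product

# Standard NCBI codon ordering (bases T, C, A, G); table 11 uses the same
# amino acid assignments as table 1 and differs only in its start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS, reading a permitted start codon as M and stopping at '*'."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record
prefix = "GTGAAGCTGC ACACTCACTG GGGTGACCTC"
print(translate(prefix))
```

Applied to the first three blocks of the gene sequence, `translate` yields the first ten residues of the listed protein sequence. Running `gc_fraction` on the full gene sequence is one way to check the reported GC content.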