Gene Information |
Locus tag | Ndas_1608 |
Symbol | |
ID | 9245458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1969770 |
End bp | 1970810 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | periplasmic binding protein |
Protein accession | YP_003679543 |
Protein GI | 297560569 |
COG category | |
COG ID | |
TIGRFAM ID | |
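The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1969770, 1970810)
print(length, protein_length(length))  # prints: 1041 346
```

Both values agree with the Gene Length and Protein Length fields of this record.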
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0163277 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.602881 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCGTC GCGCCCCGCG CCCCCTTCCC CTCCTGCTCG CCTCCGCCCT GCTGCTCACC GCCTGCGCCC CCTCCCCGGA AGGGGGGAGC GGCGGCACGG AGGCCGCCGA CGCCGACGGC GCCCGCGCGG TCGACAACTG CGGACACCGG GTGGGCGTCG ACGCCCCGCC CGAGCGCGTC GTCTCCCTCA ACCAGGGCAG CACCGAGATC CTGCTCTCCC TGGGGCTGGC GGACCGCCTC GTCGGCACCG CCACCTGGAC CGATCCGATC ATGGAGGGGT TGGAGGAGGC CAACGGGGGC ATCCCCCGCC TGGCCGACAA CGCCCCCTCC TTCGAGGTGG TCCTGGACGC CGAACCGGAC TTCGTGACGG CCTCCTTCGT CTCCACGCTC GGCACGGGCG GTGTCGCCAC CCGCGAGCAG TTCGAGGAAC TGGGCGTGCC CACCTACGTC TCCCCGACCG ACTGCGCCAC CGGCAAGGAC AACGACAGCG GGGGAGACGG CTCGCGCAGC GAACCGCTCA CCCTCGACGC CGTGTACGGC GAGATCCGCG ACCTGGCCCT GCTGTTCGGG GTCGAGGAGC GCGGGGAGGA GCTGATCGCC GAGCTGGAGG GGCGGGTGGC GGCGGCCACC GCGGACCTGG ACGCCTCCGG CGTCTCCCTC ATGTACTGGT TCGCCAACTC CCAGTCGCCC TACCTCGCGG GCTGCTGCGG CGCCCCCGGC GCCATCACCC GCGCGGTCGG CGCGCGCAAC GCCTTCGACG ACACCCACGA CGAGTGGCCC CAGATCAACT GGGAGACCGT CGCCGACCGC GACCCCGACG TCATCGTCCT CGGCGACCTG ACCCGCGACT CCCAGACCGC GGAGTCCGCC GACGCCAAGA TCGCGTTCCT GGAGTCCCAC CCCGCCACGA GCAACCTCAC CGCCGTGCGC GAGGAGCGCT ACGTCCTGCT CAGCGGCCAG GCCATGAACC CCTCGATCCG CACGGTCGAG GGGATCGAGC AGGTCGCCGA GGGTCTGCGC GGCCTCGGGC TCGGCCGGTG A
|
Protein sequence | MLRRAPRPLP LLLASALLLT ACAPSPEGGS GGTEAADADG ARAVDNCGHR VGVDAPPERV VSLNQGSTEI LLSLGLADRL VGTATWTDPI MEGLEEANGG IPRLADNAPS FEVVLDAEPD FVTASFVSTL GTGGVATREQ FEELGVPTYV SPTDCATGKD NDSGGDGSRS EPLTLDAVYG EIRDLALLFG VEERGEELIA ELEGRVAAAT ADLDASGVSL MYWFANSQSP YLAGCCGAPG AITRAVGARN AFDDTHDEWP QINWETVADR DPDVIVLGDL TRDSQTAESA DAKIAFLESH PATSNLTAVR EERYVLLSGQ AMNPSIRTVE GIEQVAEGLR GLGLGR
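The sequences above are displayed in space-separated blocks of 10 characters, so the spaces need to be removed before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field; the helper names are hypothetical, and the example input is only the first two display blocks of the gene sequence, not the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace that separates the display blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First two display blocks of the gene sequence only (illustrative input).
seq = clean_sequence("ATGCTGCGTC GCGCCCCGCG")
print(len(seq), round(gc_fraction(seq), 2))  # prints: 20 0.8
```

Applying the same two functions to the complete gene sequence would give the value to compare against the reported GC content.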