Gene Information |
Locus tag | Ndas_2301 |
Symbol | |
ID | 9246151 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2749354 |
End bp | 2750421 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | rhamnose ABC transporter, periplasmic rhamnose-binding protein |
Protein accession | YP_003680229 |
Protein GI | 297561255 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
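The length fields above follow from the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    # Values taken from the Ndas_2301 record above.
    length = gene_length(2749354, 2750421)
    print(length, protein_length(length))
```

Under these assumptions the record is internally consistent: the coordinates give 1068 bp, and 1068 bp gives 356 codons, of which 355 encode residues.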
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.743326 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCAGG GCCGCCCACG CCACCTGCTC CTGGCCGCGT CCGCGTTCGC GGTGCTGATG ACCGCCGCCT GCGGCGGGGG CACCACCATC GAGGACGCCC AGGAGGAGAA CGAGACCGGG CCGACCGGGA CCGCCGACCC GGACGCGCGG ATCCCCGAGG GCCTGAAGAT CGACTTCCTG CCCAAGCAGC TCAACAACCC CTTCTTCGAG ATCGTCAACC AGGGCGGCGC CGAGGCCGTC GAGGAGGTCG GCGGAACGGC CACCGAGCGC GGCGGCACCG AGGCCACGGC CGACTCCCAG GTGGAGTACG TCAACGCGGC CAGCCAGGCG GGCAGCGACG TCATCGTGAT CGCCGCCAAC GACCCCGACG CGGTCTGCCC CGCGCTCAAC GAGGCCCGCG ACAACGGCGC GGCCATCGTC GGCTACGACT CCGACGCCAA CTGCACCGAC GTGTTCGTCA ACCAGTCCTC CACCGAGCTC ATCGGCCGCA CCCTGGTCGA GATGATCGCC GAGGACCTGG GCGGCGAGGG TACCTTCGCG GTGCTGTCGG CCACCCCCAA CGCCACCAAC CAGAACGCCT GGATCGCGGC GATGGAGGAG GTGCTGGCCG AGGAGGAGTA CGCCGACCTG GAGCTGGTCG AGACCGTCTA CGGCAACGAC GACGACCTGG AGTCCTTCCA GGAGATGCAG GGCCTCATGC AGTCCCACCC CGACCTGGAC GGCGTGGTCT CGCCGACCAC GGTCGGCATC GCCGCGGCGG CCCGCTACGT CAGCGACTCC GAGTACCGGG GCGAGGTGGC CGTCACCGGC CTGGGCACGC CCAACCAGAT GCGCGAGTTC GTGCACGACG GCACCGTGGG GCGGTTCGCC CTGTGGAACC CGCTCGACCT GGGCTACCTG GCCGGGTACA CCGGCGCGGC GCTCAGGGCC GGACAGATCA CCGGCGCCGA GGGCGAGACC TTCACGGCCG GACGCCTGGG CGAGTTCACC TTCGAGACCG AGGGGGAGAT CGTGCTCGGC CCGCCGCAGG TCTTCGACGC CGGGAACGTG GACGACTTCG ACTTCTGA
|
Protein sequence | MTQGRPRHLL LAASAFAVLM TAACGGGTTI EDAQEENETG PTGTADPDAR IPEGLKIDFL PKQLNNPFFE IVNQGGAEAV EEVGGTATER GGTEATADSQ VEYVNAASQA GSDVIVIAAN DPDAVCPALN EARDNGAAIV GYDSDANCTD VFVNQSSTEL IGRTLVEMIA EDLGGEGTFA VLSATPNATN QNAWIAAMEE VLAEEEYADL ELVETVYGND DDLESFQEMQ GLMQSHPDLD GVVSPTTVGI AAAARYVSDS EYRGEVAVTG LGTPNQMREF VHDGTVGRFA LWNPLDLGYL AGYTGAALRA GQITGAEGET FTAGRLGEFT FETEGEIVLG PPQVFDAGNV DDFDF
|
| |
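The sequence fields can be cross-checked with a few lines of standard-library Python. This is a minimal sketch, not the database's own method: it assumes the space-separated 10-base groups are display formatting only, and it uses the standard codon-to-amino-acid mapping, which translation table 11 shares with table 1 (the two differ in permitted start codons, not in codon meanings). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same amino acid
# assignments and differs only in which codons may act as starts.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence above.
    print(translate("ATGACCCAGG GCCGCCCACG CCACCTGCTC"))
```

Passing the full Gene sequence value to `translate` should give the Protein sequence value with its grouping spaces removed, and `gc_fraction` on the same input can be compared with the GC content field.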