Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4533 |
Symbol | |
ID | 9248413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5376098 |
End bp | 5377018 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_003682426 |
Protein GI | 297563452 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.664024 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.640764 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAGA AGAGGGCGCC CCAGGCGCCG CGGAGCGCCG ACGCCGAGAG CGTCGGAGGT GCCGAACGCA CCCAGTTCCA GATGATCCTG TCCCGGTTCC TGCGCCACCG GGCCGCCATG ATCAGCCTGG TCGTCCTGGT GGTCGTCGTC CTGGCCGCCT TCGTGGGCCC GCTCCTGTGG AGGTGGGACC ACACCGTCCA CCTGGAGATC CCGCCGAGCG TGCCGCCCAA CGCGGACCAC CCGCTGGGCA CCACCACCGC CGGGCACGAC GTCCTGGGCC AGCTCATGCG CGGCGCCCAG CAGACGCTCA AGGTGGCGTT CACGGTGTCG GTCATGGGCA CCGTCATCGG CTCCCTGTGG GGCGCCACGG CCGGTTACTA CGGCGGCCGG ATCGACGCGC TCATGATGCG CGTGGCCGAC GTGTTCATGA TCGTGCCCCT GCTGGTGATG GTCGCCGCGA TCGCGGGCAA CGCCCGCGCC GGGACCACCT GGTACGCGGT GGCCCTCATC ATCGGCTTCT TCTCGTGGGC GCAGATCGCC CGCGTGGTGC GCAGCGTGGT CCTGTCCCTG CGCGAGCAGG AGTTCGTGGA GGCGGCCAAG GCCGCGGGGG CCTCGCCCGG GTGGATCATC GTCCGCCACC TGCTGCCCAA CGCGGCCGGG CCGATCATCG TCGCCGCCAC GCTGCTGATC GCGGTGGCGA TCCTGCTGGA GGCGGGCATG TCCTTCCTCC AGTTCGGCAT CCAGCCCCCC GACATCTCGC TCGGCCAGAT GATCAGCGAC GCGCGCACGG CCGTCTCCAC CAGGCCCTGG CTGTTCTACC CGCCGGGCCT GCTGCTGCTG GTGATCTGCC TGACGATCAA CTTCATCGGC GACGGGCTGC GCGACGCCCT CGACCCACGA CAGACCATGG TGCGGCGATG A
|
Protein sequence | MSEKRAPQAP RSADAESVGG AERTQFQMIL SRFLRHRAAM ISLVVLVVVV LAAFVGPLLW RWDHTVHLEI PPSVPPNADH PLGTTTAGHD VLGQLMRGAQ QTLKVAFTVS VMGTVIGSLW GATAGYYGGR IDALMMRVAD VFMIVPLLVM VAAIAGNARA GTTWYAVALI IGFFSWAQIA RVVRSVVLSL REQEFVEAAK AAGASPGWII VRHLLPNAAG PIIVAATLLI AVAILLEAGM SFLQFGIQPP DISLGQMISD ARTAVSTRPW LFYPPGLLLL VICLTINFIG DGLRDALDPR QTMVRR
|
| |