Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0631 |
Symbol | |
ID | 9244473 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 776259 |
End bp | 778154 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003678583 |
Protein GI | 297559609 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00116155 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.250691 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCCC ACCACCCTCC CCCCGGTTCC CCCGTGCGCC GCACCGTCCC TCCCCACGCC CCTTCCACCG GCGCGTTCCC GCGCCGGGCC CCGCGCCGTA TCGCCGCCAC GGCCGCCGCG CTGCTCCTCC TGGCGGGCAC CGCCGCGGCC CCCGCCGCCG CGGACACCTC CGACGGCCAG ACCCTCAGCA TCGCCACCTC CCAGCAGGTG GACTCCTTCA ACCCCTTCAC CGCGCAGCTC GCGATCACCA CCAACGTCCT GCGCCACGTC TACGACTCCC TCGTCACGGT CGACCCCCAG ACGAACCAGC CCGCCCCCTC CCTCGCCGAG TCCTGGGAGT CCAGCGACGA CGGCCTCACC TGGACCTTCC ACCTGCGCGA GGGCGTCCGG TTCTCCGACG ACGAGCCCCT GACCGCCGAC GACGTGGTCT GGACCTTCAC CACCATGATG GAGAACGAGG CCGCGGCCGT CGCCAACGGC AACTACGTCT CCGGCTTCGA CACGGTCACC GCCGAGGACG ACCACACCGT GGTCATCGAA CTCGACGAGC CGCAGGCCAC CATGACCTCC CTCAACGTCC CGATCGTCCC CAGGCACGTC TGGGAGCCGA TCCTGGAGCG CGAGGGCGAC GCCTTCGCCG ACTACGGCAA CGAGGACTTC CCGACCGTCG GCAGCGGCCC CTTCGTCCTC ACCGGCCACG ACCGGGGCCG CTCCATCACC CTGGAGGCCA ACCCCGACCA CTGGCGCGGC GCACCCGCCT TCGAGCGGGT CGTCCTGCGC TACTACTCCG AGAAGGACGC CGCGGTGGAG GCGCTGCGCA GCGGTGAGGT CTCCCTGGTC TACGAACTCA CCCAGGCGCA GGCCGCAGCG CTGGAGTCGG CGCAGGACGT CCGGGTCAAC ATCGCCGACG GCAAGCGCTT CCAGGCCTTC ACCATCAACC CCGGCGCGGT CACCCAGGAC GGGGAGGAGT TCGGCGACGG GCACCCCGCC CTCGCCGACC GCACCCTGCG CCAGGCCATC GTCATGGCCA TCGACAACCA GGAGATCGTC GACAAGGCGC ACGGCGGCGA GGCCGTGGCC GCGGGCGGCT ACATCCCGCC CCGCTACGAG GACTTCCACT GGGCGCCCGA GGGCGAGGAG GCCGTCCTCG ACTTCGACCC CGAGGCGGCC AACGCCATGC TCGACGAGGC CGGGTACGAG CGGGGTGAGG ACGGCGTGCG CGTCTCACCC GAGGGCGACC GCCTCGAACT GCGGATGCAC GTCCACCAGG ACCGGCCCGA CAACGTCAAC ACCGGGTTGG TCATCGTCGA GCGCCTGGCC GACATCGGCA TCGAGGTGGA GAACCTCACC GTCGACCCCG GCGTGCTCAG CGACGCCCTC TTCGCGGGCG AGTACGACCT CATCTTCACC GGCTGGACGG TCAACCCCGA CCCCGACTAC GTGCTGAGCA TCCACACCTG CGGCGCCCTG CCCACCGAGC CGGGCACCAT GCAGGGCGAC GCCTACTTCT GCGACGAGGA GTACGACGAG CTCTACGAGG CCCAGCTCGC CGAGTACGAC CGCCAGGCCC GCGCGGAGAT CATCCACCAG CTCCAGGAGG TCCTCTACCG CGAGGCCGTC GTGAACGTGC TGGCCTACCC CAACATCATG GAGGCCTACC GCACCGACCA CATCGCCTCC ATCCAGTACG AGCCCGCCGA GGGCGGCAAC ATCTGGGGAC AGGACGGCTA CTGGGCCTGG TGGTCGGCCG AACCCGCCGC CGAGCGGACC GCGGGCGCGG CCTCCGGTCC CTCCGCCGGG GTCTGGATCG GCGTCGGGGC CGTCGTGCTC GTCCTCGCCG CGGTCGGGGG CTTCCTGCTG CTGCGCCGAC GTTCCACCAT GGAGGACCGC GAGTGA
|
Protein sequence | MNAHHPPPGS PVRRTVPPHA PSTGAFPRRA PRRIAATAAA LLLLAGTAAA PAAADTSDGQ TLSIATSQQV DSFNPFTAQL AITTNVLRHV YDSLVTVDPQ TNQPAPSLAE SWESSDDGLT WTFHLREGVR FSDDEPLTAD DVVWTFTTMM ENEAAAVANG NYVSGFDTVT AEDDHTVVIE LDEPQATMTS LNVPIVPRHV WEPILEREGD AFADYGNEDF PTVGSGPFVL TGHDRGRSIT LEANPDHWRG APAFERVVLR YYSEKDAAVE ALRSGEVSLV YELTQAQAAA LESAQDVRVN IADGKRFQAF TINPGAVTQD GEEFGDGHPA LADRTLRQAI VMAIDNQEIV DKAHGGEAVA AGGYIPPRYE DFHWAPEGEE AVLDFDPEAA NAMLDEAGYE RGEDGVRVSP EGDRLELRMH VHQDRPDNVN TGLVIVERLA DIGIEVENLT VDPGVLSDAL FAGEYDLIFT GWTVNPDPDY VLSIHTCGAL PTEPGTMQGD AYFCDEEYDE LYEAQLAEYD RQARAEIIHQ LQEVLYREAV VNVLAYPNIM EAYRTDHIAS IQYEPAEGGN IWGQDGYWAW WSAEPAAERT AGAASGPSAG VWIGVGAVVL VLAAVGGFLL LRRRSTMEDR E
|
| |