Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2932 |
Symbol | |
ID | 9246784 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3502082 |
End bp | 3503755 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | alpha amylase catalytic region |
Protein accession | YP_003680848 |
Protein GI | 297561874 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCCTGG AGAGCAGCCC CGACTGGTGG AAGTCCAGCG TCGTCTACCA GATCTATCCG AGCAGTTTCA ACGACAGCGA CGGCGACGGC GTGGGCGACC TGCCCGGCGT CATCGACAGA CTCGACCACC TCCAGCTGCT GGGCGTGGAC GTGGTGTGGC TGTCGCCCGT CTACCCGTCG CCCTGGGACG ACAACGGGTA CGACATCAGC GACTACCTGG ACATCCACCC CCGGTTCGGC ACCCTGGCCG ACTGGGACCG CCTGCGCGAC GAACTGCACG GGCGCGGCAT GCGCCTGGTG ATGGACCTGG TCGTCAACCA CACCAGCGAC GAGCACCCCT GGTTCACCGC CTCCCGCGCT GGGGACCCCG AACACCGCGA CTTCTACTTC TGGCGGCCCG GGCGCGACGG AGGCCCGCCC AACAACTGGG GGTCGGTCTT CTCCGGTCCC GCCTGGACCC GCGACGAGAC CAGCGGCGAG TACTACCTGC ACCTGTTCAC CCGCCGCCAG CCCGACCTGA ACTGGGAGAA CCCCAGGGTC CGCTCGGCGG TGCACGACGT GGCGCGCTGG TGGCTGGACC GCGGCGCCGA CGGCTTCCGC ATGGACGTCA TCAACTTCGT CTCCAAGACC CCCGAGATAC CCGACGGGCC CCTGGCGGGG GACCACGGGA TCTTCGCCGA CCACGCCGTC AACGGCCCCC GCCTGCACGA GTTCCTGCAC GAGATGCACC GGGAGGTGTT CGAGGGCCGC GACGTGCTCA CCGTGGGCGA GATGCCCGGA GTGGACGTGG AGCAGGCCCT GCTGCACACC ATGCCCGAGC GGCGCGAGCT GTCCATGGTC TTCCAGTTCG AGCACATGGA CCTGGACCAC GGGCCCGGCG GCAAGTTCGA CCCCCGGCCC CCGGACCTGC GCCGCCTGCG GGAGTCCCTC AACCGGTGGC AGACCGGCAT GGGCGAGCGC GGGTGGAACA GCCTGTACTG GAACAACCAC GACCAGCCCC GGGTGGTGTC GCGCTTCGGC GACGAGGACC ACCGGGTGGC GTCGGCCACG GCGCTGGCCA CGACCCTGCA CATGATGCGG GGCACGCCCT ACGTCTACCA GGGCGAGGAA CTGGGCATGA CCAACGCCCA CATGGGCGAC ATCGCCGACT ACCGCGACGT GGAGACGCTC AACCACCACC GCGCGGTGGT GGACACGGGC AGGGCCGACC CGGGGGAGGC GATGGCGGCC ATCGCCCGGA TGAGCCGCGA CAACGCGCGC ACGCCCATGC AGTGGGACGC CTCGCCGGGC GCCGGTTTCA CCACCGGAAC CCCGTGGATC GCGATCAACC CCAACCACAC CGAGATCAAC GCCGAGGCCG CCGTGGCCGA CCCGGACTCG GTCTTCCACT ACTACCGGCG GCTCATCGCG CTGCGCCGGG AGCACCCGGT GGTCGTGCAC GGCCGATTCG AGCCGCTGCT GGAGGACGAC CCGGCGGTCT ACGCCTACCG GAGGGTCCTC GACGACCGGG TGCTGCTGGT GGTGGCCAAC TGGAGCGCGC GGACGGTGCC GCTGGAGCTG GACCCCTCCG CCGCCGGGCG CGACCCGGTC CGCCTGATCG GCAACCACCC CGACGAGGGC GCCTTCGCGC CGCTGCGGCC CTGGGAGGCC CGCGTCCACC TGGGAGGCCG CTGA
|
Protein sequence | MPLESSPDWW KSSVVYQIYP SSFNDSDGDG VGDLPGVIDR LDHLQLLGVD VVWLSPVYPS PWDDNGYDIS DYLDIHPRFG TLADWDRLRD ELHGRGMRLV MDLVVNHTSD EHPWFTASRA GDPEHRDFYF WRPGRDGGPP NNWGSVFSGP AWTRDETSGE YYLHLFTRRQ PDLNWENPRV RSAVHDVARW WLDRGADGFR MDVINFVSKT PEIPDGPLAG DHGIFADHAV NGPRLHEFLH EMHREVFEGR DVLTVGEMPG VDVEQALLHT MPERRELSMV FQFEHMDLDH GPGGKFDPRP PDLRRLRESL NRWQTGMGER GWNSLYWNNH DQPRVVSRFG DEDHRVASAT ALATTLHMMR GTPYVYQGEE LGMTNAHMGD IADYRDVETL NHHRAVVDTG RADPGEAMAA IARMSRDNAR TPMQWDASPG AGFTTGTPWI AINPNHTEIN AEAAVADPDS VFHYYRRLIA LRREHPVVVH GRFEPLLEDD PAVYAYRRVL DDRVLLVVAN WSARTVPLEL DPSAAGRDPV RLIGNHPDEG AFAPLRPWEA RVHLGGR
|
| |