Gene Information |
Locus tag | Ndas_0249 |
Symbol | |
ID | 9244083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 308080 |
End bp | 310050 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | alpha amylase catalytic region |
Protein accession | YP_003678204 |
Protein GI | 297559230 |
COG category | |
COG ID | |
TIGRFAM ID | |
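The start, end, gene length, and protein length fields above are mutually consistent. A minimal sketch of that arithmetic follows, assuming the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length_bp = gene_length(308080, 310050)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1971 656
```

The strand field does not affect this calculation, since the length depends only on the span between the two coordinates.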
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.852341 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCGGAC GCCTGCCCAT CCTCGATATC TCTCCAGAGA ACGACCTCGG ACCGGTGAAA GCCGTTCCGG GCGAACGCTT CACCGTGGGA GCGACGGTGA TCCGCGAGGG CCACGACTCC CTCGCCGCGG GCGTCGTCGT CTACTCCCCC GAGGGACGCC GCGAACAGCT CGTCCCCATG CGCGAGCACG CCCCCGGCAC CGACCGCTAC GAGGCCTGCG TCAGCCTCCC CACGGAGGGG ACGTGGTCCT TCGCCGTCGA GTCGTGGACC GACCCCTTCG CCACCTGGCA CCACGTCGCG GGCGTCAAGC TCCCGCTGGG CCAGGACACC GAACTCGTCC TGGAGGAGGG CGCCCGCCTG CTCGCCCGCG CCGGGCGACG GGTCCCGCGC CGCCCCTTCC TGAACGCCGC CGCCAAGGCG CTGCGCGACA CCTCGATGAC CCCGGTGGAG CGCTTCGAGG CGGCGCTCAC CCCCGAGGTC CTGGCCGAGA TGGAGCGCGC GCCCCTGCGC GAGCTGGTCA CCAGGTCCAA GCGCACCAGC GTCGTCGTCC ACCGCGAGCG CGCGCTGTTC GGCTCCTGGT ACGAGTTCTT CCCGCGCTCG GAGGGCGCCC AGGTCGACAC CGCGCCCGGC CAGGAGCTCT CGGGCACCCT CGCCACCGCG GCCAAGCGCC TGCCCGCCAT CGCCGACATG GGCTTCGACG TGGTCTACCT GCCGCCCATC CACCCGGTCG GCACCACCCA CCGCAAGGGC GCCAACAACG CGCTGACCGC CGGTCCCGGC GACCCCGGTT CGGTGTGGGC CATCGGGTCG GCCGACGGCG GCCACGACGC GGTCCACCCC GACCTGGGCA CCCTCGCCGA CTTCGACGCC TTCGTCGCCG AGGCCCGCGA GCACGGCATG GAGATCGCCC TGGACCTGGC CCTGCAGTGC TCCCCCGACC ACCCCTGGGT GACCGAGCAC CCCGAGTGGT TCACCGCCCG GGCCGACGGC TCCATCGCCT ACGCCGAGAA CCCGCCCAAG AAGTACCAGG ACATCTATCC GCTCAACTTC GACCGCGACT TCGAGGGCCT GTACGCGGAG GTCCTGCGGG TGGTCGAGCA CTGGATCGCG CACGGCGTAC GCGTCTTCCG CGTCGACAAC CCGCACACCA AGCCGGTCGC TTTCTGGCAG AAGCTGCTCG CCGACGTCGC CGACAGGCAC CCCGACGTGC TGTTCCTCGC CGAGGCCTTC ACCCGCCCCG CCATGATGCG CACGCTGGCC AAGGTCGGCT TCCACCAGTC CTACACCTAC TTCACCTGGC GCAACGGCAA GGACGAGCTG ACCGACTACC TCACCGAGCT GAGCCGGGAG AGCGCCCACT ACCTGCGCCC CAACCTCTTC GCCAACACCC CGGACATCCT GCACGCCTAC CTCCAGCACG GCGGTCGGCC CGCGTTCGCC GTCCGCGCCG TGCTGGCGGC CCTGCTCTCC CCCACCTGGG GCGTCTACTC CGGCTTCGAA CTGTGCGAGA ACACCCCCGC CGGGCCGGGC AGCGAGGAGT ACCTCGACTC GGAGAAGTAC CAGTACCGCC CCCGCGACTG GGCCGCGGCC GAGGCCTCCG GCGAGACCCT CACCGGCCTC ATCACGCTGC TCAACCGGCT GCGCCGGGAG CACCCGGCCC TGCGGGAGCT GCGCAACCTG CGCTTCCACC ACGTGGACCG GCCCGAGATC GTCTGCTTCT CCAAGCACCG GCCCGGAACC GGCCCCAAGG ACCCCGACGA CGCCGTGATC GCCGTCGTCA ACCTCGACCC GCACCACGCA 
CGCGAGGCGA CGGTGCACCT GGATCTGCCG TCCATCGGCC TCACAAGGGA GGAGGAGTTC AGGGTGACCG ACGAGCTGAC CGGCCGTTCC TACACCTGGG GTGCGGACAA CTACGTCCGT CTCGACCCCG CGGCCGGTCC CGCGCACGTG TTCACCGTCA GCGGCAGATA G
|
Protein sequence | MIGRLPILDI SPENDLGPVK AVPGERFTVG ATVIREGHDS LAAGVVVYSP EGRREQLVPM REHAPGTDRY EACVSLPTEG TWSFAVESWT DPFATWHHVA GVKLPLGQDT ELVLEEGARL LARAGRRVPR RPFLNAAAKA LRDTSMTPVE RFEAALTPEV LAEMERAPLR ELVTRSKRTS VVVHRERALF GSWYEFFPRS EGAQVDTAPG QELSGTLATA AKRLPAIADM GFDVVYLPPI HPVGTTHRKG ANNALTAGPG DPGSVWAIGS ADGGHDAVHP DLGTLADFDA FVAEAREHGM EIALDLALQC SPDHPWVTEH PEWFTARADG SIAYAENPPK KYQDIYPLNF DRDFEGLYAE VLRVVEHWIA HGVRVFRVDN PHTKPVAFWQ KLLADVADRH PDVLFLAEAF TRPAMMRTLA KVGFHQSYTY FTWRNGKDEL TDYLTELSRE SAHYLRPNLF ANTPDILHAY LQHGGRPAFA VRAVLAALLS PTWGVYSGFE LCENTPAGPG SEEYLDSEKY QYRPRDWAAA EASGETLTGL ITLLNRLRRE HPALRELRNL RFHHVDRPEI VCFSKHRPGT GPKDPDDAVI AVVNLDPHHA REATVHLDLP SIGLTREEEF RVTDELTGRS YTWGADNYVR LDPAAGPAHV FTVSGR
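The sequences in this record are printed in space-separated blocks of 10 characters. The following is a hedged sketch of how such a string could be normalized and its GC fraction computed, for comparison with the GC content field. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, as printed in this record.
first_blocks = "GTGATCGGAC GCCTGCCCAT"
print(len(clean_sequence(first_blocks)))      # 20
print(round(gc_fraction(first_blocks), 2))    # 0.65
```

Passing the complete gene sequence string to `gc_fraction` would give a value to compare against the reported GC content.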