Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0436 |
Symbol | |
ID | 9244275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 529302 |
End bp | 530321 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | Exonuclease RNase T and DNA polymerase III |
Protein accession | YP_003678389 |
Protein GI | 297559415 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0464133 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.956681 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAACGC TGCGGATGGG CGCGCTGACC CGGCGCAGGG TCGGCATGCG GGCGGCGGAC CTGGAGTACG CGGTGCTCGA CATGGAGACC ACCGGCCTCG AACCGCGCGA GGGCGCGCGC ATCGTGGAGA TCGCCGTGGT GCGCGTGCGC GGCGACGGCA AGTTCGTGGA GGAGTTCAGC ACTCTGATCG ACCCGCGCGC GCCGGTGGGC GGCCGGGAGT TCCACGGCAT CGGCGAGGGC GACACGGTGG GCGCCCCGAC CGCGGCCCAG GTGGTGCCCA GACTCACCGA ACTGCTCTCC GGGGCGGTCG TGGTCGGCCA CAACCTCGAC TTCGAACAGC GCTTCCTGGC CTCGGAACTG GTGCCCGCCG GGCTTCCCAC GGGCCAGGCC GGGCTGTGCA CGCTGCGCGC GCTGCGCTCC CAGGTGGAGC TGGAGCGGTA CTCGCTGCCC AAGGCCTCCC ACCGGCTCAG CGGCGACTGG CCGACCGGAC AGCACACCGC GCTGGGAGAC GCCCGTGCCT GCGCCAAGCT GCTCGCGGAG ATGCTCACCA ACGCCCCCGG CGAACTGCGC TACGGCGGTC CCGCGCCCAA GCGGCTCACG GTGCCGGACC CGGTTCCCGG CCCGGTCGGC GCTGCGGGGC CCGTCCGCTG GAAGCCGCGC ACGTCCTCGG TGCCCGGCGG ACTGCCCCCG TTGAGCCCCT GGCAGGCGCG GTGGCGGCCC CACGAGCTGG ACCCGCTGCT GTGCGGCGGG GCCTTCGGCG CCACGGACCG CGCCATCGCG GAGATGGCCG CGCACCGGGA CACCCGCTTC CGCGAGCGGC TGGCCGCGGC CGCCGCGGTG ACGGGCGGGC TCGCGGCCAC CGCCGCGGGC GGCCTGCTGC TGCGCATGGC CGGCGGCGGG GGCACGCGCG ACCTCGGTTA CCCGGCCGGA CGCGGCGACG GCTCCCCGGA ACGGCTGCTC GGCCGGATCG GCCGGTCCGT GCTCAGGGAC CGGCCGGGGT TCACAGACCG GACAGGGTGA
|
Protein sequence | MGTLRMGALT RRRVGMRAAD LEYAVLDMET TGLEPREGAR IVEIAVVRVR GDGKFVEEFS TLIDPRAPVG GREFHGIGEG DTVGAPTAAQ VVPRLTELLS GAVVVGHNLD FEQRFLASEL VPAGLPTGQA GLCTLRALRS QVELERYSLP KASHRLSGDW PTGQHTALGD ARACAKLLAE MLTNAPGELR YGGPAPKRLT VPDPVPGPVG AAGPVRWKPR TSSVPGGLPP LSPWQARWRP HELDPLLCGG AFGATDRAIA EMAAHRDTRF RERLAAAAAV TGGLAATAAG GLLLRMAGGG GTRDLGYPAG RGDGSPERLL GRIGRSVLRD RPGFTDRTG
|
| |