Gene Information |
Locus tag | Ndas_1813 |
Symbol | |
ID | 9245663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2218690 |
End bp | 2219688 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | Integrase catalytic region |
Protein accession | YP_003679747 |
Protein GI | 297560773 |
COG category | |
COG ID | |
TIGRFAM ID | |
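The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (which is consistent with the 999 bp gene length reported here) and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative only and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is unspliced and, by default, that its final codon
    is a stop codon that does not contribute a residue.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codon_count = cds_length_bp // 3
    return codon_count - 1 if has_stop_codon else codon_count


# Values taken from the record for Ndas_1813.
length = gene_length(2218690, 2219688)
print(length)                  # 999
print(protein_length(length))  # 332
```

Both results agree with the Gene Length and Protein Length fields in this record.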
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0764832 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCACA CTACCCACGC CAACGCCACA CTCACACCCG CAGGGCGCCT GGCGTTGGCC CGCTGCGTGG TCGAAGACGA CTGGCCGCTC AGGCGTGCCG CCGAACGCTT CCAGACCAGT ACCACCACCG CCAAACGCTG GGCCGACCGC TACCGCACCC AAGGCCAGGC CGGGATGGAC GACGCCTCCA GTCGCCCGCA CCGCTCACCC CAGCGCACCC CCGCCCGCCG CGAACGCCGC GTGATCAAAC TCCGCTGCAC CCGCCGATGG GGCCCGGCCC GCATCGGCGG CCACCTGGGC ATGCATCCCT CGACCGTGCA CCGGATCCTG ACCCGCTACC GCATGCCCCG ACTGTCCCAC CTGGACCGGG CCACTCACCG GGTAGTACGC CGCTACGAGC GCGCTCGACC GGGCGAGCTC GTGCATGTGG ACATCAAAAA GCTCGGCAAC ATCCCCGCCG GGGGCGGACA CCGTACCCAG GGACGCGCTC AAGGCCGCCG CAATCGCACC ACCACCGCGC ACGCCCCGCG CAACAACCAC GGCAACCCCA AGCTGGGCTA CGGCTACCTG CACACCGCCA TCGACGACTA CTCCCGCCTG GCCTACACCG AGATCCTGGC CGATGAGAAA AAGGAGACCG CCACCGGGTT CTGGGAGCGA GCGCACGCCT ACTTCGTCTC AGCGGGGGTC GTGGTGGAGC GGGTGCTAAC CGACAACGGG GCCTGCTATA CGTCCCACCT GTGGCGCGGC CTGCTCTATG GCCAGGGCAT CAAGCACAAA CGCACCCGCC CCTACCGGCC CCAGACCAAC GGCAAGGTGG AGCGCTTCCA TCGCACCCTG GCCGACGAAT GGGCCTACGC CCGCCCCTAC CAATCCGAGA GCGAAAGGCG GGAGGCGTTC CCCGGGTGGT TGCATCACTA CAATCACCAC CGGTTCCACA CCGCGATCAG CGGCCCTCCC GCCTCCCGCG TTCCTAACCT CTCAGGTCAG TACAGCTAG
|
Protein sequence | MPHTTHANAT LTPAGRLALA RCVVEDDWPL RRAAERFQTS TTTAKRWADR YRTQGQAGMD DASSRPHRSP QRTPARRERR VIKLRCTRRW GPARIGGHLG MHPSTVHRIL TRYRMPRLSH LDRATHRVVR RYERARPGEL VHVDIKKLGN IPAGGGHRTQ GRAQGRRNRT TTAHAPRNNH GNPKLGYGYL HTAIDDYSRL AYTEILADEK KETATGFWER AHAYFVSAGV VVERVLTDNG ACYTSHLWRG LLYGQGIKHK RTRPYRPQTN GKVERFHRTL ADEWAYARPY QSESERREAF PGWLHHYNHH RFHTAISGPP ASRVPNLSGQ YS
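The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how such a field could be processed, the helpers below strip the whitespace, compute the G+C fraction, and split a nucleotide sequence into codons. The names `clean_sequence`, `gc_content`, and `codons` are hypothetical, and the example uses only the first two blocks of the gene sequence, so its G+C value is not expected to match the 69% reported for the full gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def codons(seq: str) -> list[str]:
    """Split seq into consecutive triplets, ignoring any trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# First two 10-base blocks of the gene sequence shown above.
example = clean_sequence("ATGCCCCACA CTACCCACGC")
print(len(example))                   # 20
print(round(gc_content(example), 2))  # 0.65
print(codons(example)[:3])            # ['ATG', 'CCC', 'CAC']
```

Applied to the full gene sequence, the last element of `codons(...)` can be checked against `STOP_CODONS`; the sequence above ends in TAG.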