Gene Information |
Locus tag | Ndas_1779 |
Symbol | |
ID | 9245629 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2177271 |
End bp | 2178389 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | transposase, IS605 OrfB family |
Protein accession | YP_003679713 |
Protein GI | 297560739 |
COG category | |
COG ID | |
TIGRFAM ID | |
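The length fields above agree with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the CDS is taken to include its stop codon. Both readings are assumptions; the record does not state them. A minimal Python sketch of that arithmetic, using hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2177271, 2178389)
print(length_bp)                  # 1119
print(protein_length(length_bp))  # 372
```

Under these assumptions the results match the Gene Length (1119 bp) and Protein Length (372 aa) rows.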
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.238729 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.359878 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGTGGGCG TGTCGAAGAT CGTCGTGCAG CTCAAGCTCA CGCCGTCGCC CGAGCAGGCG GCGGTGCTGA CCTCGACCCT GCGCGATCTC AACACCCACA CCACCTGGGT GGCGAGGGTG GCCCACGAGC AGGGTGTGAT GCGCGACTAC GAGCTGCGCA AACATACCTA CCAGCAGTTG CGTGAGGCCG GGGTGGGCTC GCAAGCGGCC CAGCACGTGA TCAAGAAGGT GTGCGACGCC TACCACGCCC GCCGCTCCAA TCTGAACAAC GGCAACTACG GACCCCAAGG ATCGGCCCTC CGGGAGCGGA TCGAATCCAC ACCGATCGCC TTCCGCCCCG ACTCGGCGCA CCCCTACGAC GCGCGCGACC TGTCCTTCGC CATGGACGCG CGCACGATCT CACTGTGGAC CTTCCAGGGC CGATTGAAGG ACGTGCCCTT CGTCGGCTCC CCCGACCAGA TCAAGATGCT GGCCGAACAC AAACGCGGTG AGGCCGACCT GCTCTGCCGC GACGATGCCT GGTTTCTCGC GGTGACCGTC GAGGTGCCCG ACGCTCCTGA GATCGACCCC AACGGGTTTC TTGGGGTGGA TCTGGGGATT GTCAACATCG CCACCACCAG TGACGGCCGG GTCATGGCCG GGCGCCAGAT CAACCGGTAC CGCCGTCGGC AGCTCAGGCT GCGCCAGAAG TTGCAGGCCA AGGGCAGCCG GTCCGCCAAG CGCCTGCTCA ACAAGCGGCT CCGCCGTGAA GCACGGTACG CCAGAAACAT CAACCACCAG ATCTCGAAAC GCATCGTGGC CGAGGCCGAA CGCACCGGGC GCGGTATCTC CCTTGAGGAT CTCAGGGGGA TCCGCGCCCG GGTACGGCAA CGCAGGCCCC AACGGGTCAC GCTGCACTCC TGGTCCTTCC ACCAACTGGG CGCCTTCATC GCCTACAAAG CGCGCCTGGA GGGCGTGCCG GTGGTGTTCG TGGACCCGGC GCACTCCTCA CGCGAGTGCG CCGCATGCTC CTACACTCAC AAGGCCAACC GGGTCTCACA GGCCTTGTTC GTCTGTCGGG ACTGCGGCGT CGTTGCGCAC GCGGGGCGCA AGTCACACGT CCCACCCGAC CACCCCTAG
Protein sequence | MVGVSKIVVQ LKLTPSPEQA AVLTSTLRDL NTHTTWVARV AHEQGVMRDY ELRKHTYQQL REAGVGSQAA QHVIKKVCDA YHARRSNLNN GNYGPQGSAL RERIESTPIA FRPDSAHPYD ARDLSFAMDA RTISLWTFQG RLKDVPFVGS PDQIKMLAEH KRGEADLLCR DDAWFLAVTV EVPDAPEIDP NGFLGVDLGI VNIATTSDGR VMAGRQINRY RRRQLRLRQK LQAKGSRSAK RLLNKRLRRE ARYARNINHQ ISKRIVAEAE RTGRGISLED LRGIRARVRQ RRPQRVTLHS WSFHQLGAFI AYKARLEGVP VVFVDPAHSS RECAACSYTH KANRVSQALF VCRDCGVVAH AGRKSHVPPD HP
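The sequence fields are printed in space-separated blocks of 10, so the spacing has to be stripped before any computation. The sketch below is one possible way to do that, then compute a GC fraction and translate a coding sequence. It rests on assumptions: the function names are hypothetical, the gene sequence is taken to be shown on the coding strand (it begins with ATG even though Strand is -), and the standard codon table is used because translation table 11 assigns the same amino acids to codons as the standard code. Only the first two blocks and the final nine bases of the gene sequence are used here; pasting the full field in their place would let the result be compared with the GC content and Protein sequence rows.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G); "*" marks a stop codon.
# Translation table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_blocks = "ATGGTGGGCG TGTCGAAGAT"  # first two blocks of the gene sequence
print(gc_fraction(first_blocks))  # 0.55
print(translate(first_blocks))    # MVGVSK
print(translate("CACCCCTAG"))     # HP (last nine bases, ending in the TAG stop)
```

The two translated fragments match the first six and last two residues of the protein sequence above.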