Gene Information |
Locus tag | Ndas_3143 |
Symbol | |
ID | 9246999 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 3759550 |
End bp | 3760965 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | protein of unknown function DUF901 |
Protein accession | YP_003681058 |
Protein GI | 297562084 |
COG category | |
COG ID | |
TIGRFAM ID | |
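
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA); the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(3759550, 3760965)
print(length)                  # 1416
print(protein_length(length))  # 471
```

Under these assumptions the computed values agree with the Gene Length (1416 bp) and Protein Length (471 aa) fields of this record.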
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0279599 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCCC GCCCCGGTGC GGGCGGCGAC TCCGACGGCG GCGATCCGCT CGACGGCGGC GTCCACCCGG CGGAGGACGG CGGCCACGGC GCCGGCGATG ACGGCGGTGA CGGCGGCGAA CGGCTGGCCC GGCCGCTGCC CGAGCAGGTG CGCGCCCGGG TCGTCGAGTA CGGCTCGGAC GTGCTCGGCG GCATGCGCGC GAGCGACCTG CCCCCGCTGC TGCGCAGGGT CGCCAGGTTC GAGCCGCGCC GCAGGGCCCG GCTCGCCGGA CCGCAGATCG CCGCCCAGCT GGAGAACGAC GAAACCTTCC GCGGCATGGT CGCGGCCCGC GTCGACCAGG TGTGGCCCGA GCTGGCCGAG GGCCTGCGCT CGGGCGTGGT GCCCCCCGCG GCCGACCCCG TGGCCGTGGC CGCCTGCGCC TACCTGCTGC GGCCCCCGGG GTGGCCCGGC ATCGTCGAGG ACGTCCACCG GGAGCTGGAG CGCCAGACCA GCGTCAAGGA GGCGGACCAG GCCGCGGAGG CCCTGGACGC CGCCCGCCGC CAACTGGACG AGACCCGGCA CGACCACCAG GAGGAGCTGG AGCGGCTGCG GTCCCAGATC AAGGCCCAGC GCACCGAGAT CGCCGAGCTG CGCCGCAAGG TGCACACCGA GCGGCAGCGG GCCAAGGAGG CCACCGAGCG GGCCTCGCGT GCGCTGACCG AGACGGCCGG ACGCGAGTCG GAGTCCGCCG CGCGGGTCGG CGCGCTGGAG TCGCAGAACC GGCGGCTGAG ATCGAGGCTG GCCACGGCCG AGGCCCAGCT GGACAACGCC CGGCGGGCGG TGCGCGCCGG ACGCAACGCC GACGAGGCGC GGCTGCGCGT GCTGCTGGAC GTACTGGTGG AGGCCTCCCA CGGCCTGCGC CGCGAACTGG CGCTGCCCAC CGTCCTGGAC AGCCCCGCCG ACCTGGTGGC CGAGACCGAG CAGCAGCGGC GGGTGTCCCT GGGCGGGCTG CCCGACGACG ACCCCGGCCT TCTGGAGCAC CTGCTCACCG CGCCCCGGGT GCACCTGCTG GTGGACGGCT ACAACGTCAC CAAGACCGGC TACGGGACCC TCCCCCTGGC CGACCAGCGC ACCCGGCTGA TGAACTCCCT GGAGGGGCTG GCCAGCCGGA CCAAGGCCGA GATCACGTGC GTGTTCGACG GCGCGGACGT GGACACCCCG CCGGTGATGG CGGCGGCGCG CCGGGTGCGG CTGCTGTTCA GCGCGCCCGG GGAGACCGCG GACGAGCTGA TCGTGCGGCT GGTGCGCGCC GAACCCCCGG GGCGACCGAT CGCGGTGGTC ACCTCCGACC GCGAGATCGT GACGGCGGTG CGCCGCGCCG GGGCGCGCGC GGTGCCCTCG ACGATCTTCC TGCGCCGCCT GGAGGCGCAC GGCTGA
|
Protein sequence | MSARPGAGGD SDGGDPLDGG VHPAEDGGHG AGDDGGDGGE RLARPLPEQV RARVVEYGSD VLGGMRASDL PPLLRRVARF EPRRRARLAG PQIAAQLEND ETFRGMVAAR VDQVWPELAE GLRSGVVPPA ADPVAVAACA YLLRPPGWPG IVEDVHRELE RQTSVKEADQ AAEALDAARR QLDETRHDHQ EELERLRSQI KAQRTEIAEL RRKVHTERQR AKEATERASR ALTETAGRES ESAARVGALE SQNRRLRSRL ATAEAQLDNA RRAVRAGRNA DEARLRVLLD VLVEASHGLR RELALPTVLD SPADLVAETE QQRRVSLGGL PDDDPGLLEH LLTAPRVHLL VDGYNVTKTG YGTLPLADQR TRLMNSLEGL ASRTKAEITC VFDGADVDTP PVMAAARRVR LLFSAPGETA DELIVRLVRA EPPGRPIAVV TSDREIVTAV RRAGARAVPS TIFLRRLEAH G
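
The sequences above are displayed in space-separated blocks of ten characters. As a minimal sketch, the blocks could be joined and a GC fraction computed as follows; the helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence, not on the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace between display blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two display blocks of the gene sequence, used only as an illustration.
first_blocks = "ATGAGCGCCC GCCCCGGTGC"
seq = clean_sequence(first_blocks)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.8
```

Applying the same two functions to the complete gene sequence would give a value to compare against the GC content field of this record.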