Gene Information |
Locus tag | Ndas_0817 |
Symbol | |
ID | 9244662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1007907 |
End bp | 1009553 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | protein of unknown function DUF885 |
Protein accession | YP_003678767 |
Protein GI | 297559793 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0452595 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCGGC GGTTCCGCGA GGTGGCCGAG CGCGTGCTCG ACTCCCTGCT CCACGACGCC CCCGAGTGGG CGCTGGACCT GGGCGACACC CGTGGCGCCT CCCGTCTGTC CGACCACTCG GCCGAGGCCG ACGTGCGCCG CGTCTCCGTG CTCACCGACG CCCTCGGATC CCTGGACGAG ATCGACCCCG ACCTCATCCC GGCCGGCGAC CGGGTCGACC TGGAGGTGCT GCGGACCCGC GTCAGCGCCG ACCTGTGGCA CACCGCGGAA CTGCGCCCGC ACACGTGGGA TCCGCTGCTG TACTCGCCGG GCGAGGCCCT GCACGCCCTC GTGGAGCGTG AGGTCCTGCC CCTCCCCGAA CGCCTGTCGG CGCTCGCCGC GCGCTGCGCG GCCCTGCCCG TCCACCTCGC CACCGCGCGC TCGCGCCTGT CGGAGGGCCC CGGCATGCCC CGCGTGCACG TGGAGACGGC CCTGGCCCAG GCGGCCGGGG CCCGCGCCAT GCTCACCTCC GACGTGCCCG CCCCGGCCGA GGGAGCGCCC TCCGCCCTGG AACCGGCCCG CGAGGCCGCC CTGGCCGCCG TGGAGGAGCA CGCCGCCTGG CTGCGGGACC GTCTGGAGAC CGCCACCGCC GACCCCCGTC TGGGCGAGCG CGACTTCGCC GCCCAGCTCT GGTACACCCT CGACTCCGAG CTCTCGCCCG AGGCGCTGCT GGTGCGCGCC GAGAGCGACC TGCTGGCCAC GGAGGAGGCG ATCGCCGAGA CGGCGGCCGA GTACCTGGGC GGGGCGCGCC GCCGGGAGGG GGTGGCCGAG GCGCTCGCCG AGCTGGCCGC ACGGGGCGCC ACCGACGCCG ACACCGTCCG CCCCGCCTGC GCCGACGCCC TCCTGCACCT GAACGAGCGG GTGCGCGCGC TGGACATCGT GACGGTCCAC GACGACCCGG TCCGGATCGT GCCGATGCCC GAGGCCCGCC GCGGGGTGTC GGTGGCCTAC TGCGAGCCCC CCGGCCCCCT CGACCCGCGG TCCGGGGAGC AGCCGACCCT GGTAGCGGTG GCCCCGCCGC CGGAGGACTG GCCCGCCGAG CGCAGGGAGT CCTTCTTCCG CGAGTACAAC GCGGTCATGC TGCGCGACCT CATGGCCCAC GAGGCCGTTC CCGGGCACGC TCTCCAGCTC GCCCACGCCG CCCGGCACGA GGGCGGCACC CGGGTGGGCC GGGCCCTGTG GAGCGGCACC TTCGTGGAGG GCTGGGCGGT CTACGCCGAG GAGGTGCTGG CCCGCCACGG CTGGTCCGGC GACCGGCGCG AGGACCTGGC GCTGCGCCTG GTGCAGCTCA AGATGCGCCT GCGGATGATC ATCAACGCGA TCCTGGACGT GCGCCTGCAC ACCGGCGACC TCACCGAGGC CGAGGCGATC TCCCTGATGA CCCGGCGCGG ACACCAGGAG GAGGGCGAGG CCGTCGGCAA GTGGCGCCGC GCCCAGCTCA CCAGCGCCCA GCTGTCCACC TACTACGTGG GCTACGCGGA GGTCTCCGAC ATCGCCAGCG ACCTGGCCCT GGCCCGGCCC GCGCTCACCG AGCGCGAGCG CCACGACGCG ATGCTCGCCC ACGGCAGCCC GCCCCCGCGC CACCTGCGCA CCCTGCTGGG GCTGTAG
Protein sequence | MTRRFREVAE RVLDSLLHDA PEWALDLGDT RGASRLSDHS AEADVRRVSV LTDALGSLDE IDPDLIPAGD RVDLEVLRTR VSADLWHTAE LRPHTWDPLL YSPGEALHAL VEREVLPLPE RLSALAARCA ALPVHLATAR SRLSEGPGMP RVHVETALAQ AAGARAMLTS DVPAPAEGAP SALEPAREAA LAAVEEHAAW LRDRLETATA DPRLGERDFA AQLWYTLDSE LSPEALLVRA ESDLLATEEA IAETAAEYLG GARRREGVAE ALAELAARGA TDADTVRPAC ADALLHLNER VRALDIVTVH DDPVRIVPMP EARRGVSVAY CEPPGPLDPR SGEQPTLVAV APPPEDWPAE RRESFFREYN AVMLRDLMAH EAVPGHALQL AHAARHEGGT RVGRALWSGT FVEGWAVYAE EVLARHGWSG DRREDLALRL VQLKMRLRMI INAILDVRLH TGDLTEAEAI SLMTRRGHQE EGEAVGKWRR AQLTSAQLST YYVGYAEVSD IASDLALARP ALTERERHDA MLAHGSPPPR HLRTLLGL
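The length fields in the record above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of this unspliced CDS is a stop codon. The function names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values copied from the record for locus tag Ndas_0817.
length = gene_length(1007907, 1009553)
print(length, protein_length(length))  # prints "1647 548"
```

Both results agree with the Gene Length (1647 bp) and Protein Length (548 aa) fields reported for Ndas_0817.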