Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4820 |
Symbol | |
ID | 9248704 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5711050 |
End bp | 5712393 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | Protein of unknown function DUF2252 |
Protein accession | YP_003682710 |
Protein GI | 297563736 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.923578 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGACC GGCTCGATTC CACCGACAGC TCCGCGGAGC GCCGCGACCT CATCGTCGCC ACGCTGGAGA ACGCCTTCTC GGACCTGATG TCCGCCGACC CGGCGGCGTT CCGCGTCAAG TTCCGCAAGA TGGCGGCCAA CCCGTTCGCC TTCTACCGGG GCAGCGCCGC GCTCTTCTAC GACGACGTCT CGGGCATGGA CGACCCCTGG GCCGACGAAC GCACCTCGCG GGTGTGGATC CAGGGTGATC TGCACGCGGA GAACTTCGGC ACCTACATGG ACTCCACCGG GCGGCTGGTG TTCGACGTCA ACGACTTCGA CGAGGCCTAC CTCGGCCACT TCACCTGGGA CGTGCTCAGG TTCGCGGCCA GCATCGGGGT CATGGGCTGG CAGAAGGCCC TGTCCGACGA GGACATCAGC GCGCTCCTGC CGCACTACGT CGACGCCTAC ATCGCCCAGG TGCGCGAGTT CGCGACGACG GGCAACGACT CGGAGTTCTC GCTCAAGCTG GGCAACACCG ACGGCACCGT GCACGACGTG CTCCAGAAGA CCCGGCTCAA CAGCCGCGCC GAGATGCTGT CGTCCATGAC GACCCGCGAC GGGTACACCC GCCGCTTCGC CGAGGGCCCC CGGGCCCGCC GCCTGGACGA CGCCGAGCGG GAACGGGTCC TGGCCGCCTA CGAGGCCTAC CTGGGCACCA TCCCCGAGGA CCGGCGGTAC GCGTCGATCA ACTACGCGGT CAAGGACGTG GTGGGCAGCG GCGGCTTCGG GATCGGCTCG GCGGGGCTGC CCGCCTACAC CCTGCTCATC GAGGGCCAGT CGGAGGCCTG GGACAACGAC ATCGTGCTGT CCATGAAGCA GGGCAACGTG GCGGCGCCCT CGCGTGTGGT GACCGACCAG CGCATCATGG ACCACTTCCA GCACCACGGG CACCGCACCG CGATGTCCCA GCGGGCGCTC CAGGCGCACG CGGACCCGCT GCTGGGCCAC ACCGAGATGG GCGGCGTGGG CTTCGTGGTG AGCGAGGTCT CCCCCTACAC CAACGACCTG GACTGGGACG ACCTCACCGA GCCCGCGGAG ATCGCGCCGG TGCTGGACTA CCTGGGGCGC GCCACCGCCA AGGCGCACTG CGTGTCCGAC TCGCACGCGG ACGCCACGCT CGTGCGCGGT CAGACCGAGG AGGCGGTCAT GGCGGTGCTC GACGGCCGCG AGGCGGAGTT CACGCGGTGG TGCGTGGACT TCGCGCACCG GTACGCGGCC CAGACCCGGG CCGACTACTC GCTGTTCGTG GACGCCTTCC GCAACAACGC CATCAGCGCG GTGAGGTCCA GCCGGGACCT GTGA
|
Protein sequence | MLDRLDSTDS SAERRDLIVA TLENAFSDLM SADPAAFRVK FRKMAANPFA FYRGSAALFY DDVSGMDDPW ADERTSRVWI QGDLHAENFG TYMDSTGRLV FDVNDFDEAY LGHFTWDVLR FAASIGVMGW QKALSDEDIS ALLPHYVDAY IAQVREFATT GNDSEFSLKL GNTDGTVHDV LQKTRLNSRA EMLSSMTTRD GYTRRFAEGP RARRLDDAER ERVLAAYEAY LGTIPEDRRY ASINYAVKDV VGSGGFGIGS AGLPAYTLLI EGQSEAWDND IVLSMKQGNV AAPSRVVTDQ RIMDHFQHHG HRTAMSQRAL QAHADPLLGH TEMGGVGFVV SEVSPYTNDL DWDDLTEPAE IAPVLDYLGR ATAKAHCVSD SHADATLVRG QTEEAVMAVL DGREAEFTRW CVDFAHRYAA QTRADYSLFV DAFRNNAISA VRSSRDL
|
| |