Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5333 |
Symbol | |
ID | 9249233 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | + |
Start bp | 499466 |
End bp | 500950 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | protein of unknown function DUF309 |
Protein accession | YP_003683219 |
Protein GI | 297564246 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0481251 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTTGGACG ACTCGGCCGC CTACGTCGAC GGCCCCTGGA CCCACAGGAC CGTCAGCGCG GCGGGGGCCC GGTTCCACGT CGCCGAGGCC GGTGACGGGC CGCTGGTGCT CCTCCTCCAC GGCTTCCCCC AGTTCTGGTG GGCCTGGCGC GCCCAACTGA CCGCGCTCGC CGACGCCGGT TACCGCGCGG TCGCCGCGGA CCTGCGCGGC TACGGCGCCA GCGACAAGAC CCCGCGCGGC TACGACCTCG TCACCCTCGC CCAGGACGCC GCCGGACTGG TCCGCGCCCT CGGCTCACGG GACGCGGCCG TGGTCGGGCA CGGCCTGGGC GGCCTCGTCG CCTGGACGAT GACCGCCTAC CACCCCGGCA CCGTGCGCGC CCTGGCCGCG GTGTCGTCGC CGCACCCGCT GCGAGCGGCC CGCGTCCTGG CCTCCGGCGG TCCCGGCGTC CGCCACATGC TCCGGGCACA GCTGCCGATC CTCCCCGAGC ACCGGCTCCT GAGCGACGGG TGCGAACGCG TGGGCGACCT GCTCCGGGAG TGGTCGGGCC CCGGCTGGCC CGACACCGAG GCCGAGGAGC ACTACCGCCG CGCCTTCGCC ATCCCCAAGG TCTCCCACTG CTCCCTGGAG AGCCACCGCT GGATCTTCCG GTCGCGGTGG CGCACGGACG GACTCCGCTA CGACGCGCGG ATGCGCCGTG CTCCCGTGCG CGTCCCCGTC CTCCAGCTCC ACGGGACCCT CGACCCGGTG TGCCCGCCCG GACCGGCACG CGCCTCACGG GGCGTGGTGA CGGGGCCCTA CCGTTGGAGG CAGGTCCACG GCGCGGGACA CTTCCCGCAC GAGGAACGAC CGGAGGAGGT CTCCCGCGCG CTCGTCGAGT GGCTCGCGGA GGTCTCGGCG GTGAGGCGGA CGGAGTCCTC GCAGGCGAGG GGGAACGGCG GTGAGGCACT GATGGCGACG GAGACCGCGG ACGGGCACCG GGAGTCCGGG GGCCGAGGAG GCGGGCGGGA CCGCAACGAG ACCGGGCAGG CGCAGAACCA GCGCCCCCGG GACCGGTACG GGCGCCCGAT GCCGCACGGG AGCCGGGGCG AGGTGGAGCG GGTCCCCGAC GACGCCGAGT TCTCCGCGGA GGAAGGTCTG GAGGAGGCCC AGCGGCTGCT CGACCAGGGG TACGCTTTCA CCGCCCACGA GGTCCTCGAA GCCGTGTGGA AGTCCGCGCC CGACCCCGAG CGGGAACTGT GGCGCGGCCT CGCCCAGACG GCCGTGGGGG TCACCCACGC GCAGCGCGGC AACATGGTCG GCGCGGCACG TCTGCTGAGG CGGGGCGCGG ACCGCGTGGA GCCGTTCGGG CCGGACGCCC CCCACGGGGT GGACGTGGCG GGGGTGGCGG CGTTCGCCCG CGCCCTGGCC GACGACCTCG ACGCCGGGCG CGCCCGCCCG GGTGACGGGA TCGACCCTTC GGGGATGCGC CTGCTCGGCG GGTAA
|
Protein sequence | MLDDSAAYVD GPWTHRTVSA AGARFHVAEA GDGPLVLLLH GFPQFWWAWR AQLTALADAG YRAVAADLRG YGASDKTPRG YDLVTLAQDA AGLVRALGSR DAAVVGHGLG GLVAWTMTAY HPGTVRALAA VSSPHPLRAA RVLASGGPGV RHMLRAQLPI LPEHRLLSDG CERVGDLLRE WSGPGWPDTE AEEHYRRAFA IPKVSHCSLE SHRWIFRSRW RTDGLRYDAR MRRAPVRVPV LQLHGTLDPV CPPGPARASR GVVTGPYRWR QVHGAGHFPH EERPEEVSRA LVEWLAEVSA VRRTESSQAR GNGGEALMAT ETADGHRESG GRGGGRDRNE TGQAQNQRPR DRYGRPMPHG SRGEVERVPD DAEFSAEEGL EEAQRLLDQG YAFTAHEVLE AVWKSAPDPE RELWRGLAQT AVGVTHAQRG NMVGAARLLR RGADRVEPFG PDAPHGVDVA GVAAFARALA DDLDAGRARP GDGIDPSGMR LLGG
|
| |