Gene Information |
Locus tag | Ndas_0262 |
Symbol | |
ID | 9244096 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 326518 |
End bp | 327783 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | protein of unknown function DUF21 |
Protein accession | YP_003678217 |
Protein GI | 297559243 |
COG category | |
COG ID | |
TIGRFAM ID | |
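The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Ndas_0262.
length_bp = gene_length(326518, 327783)
print(length_bp)                  # 1266
print(protein_length(length_bp))  # 421
```

Under these assumptions the result matches the listed Gene Length (1266 bp) and Protein Length (421 aa).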
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.558818 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGGGCA TCGGGGCCCA GATCGGCCTC GTCCTGGTCC TGGTCGTGGT GAACGCCGTG TTCGCGGGCA GCGAGATCGC CCTGATCACG CTGCGGGAGG GGCAGATCAG GCGGCTGGAG GAGCGCGGCC CGGGTGGGCG CGCGGTGGCC CGACTGGCCC GCGACCCCAA CCGGTTCCTG GCCACCATCC AGATCGGCAT CACCCTCGCG GGCTTCCTCG CCTCCGCCAC CGCCGCCGTC TCCCTGGCCC GGCCGCTGGT CGAGCCGCTG GGCTTCCTCG GCTCCTACGC GGCCCCGGTG TCGGTCGTGC TGGTCACCGT CGTGCTCACC TTCGTCACCC TGGTCCTGGG CGAACTCGCG CCCAAGCGCA TCGCCATGCA GCGCGCCGAG CCCTGGGCAC TGCTCGTGGC CCGCCCGCTC AACGCGCTGG CGCTGCTCTC GCGCCCCGCG ATCTGGCTTC TCAGCGCCTC CACCGACCTG GTCGTCCGGC TCGGCGGGCT CGACCCGCAC GGCGCCAGGG AGGAGGCCAC CGAGGAGGAG CTGCGCGACA TGATCGAGGC CCAGGGCCAT ATGACCCCCG AACAACGCAC CATCCTCTCC GGGGCCTTCG ACATCACCGG GCGGACGCTG CGCCAGGTCC TGGTCCCGCG TCCCGACGTG GACACCGTCC CGGCCGACCT GCCCGCGTGC GAGACAGCCC TGCTCCTGGC CGAGCACGGC CACTCGCGCG CCCCGGTGGT CGGCCGGGAC GACGTGGACG ACGTCGTCGG CGTGGTGCAC TGGTCCGACC TCCTGCGGGG CGAGGGCGCG GCCCGGGAGC TGGCCCGCGA ACCGCTGCTG CTGCCCGACT CGCTGACGGT CTCCGCCGCC CTGCACCGGC TCACCGTCGA ACGCCAGCAG CTGGCCGTCG TGATCGGTGA GAGCGGCGAG GTCAGCGGCA TCGTCAGCCT GGAGGACCTG CTGGAGGAGG TCGTCGGCGA GATCTACGAC GAGACCGACA CCGACGAGCG CGCCCCCGCC CGGCTGGACG GCGGCGCCCT GCGCCTGCCC GGCGTGTACC CCGTCCACGA GCTGGAGGAC CTCGGCGTCG TCCTCACCGA CCGGCCCCGG GGCAGTTACG TGACCGTCGC CGGGATGGTC CTGGTCCTGC TCGGCCACAT CCCCGACGAG CCCGGGGAGA GCGTGGACCT GGGCGGCTGG ACGGCCACGG TCACCGAGGC CGACGGCCGC GTCGTCAGCG AACTCCTGCT CACCCCCGCC CGCTGA
|
Protein sequence | MEGIGAQIGL VLVLVVVNAV FAGSEIALIT LREGQIRRLE ERGPGGRAVA RLARDPNRFL ATIQIGITLA GFLASATAAV SLARPLVEPL GFLGSYAAPV SVVLVTVVLT FVTLVLGELA PKRIAMQRAE PWALLVARPL NALALLSRPA IWLLSASTDL VVRLGGLDPH GAREEATEEE LRDMIEAQGH MTPEQRTILS GAFDITGRTL RQVLVPRPDV DTVPADLPAC ETALLLAEHG HSRAPVVGRD DVDDVVGVVH WSDLLRGEGA ARELAREPLL LPDSLTVSAA LHRLTVERQQ LAVVIGESGE VSGIVSLEDL LEEVVGEIYD ETDTDERAPA RLDGGALRLP GVYPVHELED LGVVLTDRPR GSYVTVAGMV LVLLGHIPDE PGESVDLGGW TATVTEADGR VVSELLLTPA R
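The sequences are displayed in space-separated blocks of ten characters, so the spaces have to be removed before any computation. As a minimal sketch, the helper names below are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record; whether the listed GC content refers to the gene or to the replicon is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two ten-base blocks of the gene sequence, copied from the record.
first_two_blocks = "ATGGAGGGCA TCGGGGCCCA"
print(len(clean_sequence(first_two_blocks)))  # 20
print(f"{gc_content(first_two_blocks):.0%}")  # 70%
```

Passing the full Gene sequence string to `gc_content` would give the GC fraction for the whole gene.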