Gene Information |
Locus tag | Ndas_0850 |
Symbol | |
ID | 9244695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1042368 |
End bp | 1044992 |
Gene Length | 2625 bp |
Protein Length | 874 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | protein of unknown function DUF214 |
Protein accession | YP_003678800 |
Protein GI | 297559826 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0147973 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.214983 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCGA CCCCCGCACC GAATCCGGCT TCCGGTCCGG CCTCTGGTTC CGGCCCCGCC TGCGGTCAGG GCCCCGTGGG CGGACGCGGC CCCCTAGCGC TGAAGCTGGC GCGCGGCCGT GCCGGGGTGC TGGCCGCTGT CGCCGTCGCG GTCCTGGGCG GATCGGCGTT CGTCACCGTC GGCGGCGTGC TGGCCGACAC CGGGCTGCGC TCGTCGGTCC CCGCCGAACG CCTCCGGGGC GCGGACGCCG TGGTCGCCGC CCCCCAGACG GTCGAGCAGG CCGAGGACCT GGACCTGCCC CTGCCCGAGC GGGCCGGACT GCCCGAGGAG CTCACCGCCC GGCTGGCCGC CCTGCCCGAG GTCGGCACCG CCGTCGGCGA CGTGGGGTTC CCGGCCGCCG TGCACACCGG GGACGGGCCC GCCACGGGAG CCGACCCCCG CACCACCGGC CACGGGTGGT CCTCCGCCGC GCTGCTGGAG GACGCCCCCT TGGAGGGGGA GGCGCCGCGG GGGCCGGGGG ATGTGGTCCT GGACGCCGCC ACGGCGGCCG CCGCCGACGC CGGGGTCGGC GACCGGGTGC GGGTCACCGC GGCGGGGCGC ACCGGCGACC ACCGGCTCTC CGGGGTGCTG GACACCGGCG CCACCGTGGT CCTGTTCGAC GACGAAACCG CCGCCGACCT GGCCGGACGC ACCGAGGGCC CGCGCGAGGG CACCGTGGAC CTGGTGGCCC TGCGCGCCGC GCCCGGCGTG TCGCAGGAGC GGTTCACCGG GGCGGTGCGG GAGGCCCTGG CGGGCGAGGA CGCGCTCGTG CGGACCGGGG ACCGGATCGG CGACGCCGAG TCCCCGGCCG CCGGAGCCGC ACGCGGTGTC CTGGTCGCGG TGGCCGGTTC GCTCAGCGGT GTCCTGGTGA TGACCATCGG CCTGACCGTC GCCGGGGCGC TGTCGGTCTC CGTGGCCGCC CAGCGCCGCG ACCTGGCGCT GCTGCGGGCC GTGGGCGCCA CACCCCGGCA GATCCGCCGC CTGGTCGCCG CCCCCAACCT CCTCGTCACC CTGGCGGCCC TGCCCTTCGG CGTGGCCGGC GGCTATCCGC TGGCCGGGGT CGCGCTGGAC TGGTTCGCCT CCCTGGGCCT CGTGCCCCCG GGCCTGCCGC CGGTGTTCGG ACCGCTGCCC GCCCTGGCCA CGGCGGTGTT GATGGTGCTC GCCGTGTGGC TGGCCTCGCT GGCCGCGGTC GGCCGGACCG CGGCCGCGCC GCCCACCGAC GCGCTCGCCG AGTCCGTCGC CGAACCGCGG GTTCCCGGCC GGGCGCGGAC CTGGACGGGG GCGGCGCTGC TCCTGGCCTC GGTGGGGGCG TCGGTCCCGG CGCTGGTGCT GGGCGACGAG ACGGCTGCGG TCGGCCCGGC CTCGGCCAGC CTGCTCGCGG TGATCGGGCT GGCCCTGGTC GGCCCCGCCC TGCTGCGGGC GGTGAGCGGG GCGCTCGGCC GCCGCCTGCC CGCGCGTTCG TCCGCCCTGA CCTGGCTGGC GGTGCGCAAT CTGCACGGCC ACTCCCACCG GGTGGCGGGC GCGGTCAGCG CGCTGGCGAT GCTGGTGGCC TTCGCGCTCA GCCAGGGGTA CGTCAACACC ACGCTGCTGG CCGCGCAGAC GGAGCAGCGA CAGGACGGCG AGCTGGCCGC GAGCACGCTG ACGGCGCCTG CGCTCGGCGG GCTTCCGCTG GGGCTGGCCG AGACCGTTCG GGAGGACCCG TCCGTCGCCG CGGCCGTCAC CGCCACGCCG ACCGCCCTCG TGTGGACCAC CGAGCTGCCG 
TTCGATGAGG GGGTGCTGCA CCAGGAGACG CCCGCGCTGG TCCTGGGGCC GGGAGCCCCG GAGGTCGTCG ACGTGGGTGT CACCGAGGGC GACCTGGGCG CGCTGACCGG CGACACCGTC GCGCTGGGCG CGCAGACCGC GCGCTCCCTC GGAGCGGAGC CGGGGCAGAG GGTGGCGTTC CGCATGGGGG ACGGGACCGA GGCCAGCGCC GAGGTGGTCG CGCTGTACGA GCGCGAACTC GGTTTCGGCC CGGTCCTGGT CTCGCGGGAC CTGGCCCGGG GACACCTGAC CGGGGACCTG CCCGCCGAGT TGCGGGTGCT CGCCGAACCC GGGCGGGAGT CCGAGGTGCG CGGGGTCCTC GACGGCCTGG CCTCGGCGCA TCCGGGGGTG GAGGTCGCCG ACACCGCGCC GGACGCGGCC GGGAGCGGGC TCGCGGCGAA CACGGTGCTC AACCTCGTGG TGCTGCTCTG CCTGGCGGGG TACGTGCTGC TCTCGGTGGC CAACCGCCTG TCCGCCCAGA CCCTGCGCCG CCGCGCGGAG ACGGACTCGC TGCACGCCGT GGGCATGACC CCCGCGCAGG TCCGGTCGCT GCTGCGCAGG GAGGCCGCCC TGATCGCGGT CGGCGCGGTC GTCGCCGGTG TCCTGGCCTC GGCGGTCCCG CTCGCCTTCG CGGGGATGGG GCTCCTGGGG CGTCCGTGGC CCGGCGGCCC GTTCTGGCTG CTGCCCGCGA CCGTGCTGGT GGTCGCGGTG GTGTCCTGGC TGTGCGCGGA GCTGCCCGCG AGGAGGCTGA CCGCGGACCG GTGGCGCGCC GGGGCCGGGG CCTGA
|
Protein sequence | MTATPAPNPA SGPASGSGPA CGQGPVGGRG PLALKLARGR AGVLAAVAVA VLGGSAFVTV GGVLADTGLR SSVPAERLRG ADAVVAAPQT VEQAEDLDLP LPERAGLPEE LTARLAALPE VGTAVGDVGF PAAVHTGDGP ATGADPRTTG HGWSSAALLE DAPLEGEAPR GPGDVVLDAA TAAAADAGVG DRVRVTAAGR TGDHRLSGVL DTGATVVLFD DETAADLAGR TEGPREGTVD LVALRAAPGV SQERFTGAVR EALAGEDALV RTGDRIGDAE SPAAGAARGV LVAVAGSLSG VLVMTIGLTV AGALSVSVAA QRRDLALLRA VGATPRQIRR LVAAPNLLVT LAALPFGVAG GYPLAGVALD WFASLGLVPP GLPPVFGPLP ALATAVLMVL AVWLASLAAV GRTAAAPPTD ALAESVAEPR VPGRARTWTG AALLLASVGA SVPALVLGDE TAAVGPASAS LLAVIGLALV GPALLRAVSG ALGRRLPARS SALTWLAVRN LHGHSHRVAG AVSALAMLVA FALSQGYVNT TLLAAQTEQR QDGELAASTL TAPALGGLPL GLAETVREDP SVAAAVTATP TALVWTTELP FDEGVLHQET PALVLGPGAP EVVDVGVTEG DLGALTGDTV ALGAQTARSL GAEPGQRVAF RMGDGTEASA EVVALYEREL GFGPVLVSRD LARGHLTGDL PAELRVLAEP GRESEVRGVL DGLASAHPGV EVADTAPDAA GSGLAANTVL NLVVLLCLAG YVLLSVANRL SAQTLRRRAE TDSLHAVGMT PAQVRSLLRR EAALIAVGAV VAGVLASAVP LAFAGMGLLG RPWPGGPFWL LPATVLVVAV VSWLCAELPA RRLTADRWRA GAGA
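
The length fields in this record can be cross-checked against the coordinates. The sketch below is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values copied from the record above (Ndas_0850).
start_bp, end_bp = 1042368, 1044992
length_bp = gene_length_bp(start_bp, end_bp)
print(length_bp, protein_length_aa(length_bp))
```

Under these assumptions the computed values agree with the Gene Length (2625 bp) and Protein Length (874 aa) fields listed above.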