Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4699 |
Symbol | |
ID | 9248581 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5578182 |
End bp | 5579435 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | protein of unknown function DUF1205 |
Protein accession | YP_003682591 |
Protein GI | 297563617 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.363631 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.334984 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGTGC TCTTGGCCGC GGCCGCGGAG AAGCCGCACT TCCTCGGCAT GGTCCCGTTG GCGTGGGCGC TGCGCGCCGC GGGCCACGAG GTGCGGGTGG CCAGCCAGCC CGCGCTGGAG CCCGTCGCGG CGGGGGCGGG CCTGCCCTTC ACCGCCGTCG GCAGGGACCA CGCCTTCTGG CGCACGATGA AGGCCTTCGA CCTCCACGAC ACCCTCGACG ACGTCCCGCT GTTCGGCCGC GTCACCGACC CCTACGAGCG GGTTCCCTGG GAGTACCTGC TGGAGGGCTA CCGCCGTGTG GTGCCGTGGT GGTGGCGGAT GGTCAACGAC CCCATGGTCG ACGACCTGGT CGCCCTCTGC CGCGAGTGGC GTCCCGAGCT GGTGGTCTGG GGTTCGGTGA GCTTCTCCGG GGCGATCGCC GCCGAGGCGT GCGGGGCCGC GCACGTGCGC TACCTGTGGG GGGCCGACAT CTTCGCCCGC ACCCGCGCGC GCTTCCTGGC GCGGATGGGC GAACAGCCCG CCTCACAGCG GGAGGACCCC CTGGCCGCGT GGCTGGGGAC CAGGGCGGCC CGGTACGGCG TGGACTTCTC CGAGACCCTG GTCCACGGCC AGGCCACCGT CGAGCAGGTC CCCGCGTCCC TGCGGGTGGA CACGCCCGCG CACCTGGAGT ACCTGCCGGT GCGCTACGTG CCCTACAACG GACGCGCCGT CGTCCCCCAC TGGCTGCGCA CACCCCCCAC CCGCCCCCGG GTCTGCGTCA CCCTGGGCAC CACCCTCATG GGCCAGGACC GGGGCGGGGA CGTGTTCCGG GACCTCCTGG AGGGTCTGGC CGAGCTGGAC GTGGAGGTCG TGGCCACCCT GCCCGCCCGC GAACAGGCCA AGCTCGGCAC CGTCCCCGGC AACGCCCGCC TGGTCGAGTA CGTCCCCCTG CACGCCCTGG CCCCCACCTG CGCCGCCATG GTCGACCACG GGGGGTGGGG GACCGTGCTC ACGGGACTGG ACGCGGGCGT GCCGCAGGTC ATCGTCCCCA GCTGGTTCGA CGATCCGATG CTCGCCGACA TGCTCGCCGC GCGGGACGCG GCCGTCTCCG TCCCCCACCG GACCATGACC GCCGGGGACG TCAGCACGGC GGTCTCCCGG CTTCTGGAGG ACCCCGCCCT GGCCCGGGGC ACCGCGCGCG TGCGCGAGGC GATGCGCGCG ATGCCCTCTC CGGCCGACCT CGCCGACGCG CTCGTCCGCC GGGCGGGGGG CTGA
|
Protein sequence | MRVLLAAAAE KPHFLGMVPL AWALRAAGHE VRVASQPALE PVAAGAGLPF TAVGRDHAFW RTMKAFDLHD TLDDVPLFGR VTDPYERVPW EYLLEGYRRV VPWWWRMVND PMVDDLVALC REWRPELVVW GSVSFSGAIA AEACGAAHVR YLWGADIFAR TRARFLARMG EQPASQREDP LAAWLGTRAA RYGVDFSETL VHGQATVEQV PASLRVDTPA HLEYLPVRYV PYNGRAVVPH WLRTPPTRPR VCVTLGTTLM GQDRGGDVFR DLLEGLAELD VEVVATLPAR EQAKLGTVPG NARLVEYVPL HALAPTCAAM VDHGGWGTVL TGLDAGVPQV IVPSWFDDPM LADMLAARDA AVSVPHRTMT AGDVSTAVSR LLEDPALARG TARVREAMRA MPSPADLADA LVRRAGG
|
| |