Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4700 |
Symbol | |
ID | 9248582 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5579642 |
End bp | 5580916 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | protein of unknown function DUF1205 |
Protein accession | YP_003682592 |
Protein GI | 297563618 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.626027 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.337723 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCGTCG TCGTGGCGTC CCTCGCCGAG AAGACGAACT TCCTGAGCCT GGTGCCCCTG GCGTGGGCGC TGCGCGCCGC CGGGCACGAG GTGCGGGTGG CCAGCCAGCC CGCGCTGGAG CCCGTGGTGC GGGAGACGGG CGTGCCGTTC GTCGCGGTGG GGCGCGACCA CGGGTTCTGG CGCCATCTCA CCGCCCGGTC CTCCTTCGAC GGGATGCGGG GAGGCGTCCC CCTCTTCTCC GTGTACGGCC GGGGCGGGCC GGAGGGCTCC TGGGAGGAGA CCCTGGAGGA GTACCGGCAG GTCGTCACCT GGTGGTGGCG GATGGTCAAC GACCCCATGG TCGACGACCT GGTCGCCCTC TGCCGCGAGT GGCGCCCCGA CCTGGTCGTG TGGGAGCCCA TCACCTTCTC CGGGGCGATC GCCGCCGAGG CCTGCGGGGC CGCGCACGTG CGCTATCCCT GGGGCGCGGA CGTGTTCGGC GCCGTACGCG CGCGCTTCCT GGCGCGGATG GGCGAACAGC CCGCCTCACG GCGGGAGGAC CCCCTGGCCG CGTGGCTGGG GACCAGGGCG GCCCGGTACG GCGTGGACTT CTCCGAGACC CTGGTCCACG GCCAGGCCAC CGTCGAGCAG GTCCCCGCGT CCCTGCGGGT GGACACGCCC GCGCACCTGG AGTACCTGCC GGTGCGCTAC GTGCCCTACA ACGGACGCGC CGTCGTCCCC GAATGGCTGC GCACACCCCC CACCCGCCCC CGGGTGGCCC TGTGTCTGGG CACCAGCACG GCGGCGTGGC TGGGCAGGTT CGGGGTGGAC GTGGCCACGG TTCTGGAGGG TCTGGCCGAG CTGGACGTGG AGGTGGTGGC CACCCTGCCC GCCAGTGAGC AGGCCAAGCT CGGCGCCGTC CCCGGCAACG CCCGCCTGGT CGAGTACGTG CCCCTGCACG CCCTGGCCCC CACCTGCGCC GCCATGATCA CCCACGGCGG GACGGGCACC GTCCTGACCG GTCTGGCCCA CGGGGTCCCG CAGCTCGTCT CGCCCCGGCC CACCTTCGAC GAACCCCTGC TGGCCTCGTC GGTCGCGGCC GAGGGCGCGG CGCTGGTCGT GGACCCCGAC CGCATGGACG CCGCCACCGT CACCGCCGGC GTACGCGCCC TCCTCGAAGA CCCCCGCCAC ACAAGCGCCG CCCGCGCCCT GCGCGCACGC ATGGACGCCA TGCCCACCCC CGCCGACCTC GCCCACACCC TCACCGCGCG CCGGCCCGCA TTCCGAAGCA AATGA
|
Protein sequence | MRVVVASLAE KTNFLSLVPL AWALRAAGHE VRVASQPALE PVVRETGVPF VAVGRDHGFW RHLTARSSFD GMRGGVPLFS VYGRGGPEGS WEETLEEYRQ VVTWWWRMVN DPMVDDLVAL CREWRPDLVV WEPITFSGAI AAEACGAAHV RYPWGADVFG AVRARFLARM GEQPASRRED PLAAWLGTRA ARYGVDFSET LVHGQATVEQ VPASLRVDTP AHLEYLPVRY VPYNGRAVVP EWLRTPPTRP RVALCLGTST AAWLGRFGVD VATVLEGLAE LDVEVVATLP ASEQAKLGAV PGNARLVEYV PLHALAPTCA AMITHGGTGT VLTGLAHGVP QLVSPRPTFD EPLLASSVAA EGAALVVDPD RMDAATVTAG VRALLEDPRH TSAARALRAR MDAMPTPADL AHTLTARRPA FRSK
|
| |