Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0947 |
Symbol | |
ID | 9244792 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1164179 |
End bp | 1165300 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | cobalbumin biosynthesis protein |
Protein accession | YP_003678897 |
Protein GI | 297559923 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.33915 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.165482 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGTTTA CTTTTTCGGT GCGTATCCGA TTTCTCGGGA CTGCCGGGCA CTCGGGGTGG CCCGCCCCCC AGTGCACCTG TGCCTCCTGC AACAAGGCCC TCTCCGAGCG CCGCCTGCCC CTGCGCGTGA CGGTGGACGA CCGTTTCCGC GTCCGTGAGG CGGGGGTGGT CGGCCCGGTG CCGCCGGGGT ACTCGGTGAG CCCGACCCCC GACGGCGTCC TGGTCGAGGG CCCCGACGGG GGCAGGCTGA TGTACGCGCG CACGGGCCCG GGGCCCCTGC CCGGCGTCCC CCCGGCGGAG GCGCGCGCCG AGGGCTCCTC CGGCTACTCC TCCGCCCGCC CCGACCAGCA GGTGGACATG GTCATCGTGG ACGCCGCGCA GCGCCCGGAG GTCATCGGCG AGCTGCGCCG CTCGGGGGTG ATCGGGGTGA CCACGGCCGT CGTCGCGGTC GGCGGCGACC ACCGCGTCCG CTCACCCGAG GAGTTCGCCC GCCGGGCCCG GCTGTGGGGC GCCCTGGTGC CGGGCGACGG GCAGGTCCTG TCGTGCCCTC CGGCGGTGTG GCCGGAGTCG CGGCCGCGCG GGCCGCACCG GACCCTGGTG ACCGGGGGCG CGCGCTCGGG CAAGTCCACC GAGGCGGAGC TGCGCCTGAT GTCGGAGCCG AAGGTGCTGT ACGCGGCGAC CGGTCCGGAG CCCGACCCGG ACGCGGACCC CGACTGGGCC GACCGGGTGG CCCGGCACGT GCGGCGCCGC CCCTGGTGGT GGCGCACCGA GCAGACCACC GACCTGGCCG CGCTGCTCAA GGGCGCCCGG GGCGCGGTCC TGGTGGACTG CCTGGGCACC TGGCTGACCC GGGTGATGGA CGGGGCGGGG CTGTGGGAGG ACAGCCCACC GCCCGACGCG GAGGAGGAGG TGGAGGCGGC GGTCCACGGG CTGCTGGACG CCTGGCGTTC CACCCAGGCC TACGTCGTGG CGGTCACCAA CGAGGTGGGG TCGGGTGTGG TGCCCGCGAC CCGTGCCGGG GGGCTGTTCC GCGACCACCT GGGCCGGTTG AACCAGTGGG TCGGCGCCGA GTCCGAGGAC GTGGTGCTGG TGGCAGCGGG CCGCGTCCTG GAGCTGCCGT GA
|
Protein sequence | MLFTFSVRIR FLGTAGHSGW PAPQCTCASC NKALSERRLP LRVTVDDRFR VREAGVVGPV PPGYSVSPTP DGVLVEGPDG GRLMYARTGP GPLPGVPPAE ARAEGSSGYS SARPDQQVDM VIVDAAQRPE VIGELRRSGV IGVTTAVVAV GGDHRVRSPE EFARRARLWG ALVPGDGQVL SCPPAVWPES RPRGPHRTLV TGGARSGKST EAELRLMSEP KVLYAATGPE PDPDADPDWA DRVARHVRRR PWWWRTEQTT DLAALLKGAR GAVLVDCLGT WLTRVMDGAG LWEDSPPPDA EEEVEAAVHG LLDAWRSTQA YVVAVTNEVG SGVVPATRAG GLFRDHLGRL NQWVGAESED VVLVAAGRVL ELP
|
| |