Gene Information |
Locus tag | Ndas_4854 |
Symbol | |
ID | 9248740 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5754796 |
End bp | 5756436 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | coagulation factor 5/8 type domain protein |
Protein accession | YP_003682743 |
Protein GI | 297563769 |
COG category | |
COG ID | |
TIGRFAM ID | |
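The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length_bp(5754796, 5756436)
print(length)                     # 1641
print(protein_length_aa(length))  # 546
```

Both results agree with the Gene Length (1641 bp) and Protein Length (546 aa) fields of this record.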
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.26673 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCACCT TCCTCATCGA ACCCGGTGCG AAGCTGGCCA ACCGCTACCG CCTAGACCAA GTGGTGAGCG AGACCGGCGG CGCCACACGC TGGAAGGCCA CCGACGAGAC GCTCGCGCGT CCCGTGGCGG TGTGGACCTT CGCCGAGGGC TTCCCGCGCA CCTCCGAGGT CGTGCGCGCC GCCCGGGCCA CGAGCCGCAT TCCCGACGCC CGCGTCACGC AGGTCTTCGA CGCCGACGAC TCCTCCCCCG TCCCCTACGT GGTCGAGGAG TGGGTCATCG GCTCCTCCCT GGCCGACCTC CTCGCGCAGG GCCCCATGGA GCCCGAGCGC GCCGCCGGGC TGGTCGCCGA GGCCGCCGAG GCGATCGCCG CCGCGCACGC CGGAGGCCTT CACCACCTGT GCCTGACCCC GAGCAAGCTC ATGTGGAGCA GCGGCGGCGC GGTCAAGGTG ACCGGCATCG GTGTGGACGC CGCCCTGCTG GGCGCCGGCA ACCCCGACCC CGCGGCCACC GACGCCCAGG GGCTGGGAAA CCTGCTCTAC GCGGCGCTCA CCGGGCACTG GCCCGGCGGC CCGCAGAGCG GCCTGCCCGC CGCACCCGAG GGTCCCGCGG GCCCCTACCC TCCCCACCAC ATCCGCCAGG GCGTCACCGA ACCCCTGGGC ACCATCACCA CGCGCGCCGC CCTGCCGCAG CTCGCGGGGC AGCTGGTGCC CGGACCGCCC ATCGCCTCGC CGGCCGACTT CTGCGCGGCC ATGGCCGAGG TGCCCCGGCT CATCCCCCTG CCCGTCACCC AGGCCGAGTC GGCCCCGCCC GTGCCCGGGA CCTCCCGGCG CACCGGCGAG TTCGACAGCA CCGGCCCGCG CCGCGGCCCT CGCGGTACCC GTGGCGGGGC CGCGGGCGCA CGGGACGACC AGGAGGCGCG CCACGGTTCC GTCGGACGGA CGGGATCCTC CTCGCGGGGC GGCGGCTCCG TCCGGGGCGG CTCCGAGGTG CGTGAGCGGC AGGGCCCCTC CCGGATGGAG CAGCGGACCG CGCGCACGCA GCCCTCGCTC CGCAAGATCC TGATCGGCGT GGCGGCCCTG GTGCTGTTCG CCGGTGTGGT CGTCGGCGCC TGGACGGTCG GGACGATGTT CAGCGCGGGC GGAGGCGAGG AGGCCCCGCC CGAGGGCGGC GGCGGCCAGG CCGCGGACGG CGGCGGGGAG GTGGAGCTGA GCCCGCTGGA GATCCAGGGG TCACGCGGCC TCAACCCGCA CGGCAACACC GACGAGCACT CGGACAAGGC GGGCCGCGCC CACGACGGGG ACACCGCCAC CGAGTGGAAC ACCCAGGGCT ACAGGGACCC GCTCAGCGAC ATCAAGCCCG GCGTCGGACT CCAGCTCGAC CTCGGCGCCG TCCACGAGGT CCACGAGGTG GACCTCAACC TCGGCGGCAG CGGCTACGAG TTCCAGATCC TGGCCGGCGA GAGCGACTCG GACAGCGAGA CCGGTTACGA GGTCGTCGGA TCGGGCACGG GCGGCTCCCA GGTCGTCACC CTCGACGAAC CGGTCGAGGC CCGCTACGTC GTGGTCTGGT TCACCGAGCT GGCCGGGTCC GGCGAGTGGA GGGGCACGGT CTACGAGGCC GAGGTACGAG GGGTCGAGTA G
Protein sequence | MSTFLIEPGA KLANRYRLDQ VVSETGGATR WKATDETLAR PVAVWTFAEG FPRTSEVVRA ARATSRIPDA RVTQVFDADD SSPVPYVVEE WVIGSSLADL LAQGPMEPER AAGLVAEAAE AIAAAHAGGL HHLCLTPSKL MWSSGGAVKV TGIGVDAALL GAGNPDPAAT DAQGLGNLLY AALTGHWPGG PQSGLPAAPE GPAGPYPPHH IRQGVTEPLG TITTRAALPQ LAGQLVPGPP IASPADFCAA MAEVPRLIPL PVTQAESAPP VPGTSRRTGE FDSTGPRRGP RGTRGGAAGA RDDQEARHGS VGRTGSSSRG GGSVRGGSEV RERQGPSRME QRTARTQPSL RKILIGVAAL VLFAGVVVGA WTVGTMFSAG GGEEAPPEGG GGQAADGGGE VELSPLEIQG SRGLNPHGNT DEHSDKAGRA HDGDTATEWN TQGYRDPLSD IKPGVGLQLD LGAVHEVHEV DLNLGGSGYE FQILAGESDS DSETGYEVVG SGTGGSQVVT LDEPVEARYV VVWFTELAGS GEWRGTVYEA EVRGVE
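The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of how the blocks could be cleaned and how a GC percentage could be computed. The helper names are hypothetical, and the example input is only the first three blocks of the Gene sequence field, not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three ten-base blocks of the Gene sequence field in this record.
first_blocks = "ATGAGCACCT TCCTCATCGA ACCCGGTGCG"
print(len(clean_sequence(first_blocks)))   # 30
print(round(gc_percent(first_blocks), 1))  # 60.0
```

Applied to the full Gene sequence field, the same function could be used to compare the result against the rounded GC content value reported in the record (75%).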