Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2630 |
Symbol | |
ID | 9246481 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3140989 |
End bp | 3142239 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | Sel1 domain protein repeat-containing protein |
Protein accession | YP_003680553 |
Protein GI | 297561579 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0395405 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCCGGC CCTGGAAGCG TCCCGATCTG CCCCCGGGTG GGCTCGACGA GCTCAACCGC GCGCTGCATG AGTTGCACCA CCGGGCCGGC TGGCCCTCCT CCCGACAGAT CCAGCGCGCC CTGGACGCCA AGGGGGTGCC CATGTCCCAC ACCAAGGTTC ACGACACCCT CACCAAGCCG GACCTGCCGC CCAAGGGCGC GGTCGAGATG ATCACCGAGG TCCTGGCGGA GGCGGTCCGC GGCGCCGACG CCGACACCGA GGTCGAGCGC CTCCTGGACC TGTGGCAGGA CGCCTCCGAC GGCACCCTGT CCTCTTCTTC CGCGGCGCAC GGCGAACCCC GGGCCGAAGC ACCGCCCCCG GCGCTCCGGC CTCCGACCGG CGGCGGACAG GGCGGGGACG AGCAGGCCTA CGAGTGGGGA GACGTGCCGC GGGAGTGGAG GGACAGGGCC GAGGCCGGTG AACCCGACGC CATGATCAGC ATCGGCCTCA GGTTCGGGGT CTGGGGCGAG AAGGACAAGG CGGAGACCTG GTACCGCCGT GCCGTCGAGG CGGGCAGTAC CCGGGCCATG GACAACCTCG GGAGCCTGTT GGAGGACCGG GGCGACCTGG GGGAGGCCGA GGAGTGGTTC CGCCGCGCCG CCGAGGACGG TCACACCGAT GCCATGGACA ACCTCGGGAG CCTGCTGGAG GGCCGGGGCG AGCTGGACGA GGCCGAGGGG TGGTTCCGCC GCGCGGTCGA GGACGGTCAC ACCGATGCCA TGAACAACCT CGGTGTCCTG CTGCGGGGAC AGGGTGAGCT GGACGAGGCC GAGGGGTGGT TCCGCCGTGC CGCCGAGGAC GGACACCTCC AGGCCATGAA CGACCTCGGC GTCCTGTTGC GGGGACGGGG GCGGCTCGAC GAAGCCGAAT CCTGGTTCCG CAACGCCGCC GGCAAGAACG GCAACGCGCA CGCCATGTAC AACCTCGGGT CCCTGTTGGA GGACCGGGGC GAGCTCGGCG GGGCCGATGT GTGGTACCGG CGCGCCGCCA AGAACGGCAA CACCCAGGCC ATGTACAACC TCGCGTTTCT GCTTCACCGA GAGGGGGACA AGGACGAGGC CGAGACCTGG TACCGCCGTG CCGCTGAGTT CGGCCACACC GCCGCCATGT ACAACCTCGG CGTGCTGCTC CAGGGGCGGG GCAGGCCCGG GGAGGCCCAG GGGTGGTGGC AGCGGGCGTT GGCCGCGACG GGTCGGCGCC GCGGGCGCTG A
|
Protein sequence | MPRPWKRPDL PPGGLDELNR ALHELHHRAG WPSSRQIQRA LDAKGVPMSH TKVHDTLTKP DLPPKGAVEM ITEVLAEAVR GADADTEVER LLDLWQDASD GTLSSSSAAH GEPRAEAPPP ALRPPTGGGQ GGDEQAYEWG DVPREWRDRA EAGEPDAMIS IGLRFGVWGE KDKAETWYRR AVEAGSTRAM DNLGSLLEDR GDLGEAEEWF RRAAEDGHTD AMDNLGSLLE GRGELDEAEG WFRRAVEDGH TDAMNNLGVL LRGQGELDEA EGWFRRAAED GHLQAMNDLG VLLRGRGRLD EAESWFRNAA GKNGNAHAMY NLGSLLEDRG ELGGADVWYR RAAKNGNTQA MYNLAFLLHR EGDKDEAETW YRRAAEFGHT AAMYNLGVLL QGRGRPGEAQ GWWQRALAAT GRRRGR
|
| |