Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3231 |
Symbol | |
ID | 9247088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3863018 |
End bp | 3864808 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | protein of unknown function DUF1446 |
Protein accession | YP_003681143 |
Protein GI | 297562169 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.113119 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTGCTTGT TTTTCCGTGT GATCGGTGTC ACCCTCGTGT CCATGACCGC TGCTGCCACC CCGGCGCCCC CGCTGCTGGT CGCGAACGCC TCCGGTTTCT ACGGGGACCG CTTCGCCGCC GTCCACGAGA TGCTCACCGA GGGGCGCGTG GACGTCCTCA CCGGCGACTA CCTGGCCGAG CTCACCATGG CCATCCTCGG CCGCGACCAG CTCGGCGACC CCGGCCGCGG CTACGCCCGG ACCTTCCTGA GCCAGATGCG CGAGACCCTG GCGCTGGTCA TGGAACGCGG CACCAGGGTG GTCACCAACG CGGGCGGCCT CAACCCGCGC GGCCTCGCCG ACCGGCTCAC CGAACTCGCC GACGGGCTCG GCCTCGAACC CCGCATCGCC TGCGTCACCG GCGACGACCT CATCGACCGC GCCGAGGAAC TCGACCTGGG CACCCCCCTG ACCGCCAACG CCTACCTGGG CGCCTTCGGC ATCGCCGCCT GCCTGGAGGC GGGCGCCGAC ATCGTCGTCA CCGGCCGCGT CACCGACGCC TCCCTGGCCG TGGGCCCGGC CGCCGCCCAC TTCGGCTGGA CCCCCGGGGA CCTGGACGCC CTGGCCGGGG CCACCGTCGC CGGGCACGTC ATCGAGTGCG GGACCCAGGC CACCGGCGGC AACTACGCCC TGGCCGCCGA ACTCCTGCGC GAGGGCCGCG ACCTGGACCG GCCCGGCTTC CCCCTCGCCG AGATCCACGC CGACGGCAGC GCGGTCATCA CCAAGCACCC CGGCACCGGC GGCGCCGTCA CCACCGGGAC CGTCACCGCC CAGCTCGTCT ACGAGGTCGC CGGAGCCCGC TACCCCGGCC CCGACGTCAC CGCCCGCCTG GACACCGTGC GCCTCACCAG GCAGGGGCCC GACCGCGTCC TGCTCAGCGG AACCCGCGGC GAGGAACCTC CCCCCGACCT CAAGGTCGGA CTGACCAGCC TCACGGGCTT TCGCAACGAG GTCGAGTTCC TGGTCACCGG CCTGGACGCC GGGGCCAAGG CCGCACAGGC CGAACGCCAG ATGCGCGCAG CCCTCGCCGA CCGCGCGCCC GACGACCTGC GCTTCACCCT CGTGCCAGCC CAGGACCCCC ACGGCGACAC CCAGGACGCG GCCACCGCAC GCCTGCGGGT GGTCGCCCGC GACCACGACC CCGCGGTCGT GGGCCGCTCC TTCGGGGCGG CCGCCGTGGA GCTGGCCCTG GGCAGCTACG CCGGATTCCA CCTCACCGCG CCGCCGCGCG AGGCCCGACC CGACGGGGTC GCCGCCACGC ACGCGCTCGT GCCCGCCTCC GAGGTCGCCC ACACCGCGAT CCTGCCCGAC GGCGCCAGGA TGCCCGTCGC GCCCGCGCCC CGCACCCGGG CCCTGACCGG GGTCGCCGAG CCCCCGCTGC CCGAACCCCT GCCCCGAGGC CCCGCGCGCC CGGTCCCGCT CGGCCTGGTC CTGGGCGCGC GCAGCGGCGA CAAGGGCGCC GACGCCAACC TGGGCGTGTG GGTGCGCGGA GAGACGGCGT GGCGGTGGCT GGCCACCACC CTGACCGCCG ACCTGCTGCG CGAACTCCTG CCCGAGACCG CCGGACTGCG CGTCACCCGG CACCTGCTGC CCAACCTGCG GGCCGCCAAC TTCTGGATCG AGGGCCTGCT CGCCCCCGGC ACCGCGCGCC GCGAGGGCGT GGACCCCCAG GCCAAGGGGC TGGGCGAGTG GCTGCGCGCC CGCCGCGTCC CCGTCCCCGA GACCGTGTTG GCGGAGGTGG AGCAGCCGTG A
|
Protein sequence | MCLFFRVIGV TLVSMTAAAT PAPPLLVANA SGFYGDRFAA VHEMLTEGRV DVLTGDYLAE LTMAILGRDQ LGDPGRGYAR TFLSQMRETL ALVMERGTRV VTNAGGLNPR GLADRLTELA DGLGLEPRIA CVTGDDLIDR AEELDLGTPL TANAYLGAFG IAACLEAGAD IVVTGRVTDA SLAVGPAAAH FGWTPGDLDA LAGATVAGHV IECGTQATGG NYALAAELLR EGRDLDRPGF PLAEIHADGS AVITKHPGTG GAVTTGTVTA QLVYEVAGAR YPGPDVTARL DTVRLTRQGP DRVLLSGTRG EEPPPDLKVG LTSLTGFRNE VEFLVTGLDA GAKAAQAERQ MRAALADRAP DDLRFTLVPA QDPHGDTQDA ATARLRVVAR DHDPAVVGRS FGAAAVELAL GSYAGFHLTA PPREARPDGV AATHALVPAS EVAHTAILPD GARMPVAPAP RTRALTGVAE PPLPEPLPRG PARPVPLGLV LGARSGDKGA DANLGVWVRG ETAWRWLATT LTADLLRELL PETAGLRVTR HLLPNLRAAN FWIEGLLAPG TARREGVDPQ AKGLGEWLRA RRVPVPETVL AEVEQP
|
| |