Gene Information |
Locus tag | Ndas_1476 |
Symbol | |
ID | 9245326 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1808508 |
End bp | 1809914 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | protein of unknown function DUF1212 |
Protein accession | YP_003679413 |
Protein GI | 297560439 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
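The coordinate and length fields in the Gene Information table can be cross-checked against each other. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no amino acid.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    # Values taken from the record: Start bp, End bp.
    length = gene_length_bp(1808508, 1809914)
    print(length, protein_length_aa(length))
```

With the values from this record, the computed lengths agree with the listed Gene Length (1407 bp) and Protein Length (468 aa).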
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.783892 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCCCGA GCGAGGACCG CCTGCTCAGC AGGATGCGGG ACTGGCGCGA GGCGCACGAG CGGGAGCTGA CCCCGGACGG CTTCGACGAG ACCGAGGACG TCAACCTCCC CGACGCCCGC GCCATCGACC TGGTGCTGCG GGTGGGCGAG CTGATGCTCG CCAGCGGAGA GGGCACGGAG GCCGTCAGCG AGGCGATGCT CAGCCTCTCG GTCGCCTTCG ACCTGCCCCG CTCGGAGGTC TCGGTCACCT TCACCACCAT CACCCTGTCC ACCCACCCCG GGGGCGAACA CCCCCCGATC ACCGGCGAAC GGGTGGTGCG CCGCCGCACC CTGGACTACT TCCGCGTCAA CGAACTGCAC ACCCTGGTGC AGCAGTGCGC GCTCGGCCTG CTGGAACTGG AGGACGCCGC CGCCCGCCTG ACCCAGATCA GGCGCGCCCG CATGCCCTAC CCCAACTGGC TCATCGCGGT CGGGTTCGGC CTCATCGCCT CCAGCGCGAG CCTCATGGTG GGCGGCGGCC TGATCGTGGC GACCGCGGCC TTCCTGGCCA CCGTCATGGG CGACCGCACG TCGGTGTTCC TGGCCAAGCG GGGCGTCGCG GAGTTCTACC AGATGGCGGG CGCCGCGGTG GTGGCCGCCA CGATCGGCGT GGCGCTGCTG TGGGCCAGCA CCACGCTGGA CCTCGGCCTC CAGGCCGGGG CGATCATCAC CGGCAACATC ATGGCCCTGC TGCCCGGACG CCCGCTGGTC TCCAGCCTCC AAGACGGCAT CAGCGGCACC TACGTGTCGG CGGCGGCGCG CCTGCTGGAG ACCTTCTTCA TCCTGGGCGC GATCGTGTCC GGCGTGGGCG CGGTCGCCTA CACCGCCCAG CGCCTGGGCG TGAACATCAA CCTGGAGGAC CTCCCCTCCG CGGGAACCTC GATGGAGGTC CCCGTACTGA TCGGCGCGGC GGGGATCGCG GTGGCCTTCG CGATCTCGCT CGCCGTACCG CCCCGGATGC TGCCGATGAT CGGCGTGCTC GGCGTGATGA TCTGGGTGAT CTACGCGAGC ATGCGCGACC TGCTGCACGT GCCCGCGGTG GTGGGCACGG TCGCGGGCGC GGTCGCGGTG GGCGTGGTGG GCCACTGGCT GGCGCGGCGC ACGCGCAGAC CGGTGCTGCC CTACCTGGTC CCGTCGATCG CCCCGCTGCT GCCCGGCAGC ATCCTGTACC GGGGCCTGAT CGAGATCACG CAGGGCGACC CCTCCGCCGG GCTGCTCAGC CTCGCGGAGG CGGTCACGGT GGGCCTGGCG CTGGGCGCGG GGGTCAACCT CGGCGGCGAG CTGGTCAGGG CCTTCCAGCA CGGCGGACTG GCCGGCGCGG GGATGCGCAG CCGCCCTGCG GCGCGCCGGA CACGCGGCGG GTACTGA
|
Protein sequence | MPPSEDRLLS RMRDWREAHE RELTPDGFDE TEDVNLPDAR AIDLVLRVGE LMLASGEGTE AVSEAMLSLS VAFDLPRSEV SVTFTTITLS THPGGEHPPI TGERVVRRRT LDYFRVNELH TLVQQCALGL LELEDAAARL TQIRRARMPY PNWLIAVGFG LIASSASLMV GGGLIVATAA FLATVMGDRT SVFLAKRGVA EFYQMAGAAV VAATIGVALL WASTTLDLGL QAGAIITGNI MALLPGRPLV SSLQDGISGT YVSAAARLLE TFFILGAIVS GVGAVAYTAQ RLGVNINLED LPSAGTSMEV PVLIGAAGIA VAFAISLAVP PRMLPMIGVL GVMIWVIYAS MRDLLHVPAV VGTVAGAVAV GVVGHWLARR TRRPVLPYLV PSIAPLLPGS ILYRGLIEIT QGDPSAGLLS LAEAVTVGLA LGAGVNLGGE LVRAFQHGGL AGAGMRSRPA ARRTRGGY
|
| |
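The sequence fields are shown in space-separated blocks of ten characters. The sketch below is one hedged way to join such blocks and run basic sanity checks on a CDS: length divisible by three, a plausible start codon, a stop codon, and GC percentage. The helper names are hypothetical, and the start codon set lists only the three most common bacterial starts; translation table 11 permits others. Note that the gene sequence here begins with GTG while the protein begins with M, which is consistent with GTG acting as an initiation codon.

```python
COMMON_STARTS = {"ATG", "GTG", "TTG"}  # assumption: common starts only, not the full table 11 set
STOPS = {"TAA", "TAG", "TGA"}


def join_blocks(text: str) -> str:
    """Remove whitespace between sequence blocks and upper-case the result."""
    return "".join(text.split()).upper()


def check_cds(seq: str) -> dict:
    """Basic structural checks on a coding sequence given 5' to 3'."""
    return {
        "length_multiple_of_3": len(seq) % 3 == 0,
        "start_codon": seq[:3],
        "start_ok": seq[:3] in COMMON_STARTS,
        "stop_codon": seq[-3:],
        "stop_ok": seq[-3:] in STOPS,
    }


def gc_percent(seq: str) -> float:
    """Percentage of G and C among unambiguous bases (A, C, G, T)."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    # Toy input for illustration; paste the full Gene sequence field in practice.
    toy = join_blocks("GTGCCCCCGA GCTGA")
    print(check_cds(toy))
    print(gc_percent(toy))
```

Running the same checks on the full Gene sequence field would allow the listed GC content and the start and stop codons to be verified directly against the record.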