Gene Information |
Locus tag | Ndas_4251 |
Symbol | |
ID | 9248125 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5067395 |
End bp | 5069008 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | Domain of unknown function DUF1814 |
Protein accession | YP_003682146 |
Protein GI | 297563172 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
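The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and an unspliced CDS whose last codon is a stop codon. The function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 5067395
END_BP = 5069008


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids: total codons minus one terminal stop codon.

    Assumes an unspliced CDS whose length is a multiple of three.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp, protein_length(length_bp))
```

With the values in the table this gives 1614 bp and 537 aa, matching the Gene Length and Protein Length fields.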
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.176604 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.224045 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTTCT CCGGTGACTT CGAGACCCAC CTGACCCTGG ACGCGACCGC CCCCGGGCGC GTCGCGGAGG CGTCGGAGTG GGCGCGCGAG CACGGGCTCA AGTTCACCCA CATCGAGCTG GACCGGGGCG AGTCCCCGTC ACAGCCGATG GTCACCTACC ACTCCGACGG GAGCACCCTG GCGCGGGAGC TGGCCGTGGC GGAGCGCTGG GCGGCCCTGC TGGCAGGGGC CGGGTTCGCG GTCACCCGCA CCAAGCTGGA GGTGTCCCGC CGGGCCGCCG GGGTGCCCTG GGACCGGGAG GAGGCCGAGC TGTTGCCGGA GTCGTGCTAC TTCGAGACGC ACGTCAAACT CCTGCTCCCC GCGTCGGCCG ACCTGGCGGC GCTGTCCGCG ATCGTGGAAC CCCACCGCGC CCGGTTGTCG CGCAACGCGC GCAGGGTCCG CGACGACGGC TTCCAGGAAC GGTTCGTCAC CCAGCGGTGC TCACGTGTGG GCCACAGGGA GGCCGCCCGG TTCGAACACG CGCTGTTGAA GGCCCTGGAG AGGGCCGGGG TGACCTTCGA GGACAAGGAG GGGTGGCAGC CCAGGGTCCT GTCCGTCGAG CGGGAGTTCG TCGTCCACGA CACCGCCCTG TCCGTGGACG CGGGGTGGAT GGACGCCGCC CCGGTCCGCG ACGCCGACGA GGTTCAGCCG AGCGTGTACG CGCCGGACGG CTACCGGCAG CGCCCGCCCG GCACCTACGT GCCCAACACC GACGGCCCCG AGGCGAGTCA GGGCAAGGTG TTCGACCCCG CGCTCAAGCA CCTGGACGAC GCCTACCGGG CGGGTGAGCC GGTGTTCACC GACCCCGGCC TGGGCTCCCG CTGGTGGGAG GCCAACCGGC GGGCGATGGA GCTGGCCCTG CGCGCGATCT CGGGTACACC GTGGCGGGAC GGCCTCATGC TGCGCGGCAG CATGCTCATG CCGGTGTGGG TGGGTGACGC CGCCCGGCGC CCCCGAGACC TGGACTTCGT GGTGGTCCCG GCCGAGACCG CCCCGTTCGG GGACCCGGCC GACCGCATGT TCGCGGACGT GGTCGGGGCC GTCACGGACG CTTCCGCGCA GGGGATCTCC TTCGACGCCG AGGGCGTGCG GCTGGAGAGC ATCTGGACCT ACGAGCGGGC CCCGGGTCGC CGCGTGGTCG TCCCGTGGCG GGCCGGGGGC CTGCCCCCGG GCACCGTGCA GATCGACGTG GTGTTCAACG AGTCGCTGCC CGAGCCGCCG GTGGCGGTGT CGGTGGCGGG GTCGGACGTG CTGGCGGCCG GGGCGGAGCT GTCCCTGGCC TGGAAGGTGC TGTGGCTGTA CACGGACACC TACCCGCAGG GCAAGGACCT CTACGACGCG GTCCTGCTGG CCGAGAGCGC GCGGCCCTCG CGTGAGCTGC TGGTCGGTGT GCTGCGCCCC GAACTGGGCG ACCGGGCCGA GACCGTGGAC GAGCGCTTCC TGCGGGAGGA GGGCAGCCTC GACTCCGACG AGTGGGACGA CTTCGTCAGC GACTGCCCGT GGGTGGAGGG CGACGCCGGG GAGTGGGTGG ACCGCTTCGA GGCGGCGATG GCCCCCGTGT TCCGGGGGGA GTGA
|
Protein sequence | MEFSGDFETH LTLDATAPGR VAEASEWARE HGLKFTHIEL DRGESPSQPM VTYHSDGSTL ARELAVAERW AALLAGAGFA VTRTKLEVSR RAAGVPWDRE EAELLPESCY FETHVKLLLP ASADLAALSA IVEPHRARLS RNARRVRDDG FQERFVTQRC SRVGHREAAR FEHALLKALE RAGVTFEDKE GWQPRVLSVE REFVVHDTAL SVDAGWMDAA PVRDADEVQP SVYAPDGYRQ RPPGTYVPNT DGPEASQGKV FDPALKHLDD AYRAGEPVFT DPGLGSRWWE ANRRAMELAL RAISGTPWRD GLMLRGSMLM PVWVGDAARR PRDLDFVVVP AETAPFGDPA DRMFADVVGA VTDASAQGIS FDAEGVRLES IWTYERAPGR RVVVPWRAGG LPPGTVQIDV VFNESLPEPP VAVSVAGSDV LAAGAELSLA WKVLWLYTDT YPQGKDLYDA VLLAESARPS RELLVGVLRP ELGDRAETVD ERFLREEGSL DSDEWDDFVS DCPWVEGDAG EWVDRFEAAM APVFRGE
|
| |
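The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before use. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be cleaned, checked for GC content, and translated. It assumes the sense-codon assignments of translation table 11 match the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). All names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    print(translate("ATGGAGTTCT CCGGTGACTT CGAGACCCAC"))
```

Run on the first three blocks of the gene sequence, `translate` returns `MEFSGDFETH`, the first block of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.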