Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0260 |
Symbol | |
ID | 9244094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 324192 |
End bp | 325856 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | protein of unknown function DUF1023 |
Protein accession | YP_003678215 |
Protein GI | 297559241 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.160803 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGACC TCTCCTCCAT GTCCCTCGGT CTGCCCGAGG ACATCGCCGC CGACGTCGGC GCCATCGAGA CGGGCGCCGA CCAGCTCGAC GCCGTCCGAC AGAACATGCT CGGCCAGTCC GAGGGCACCC ACAGCCGGTT CCGGTCCTCC GCCGGGGAGT TCACCGACCT CGTCGCCTGG AACATCGTCT CCAGTTCGTC CCAGGAACTG TCCTCCTGGC AGGAGGCCGC CGCTTCCCTG ACCTACGGCG CGGCGGTCCT GCGCCAGTGG GGGCTCGACA TCGAGACCTA CCGTGCCGAG CGCGCCAAGC TGGAGACCCG CTGGGAGGAG GAGAAGGCCG ACGCTGAAGC GGCCGTCGGC GCCTCCGAGG GGAGCGGCTC CATCCTCGGT GAGGGAACCC GTGAGGGGAT GAAGGTCGCC CAGCTGGAAA CGCTGCGCGC GGAACTGCTC TCCGAGCACT CCGGGCACTG GGAAACCCTG ATGGAGCAGG CCGAGCAGAC CGAGAGGGAC CTGCGCGACG GCCCCAGCCA GGACAGCCTG GAGCGTCTGA TCGAGTCGTC CCTGCTCACC GGCGGCCAGC TGTCCTACTT CGGTGACGCG GTCCCCAGCA TGGTCCCGGA CGAGCTGAGC GGTGACGAGC ACCCGTCCGT CGTCAACCTG TGGTGGACCT CGATGACCGA GGAGGAGCAG GACCAGGCCG TGCGGGACCA TCCCGAGCTG CTGCGCGAGC TCGACGGCAT CCCGGCGGCC GTGCGCGACC GACTGAACCG CGACCATCTC GACGACGAGA TCGAGCGGTT CGAGGAGGAG ATCGCCGAAC GGGACGAGGA GATCGGGGAG GCGGCCGCCC GGGGCAGCAA CGGGTCCGAC GCGATCGCTT TGGCCATGGC GAACGACGAC ACGCTCGACA ACCAGCTCCA GGAGCTGAAG GAGCTGCGGG AGAGCCTGGA GGACGAAAGC GCTGACAGGT ACCTGCTGGC TCTGGACACC GGGGGCGACG GACGGGCGAT CGTGGCCAAC GGCAACCCCG ACACCGCCGA CAACGTGGCG ACCCTGGTGC CGGGCACCAC GACGACCTGG GAGAGCATCA ACGACCAGAT GGGACGCGCG GACGCTTTGG CGGACTCTGC GAACCGGGTC AGCCGCGACC AGGACCACTC CGTCATCAGC TGGATCGGCT ACGACGCCCC CAACGTCCCC GAGGCGGCCT TCGAAGGACG GGCGGAGGAC GCGGTCAGCG AGCTGAGCAG TTTCCAGGAC GGACTGCGCT CCACCCATCA GGGGCCGCCG TCCCATAACA CCGTCATAGG CCACAGTTAC GGTTCCACGG TGGTCGGGCA CACCGCGCAG AGCGACGCCG GGCTCGACAC GGACGAAGTG ATACTCGTGG GCAGCCCCGG AACCAACGCC GACCACGTGA CCGACCTGAA TCTTCCCGCC GAGAACGTGC ACGTCTCAAC GGCGGAGAAT GACGGCATCA CCAACCTGAC GGGCCTCACG CACGGCATGG ACCCGACCGA TCCGGAATTC GGAGCGAACG TGTTCGAGTC CGACCCTGGC AGCGAGGGTG GCACGTGGCC CCTCGGTGAC GCCCATTCGG AGTACTTCGA CGAGAACACG AGTTCGCTGA GGCACATGGG CTCTGTCATC GCGGGACAAG AGTAG
|
Protein sequence | MSDLSSMSLG LPEDIAADVG AIETGADQLD AVRQNMLGQS EGTHSRFRSS AGEFTDLVAW NIVSSSSQEL SSWQEAAASL TYGAAVLRQW GLDIETYRAE RAKLETRWEE EKADAEAAVG ASEGSGSILG EGTREGMKVA QLETLRAELL SEHSGHWETL MEQAEQTERD LRDGPSQDSL ERLIESSLLT GGQLSYFGDA VPSMVPDELS GDEHPSVVNL WWTSMTEEEQ DQAVRDHPEL LRELDGIPAA VRDRLNRDHL DDEIERFEEE IAERDEEIGE AAARGSNGSD AIALAMANDD TLDNQLQELK ELRESLEDES ADRYLLALDT GGDGRAIVAN GNPDTADNVA TLVPGTTTTW ESINDQMGRA DALADSANRV SRDQDHSVIS WIGYDAPNVP EAAFEGRAED AVSELSSFQD GLRSTHQGPP SHNTVIGHSY GSTVVGHTAQ SDAGLDTDEV ILVGSPGTNA DHVTDLNLPA ENVHVSTAEN DGITNLTGLT HGMDPTDPEF GANVFESDPG SEGGTWPLGD AHSEYFDENT SSLRHMGSVI AGQE
|
| |