Gene Information |
Locus tag | Ndas_3393 |
Symbol | |
ID | 9247258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4056331 |
End bp | 4057746 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | putative secreted protein |
Protein accession | YP_003681304 |
Protein GI | 297562330 |
COG category | |
COG ID | |
TIGRFAM ID | |
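The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the terminal stop codon; both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(4056331, 4057746)
print(length_bp)                           # 1416
print(expected_protein_length(length_bp))  # 471
```

Under these assumptions the result agrees with the listed Gene Length (1416 bp) and Protein Length (471 aa).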
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.327426 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGCAC CTCCCGCAGC ACCACGGGCC AGGAAGGCGC TGCGGCGCAC CGCGGCCCTC GCGTCCGCCG CGGCCGCGTC CCTGCTCCTC GGGCTCCTCG GCCCGGCCGC CGCCCAGGCG GACGAGGCGC CCGCCGAAGA CATCCACAGC GACAACATCA CCCACGTCGC CCACACGCCC AAGCCCTCGG CCGTGCGCAA CGTCAACTCC GACCTGGCGT TCAGCGGCGA CTACGCCATC GGCGGCAACT ACGACGGCTT CGTCATCTAC GACATCTCCG AGCCGGAGGA GCCGCAGGTC GTCTCCGAGG TGCTGTGCCC GGGCGGACAG GGCGACGTGT CGGTCAGCGG CGACCTGCTC TACTTCTCGG TGGACTATCC GCGAGCGAGC ACCGAGTGCG GGGCACCCTC CGTCCCGGTG ACCGACCCGG ACGGCTTCGA GGGGATCCGG ATCTTCGACA TCTCCGACAA GGCCAACCCC CAGTACGTGT CGGCGGTGCG CACCGACTGC GGCTCGCACA CCAACACCCT GGTGCCGAGC AAGACCGGTG ACAGCGACCT GATCTACGTG TCGTCGTACT CGCCCTCGGA GCGCTTCCCG AACTGCCAGC CGCCGCACGA CAAGATCTCC GTCATCGAGG TCCCGCACGA CGCTCCCGAG GAGGCCGCGG TCGTCAACGA ACCGGTCCTG TTCCCCGAGG GCGGCAACCA CGAGCAGGAC GGGCTGCTGC TGCCCACCCA GGGCTGCCAC GACATCACCG TCTACGCCGA GCGCGACATC GCCGCGGGCG CCTGCATGGG CGACGGCGTG CTGATGGACA TCTCCGACCC GGTGAACCCG GTCGTCACCG AGGTGGTCCA GGACGAGAAC TTCGCGTTCT GGCACTCGGC GACCTTCACC AACGACGCCC GGACCGTGCT GTTCACCGAC GAGCTCGGCG GAGGCGGCGC CCCGACCTGC ACCGAGGAGG TCGGCCCCCA GCGCGGCGCC AACGCCATCT ACGCCATCGG CGGCGGCGAC TCGCCGGAGC TGGAATTCGC CAGCTACTTC AAGATCGACC GCCACCAGGG CGACCAGGTG TGCGTGGCGC ACAACGGCTC GCTGATCCCG GTGCCCGGCC AGGACTACTT CGTGCAGTCG TGGTACCAGG GCGGCGTCTC GGTGATCGAC TTCAACGACC CGGGCGCCCC GAGCGAGATC GGCTTCTTCG ACGTGGACTC CCGCGTCGAG GAGGGTGTGC AGGACAACGA CACCTGGTCG ACGTACTACT ACAACGGCTA CGTGTACTCG TCCGACATCG AACGCGGCCT GGACGTGCTG CGGATCGACG ACCCGCGCGT GCGCGCGGCC GAGCGGGTGC GGATGGAGGA GTTCAACCCG CAGAGTCAGG AGAGCTACCG GCCGGGACGG CGCTGA
|
Protein sequence | MPAPPAAPRA RKALRRTAAL ASAAAASLLL GLLGPAAAQA DEAPAEDIHS DNITHVAHTP KPSAVRNVNS DLAFSGDYAI GGNYDGFVIY DISEPEEPQV VSEVLCPGGQ GDVSVSGDLL YFSVDYPRAS TECGAPSVPV TDPDGFEGIR IFDISDKANP QYVSAVRTDC GSHTNTLVPS KTGDSDLIYV SSYSPSERFP NCQPPHDKIS VIEVPHDAPE EAAVVNEPVL FPEGGNHEQD GLLLPTQGCH DITVYAERDI AAGACMGDGV LMDISDPVNP VVTEVVQDEN FAFWHSATFT NDARTVLFTD ELGGGGAPTC TEEVGPQRGA NAIYAIGGGD SPELEFASYF KIDRHQGDQV CVAHNGSLIP VPGQDYFVQS WYQGGVSVID FNDPGAPSEI GFFDVDSRVE EGVQDNDTWS TYYYNGYVYS SDIERGLDVL RIDDPRVRAA ERVRMEEFNP QSQESYRPGR R
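The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the blocks can be joined, the GC content computed, and the coding sequence translated as shown below. The helper names are hypothetical. The codon table is written out in the usual NCBI base order (T, C, A, G); translation table 11 is assumed here to assign the same amino acids as the standard code, and its alternative start codons are not handled because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    A trailing partial codon, if any, is ignored.
    """
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First two blocks of the gene sequence listed above.
fragment = clean_sequence("ATGCCCGCAC CTCCCGCAGC")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 75.0
print(translate(fragment))   # MPAPPA
```

Applying the same functions to the full gene sequence should let the reported GC content and protein sequence be compared against the record; the fragment is used here only to keep the example short.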