Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1499 |
Symbol | |
ID | 9245349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1837908 |
End bp | 1839278 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | formyl transferase domain protein |
Protein accession | YP_003679435 |
Protein GI | 297560461 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.290814 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000921551 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCAGACC GTCGGCGCTA CTGCTACGCC TCCGGGCTGG GCCTCGGTGT TCCGGCGCTG GAGGAGCTGT GCGCCCGAGG CTTCCCGCCC GGTCTGGTGG TGTCCCACCC CGCCGAGTTG GCCCACTGCT CCGGCTACCA CGACTACGGG GCCCTCGCCG ACCGGCTCGG TCTGCCGCAC CTGCGCGCCG CCCTCGACTC GGGCGAGGTG CGCGAGGCCC TCACCTTCCA CGGCATCGAC CTGATGGTCG TCGCGGGCTG GTCGGGAACC GTCCCGGAGG AGGTCCTGTC CTCCCTGGCT CTGGGCGGGG TCGGGCTGCA CCCGGCGCCG CTGCCCGTCG GCCGGGGCCG GGCGCCCATC CCGTGGACGA TCCTGCGCGA CATGCGCTCC AGCGCCGTGA CCCTCTTCCA CATCGAAGGG GAGGAGCACA GCGGCGACAT CGTCGACCAG GCCTGGTTCG ACGTGGCCCC GGACGCCACC GCCGCCGGTC TGTACGAACG CGTGGGGCTC CTCCAGGCGG AACTGCTCGT GCGCCACATG GAGGGCCTGC TGGAGGGCAC CGCGCCCCGG CGGCCGCAGA GCGGCCACGC GTCGGTGTGG CCGCGCCGGC GCCCCAGCGA CGGCCACCTC GACCTCACCG CCTCCGGAAG CGACGTGGAC CGGATGGTGC GCGCGCTGGC CGAGCCCTAC CCGGGGGCGT TCGCGATGTT CGGCAGCGCC CGGATCACGC TGTGCTCGGG ACGCCTGGTG GGCGGGGTCG CCGGCGGGGC GCCCGGGCAG GTGGTCGCGA CCGGCCGGGG GCGGGAGTGG GGGATCACCT GCGGGGACGG GGCCGTGTTC GTGCCCGAGG TGCTGCGGGT GGACGAGGGG GTGCGCGCCC GGCCGACCTC GTTGGCGATG TTCCGGCCCG GGACCTTCTT CGAGGCCCCC TCCCAGCACA TGCTGGAGGG CACCCGGCGG GCGCCCCTGC CCGGACAGGC GCCGAGCGGA CCGAACAGGG TCGTGCCCGC CGCCCGGACC GCGCCGGAGG CACGGGCCGC GGAGCCCGGG GTCCCGGGGG CGGGGGAGGG CGGCGCGCGG TCGCGGGCTC CGGCCCCGGA GGAGGGGGCT TCCGGGGCGA ACGTGTCGGG GGCACCGGAC GCGTCGGGGA CTTCGGGTGC GGCCGGGGAG GCCGGGGCGG AGGTTCCGGA GGCCCGGGGG TCCGAGCAGC GGGTTTCTGA GGGGCGCGTC CCGGAGGCGC AGGCGCCGCA GGCAGGGGTT CCCGACACGC GGGGGCCGGA GCCCCAGACC CCGGAGGCGC AGGCGGGGGA CGGTACGGAC CTCGCGCCCG GGATCATGGC GGGCGACCCC GGTCCGGCCG ACCAGCGCTG A
|
Protein sequence | MSDRRRYCYA SGLGLGVPAL EELCARGFPP GLVVSHPAEL AHCSGYHDYG ALADRLGLPH LRAALDSGEV REALTFHGID LMVVAGWSGT VPEEVLSSLA LGGVGLHPAP LPVGRGRAPI PWTILRDMRS SAVTLFHIEG EEHSGDIVDQ AWFDVAPDAT AAGLYERVGL LQAELLVRHM EGLLEGTAPR RPQSGHASVW PRRRPSDGHL DLTASGSDVD RMVRALAEPY PGAFAMFGSA RITLCSGRLV GGVAGGAPGQ VVATGRGREW GITCGDGAVF VPEVLRVDEG VRARPTSLAM FRPGTFFEAP SQHMLEGTRR APLPGQAPSG PNRVVPAART APEARAAEPG VPGAGEGGAR SRAPAPEEGA SGANVSGAPD ASGTSGAAGE AGAEVPEARG SEQRVSEGRV PEAQAPQAGV PDTRGPEPQT PEAQAGDGTD LAPGIMAGDP GPADQR
|
| |