Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3147 |
Symbol | |
ID | 9247003 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3765091 |
End bp | 3766386 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | ErfK/YbiS/YcfS/YnhG family protein |
Protein accession | YP_003681062 |
Protein GI | 297562088 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGGGAC GAGTCCGCAC ACCCCGCGGT CTGGCGGCCG CCGTCGCGGT CGCGCTCGCC GCGACGGCCT GTACCGGCCC CGCCCGGCAG GAGCCCGCGG CGGCTCCCAG CGAGGAGGCC CAGACCGGGC CGCGGATCAG CGTCACCCCC GAGGAGGGGG CCTCCTCCGT CGCCCCCGAC ACCCCGGTGC GGGTGTCGGT GGAGGAGGGC TCGCTCACCG ACGTCCGCGT CGAGCAGGCC CCCTCCGCGG AGGAGGGCGG GGACGCCGGG GCGGCCGGGG ACGCCGAGCG GTGGGAGTTC ACCGGCACCC TCAGTGAGGA CGGCACCCGG TGGGTGAGCG ACTGGAACCT GGACCCGGGC TCGGCGGTCA CCGTCCGCGC CACCGCCGAG GACGACGCCG GTGAGGCCTC CGAGACGGTC GTGGAGTTCT CCACGAAGGA GGCCGTGCCC GGTCAGCGCC TCGAACTGGC CTCGAACTTC CCCACCTCCG GCGACACCGT CGGCGTGGGC ATGCCGGTCA TCGTCAACTT CGACCTGCCG GTGACCAACA AGGCCCAGGT GGAGAACTCC ATGGAGGTGA CCTCCGAGCA GGAGGTGGAG GGCGCCTGGA ACTGGGTCGG CGACAAGACC GCGGTGTTTC GCCCCCGCGA GTACTGGGAG CCCCACCAGC AGGTCAGCGT GGACATGCGC CTGTCGGGGG TGGAGGCCTC CGAGGGCGTC TACGGGATCG AGAACCACCG CCTGGAGTTC GAGGTCGGCC GCGAGATGGT CTCGACCATG CACGTGCCCG ACCACGAGAT GCTGGTGGAG ATCGACGGCG AGCCCGCGCG CACCATCCCC GTGAGCAACG GCGAGGCCTC CAAGCGCTTC AACACCACCA CCTCGGGGAC GCACCTGACC ATGGAGAAGT ACGAGTCCCT GGTCATGGAC GCGGCCACCC TGGGCATCCC CGAGGACTCG CCGGACTACT ACAAGCTGGA CGTGGACTGG GCGGTGCGCA CCTCCAACAG CGGCGAGTTC ACCCACGCCG CCCCCTGGAA CGACCGGATC GGGTCGGCCA ACACCTCCAA CGGCTGCACG AACATGTCGG TGGAGGACGC CCGCTGGTTC TACGAGAACT CCCTGATGGG CGACGTCCTG GAGACCACCG GGACCGACCG GGAGCTGGAG TGGGACAACG GCTGGGGTTT TTGGCAGCGG TCCTGGGACG AGTGGCTGTC CCACAGCGCC ACCGGTGAGC CGCAGGTGAC CGACGGGTCG GGCACCCCCG GTTCCGTGCA CGGCGAGGGG AACTAG
|
Protein sequence | MKGRVRTPRG LAAAVAVALA ATACTGPARQ EPAAAPSEEA QTGPRISVTP EEGASSVAPD TPVRVSVEEG SLTDVRVEQA PSAEEGGDAG AAGDAERWEF TGTLSEDGTR WVSDWNLDPG SAVTVRATAE DDAGEASETV VEFSTKEAVP GQRLELASNF PTSGDTVGVG MPVIVNFDLP VTNKAQVENS MEVTSEQEVE GAWNWVGDKT AVFRPREYWE PHQQVSVDMR LSGVEASEGV YGIENHRLEF EVGREMVSTM HVPDHEMLVE IDGEPARTIP VSNGEASKRF NTTTSGTHLT MEKYESLVMD AATLGIPEDS PDYYKLDVDW AVRTSNSGEF THAAPWNDRI GSANTSNGCT NMSVEDARWF YENSLMGDVL ETTGTDRELE WDNGWGFWQR SWDEWLSHSA TGEPQVTDGS GTPGSVHGEG N
|
| |