Gene Information |
Locus tag | Ndas_4612 |
Symbol | |
ID | 9248493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5476327 |
End bp | 5479158 |
Gene Length | 2832 bp |
Protein Length | 943 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | RNA polymerase, sigma-24 subunit, ECF subfamily |
Protein accession | YP_003682504 |
Protein GI | 297563530 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
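The reported gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the record does not state either convention explicitly, and the function names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive positions on the replicon.
START_BP = 5476327
END_BP = 5479158


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given inclusive start and end positions."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2832 943
```

Both values agree with the Gene Length (2832 bp) and Protein Length (943 aa) fields of the record.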
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0211251 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.677149 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACGACG TCCCCGACGA TCACACAGAG ACTCTGGTCG GTGACGGTGA GTTGATTCAG TGGGTGCGGG ACGGCGACAC CGGCGCCTAC GCCACCCTCT ACGAGCGCCA TGCCGCGGCG GCGCGCGGAC TGGCCCGGCA ACTGCTGCGC GGCGAGGCGG AGGTGGAGGA CGCCGCCGCC GAGGCGTTCA CCCGCGTGCT CAGCGTCATC CAGCGAGGAG GGGGCCCGCA GGACTCCTTC CGCCCCTACC TGCTCACCGC CGTCCGCAAC GCCGCCTACG ACCGGGGACG CGGGGAGAGG CGCCAGGTCG TCACCGACGA CATGGAGAGC CTCGTCCCCG GCCAGCCCTT CGTCGATCCC GCGCTCGAAG GGCTGGAGCG CTCCCTCATC GCCCGGGCCT TCCTCTCGCT CCCCGAGCGC TGGCAGTCGG TGCTCTGGCA CACCGAGATC GAGGGGGCCA AACCGGCCGA GGTCGCGGAC GTGCTCGGCA TGAAGCCGAA CGGCGTCGCC GCCCTCGCCT ACCGGGCGCG GGAGGGGCTG CGGCAGGCCT ACCTCCAGAT GCACCTGGCG GGCGGGAACG CCGCCGAGGC CTGCCGCCCG ACCCTCGGAC TGCTCGGCGC CCACGTCAGG GAGGGGCTGT CCAGGCGCGA CACCGCCAAG GTCGACCGGC ACATGGACGG CTGCGCCGAC TGCCGCGCGG TCTACGCCGA GCTGACCGAC GTCAACGTCG GCCTGCGCGG GGTCGTCCTG CCGCTGGTGG CGGGCGCGGG GGCGGCGGGC TACCTCTCCG CCACCCCCGC CGGGGGCGCC TGGTGGGGGA GGATGTCGCG CCGCCAGCAG CAGGCGGCGG CCGGAGGCAC GGCGGCGGCC GGTGTGGCGG TCGCGGTCGC CCTGGCCCTG ACCAGCGCGC CGGAACCGCT TCCCGAACAG CAGCCGCCCC CGGCGGCGGC CCCGTGGGAG CAGCCTCCGG CGCCGACCGC TCCGGACGAG CCCGAGCCCG CTCCGGACGC GCCGCGGCCC TCTCCTCCGG CCAGCGACCG GCCGCGCCCC GACGCGGACG AGCGGCCCGA GCCCGCCGAG CCGGTCCCGG CGGTGCCGCC CGCCGACGTG CCGGAGGAGG AGGTGGCGCA GGAGCCCGGG CCGCGGTTCG CCGCGGGGAT CGACCCGGTC GGCTCCCTGC TCCCGGGCAG CGAGGGGATC ATGGTCCTGG ACGTGCGCAA CATCGGCGGC GGCGCGGCCG AGGAGGTCGT CGCCCAGCTC ACCCTGCCGC CGGGCGTGGA GATGGTCTCC TCCGGCGGTG CGGGAAACGC GCTTCCCAGG GCGGTGGGAC ACGGCGACTG GAGCTGTTCC GCGGGCAGCG GGGGCGGCCG CTGCGCCCAC CCGGGGATGG CGGCGGGGGA GGACGGCACC CAGTTCATCG ACGTGCGCGT GGCCCCGGAC GCCGAGGTCG GGGTTCCGGC GACGGTGTCG GTGTCCGCCG CGGGCGTGAC CGCCGAGGCG ACGGGGGAGC GGGGCGTGAG CGCGGAGGGC GTCACGGCCC GCTACGCCAC CGCGGGCCGG GTACGCGCGG AGAGCGTCGG CAACGCCCTG ATGACCTGCG TCGAACCGGA GCCGAGGGGT CGCTGGCCGT GGCCGTGGTG GGACTGGCCG TACGCCCCGG ACGTCCCCGA CCCGCGTCCG CAGGGTCCCG GGACCGAGCC GGGCACGGAG TCCTCCCCCG CTCCGGGCGC CCCCGCGCCC CCGAGACCGG AGCCGCCGGA AGCCGAGGCG ACCATGGAGG AGGATGCGGT CCCTGGTCGG 
GAAGAGGGCG AAGGGGCTCC TGACGGGACG GTGGGCGCGC CGGACAACAA CGTGCCCACG GAAACGGCGC GCGGACCGTT CTCCGAAGGC GTGGCGCGGC ACGGCGGCCA CGGCCCGGCG ACGGTGGCCC ACCACGCCCC CGGGTCCCAC GGCGCCTCCG AGGCACAGAA CGCTCCCCGG GCGCAGGACG GCCCCGGGTT CCACGGCGCT CCCGAGGCCC AGGACGGCCC CTGCGCCCGG GCCAGGCTGC GCCAGGGGCC GCGCCTGGAC AACGACCACT GGACGATGGT CCCGCTGGAC GCCGACGACG ACCCCTCCAC CACCTCCTCC AGTTCGGCGA CCTGGGAACT CCCCGAGGGC GGCGGGGTGC GCTGGGCGGG GCTCTACTTC TCCGGGACCG GGACCCCCGA CGCCCCCTCC GTCCGGGTCA GGGGACCGGG CATGACGGAC TACCGCACCG TCGAGGCCAC CAGCAACCGT GTCGCCGAGC TGCCCGGCTA CCCCGCCTAC CAGGCGTTCG CCGAGGTCAC GGACCTGGTG CGGGCCCAGG GCGGCGGCCA GTGGTGGGTC GGCGACGCCC CCGTGAGCGA GGGCCGCGGC CACTACGCGG GCTGGAGCCT CGTGGTCGTG CTGGAGGACC CCCGGGTGGG CACCCGCAAC CAGGTGATGG TCCTCGACGA CACCCGGGTC TCCTTCCACG GCGGCGGGGG CGGCCCCTTC GCGGTGTCGG GCCTGCTGCC CGCCGCCGTA CCCGCCCGGA TCGACGTGGT GGCCTGGGAG GGCGACCCCG ACCTGGGCGG GGACCGGGTG ACCGTGGACG GCGCGGCGGC GGAGCCGGTC GGCGGCTACG GGCGGACGGA CAACGCGTTC ACCGGCTCGG CCCGCGGCGC GGTCGGCGAC CCGCTCGCGT TCGGCACCGA CGTGGTCCGA TTCGACTCAG TACTTGGCCG AGAAACGGAC ATCCGAATCC TGACCGAACA GGACGCCGTG ATGGTGGGGG CAGTGGTCCT GACGGCCCCC ATGCGTAGTT GA
|
Protein sequence | MNDVPDDHTE TLVGDGELIQ WVRDGDTGAY ATLYERHAAA ARGLARQLLR GEAEVEDAAA EAFTRVLSVI QRGGGPQDSF RPYLLTAVRN AAYDRGRGER RQVVTDDMES LVPGQPFVDP ALEGLERSLI ARAFLSLPER WQSVLWHTEI EGAKPAEVAD VLGMKPNGVA ALAYRAREGL RQAYLQMHLA GGNAAEACRP TLGLLGAHVR EGLSRRDTAK VDRHMDGCAD CRAVYAELTD VNVGLRGVVL PLVAGAGAAG YLSATPAGGA WWGRMSRRQQ QAAAGGTAAA GVAVAVALAL TSAPEPLPEQ QPPPAAAPWE QPPAPTAPDE PEPAPDAPRP SPPASDRPRP DADERPEPAE PVPAVPPADV PEEEVAQEPG PRFAAGIDPV GSLLPGSEGI MVLDVRNIGG GAAEEVVAQL TLPPGVEMVS SGGAGNALPR AVGHGDWSCS AGSGGGRCAH PGMAAGEDGT QFIDVRVAPD AEVGVPATVS VSAAGVTAEA TGERGVSAEG VTARYATAGR VRAESVGNAL MTCVEPEPRG RWPWPWWDWP YAPDVPDPRP QGPGTEPGTE SSPAPGAPAP PRPEPPEAEA TMEEDAVPGR EEGEGAPDGT VGAPDNNVPT ETARGPFSEG VARHGGHGPA TVAHHAPGSH GASEAQNAPR AQDGPGFHGA PEAQDGPCAR ARLRQGPRLD NDHWTMVPLD ADDDPSTTSS SSATWELPEG GGVRWAGLYF SGTGTPDAPS VRVRGPGMTD YRTVEATSNR VAELPGYPAY QAFAEVTDLV RAQGGGQWWV GDAPVSEGRG HYAGWSLVVV LEDPRVGTRN QVMVLDDTRV SFHGGGGGPF AVSGLLPAAV PARIDVVAWE GDPDLGGDRV TVDGAAAEPV GGYGRTDNAF TGSARGAVGD PLAFGTDVVR FDSVLGRETD IRILTEQDAV MVGAVVLTAP MRS
|
| |
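The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how the gene sequence could be cleaned and translated is given below. It assumes the gene sequence is already shown in coding orientation (it begins with GTG and ends with TGA), uses the amino acid assignments of translation table 11, and reads the first codon as M, as is usual for an initiator codon. The helper names are hypothetical and not part of the source database.

```python
from itertools import product

# Amino acid assignments for translation table 11, in TCAG codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(cds: str) -> str:
    """Translate a CDS given in coding orientation, stopping at the first stop.

    Assumption: the first codon is an initiator and is read as M,
    even when it is an alternative start such as GTG.
    """
    cds = clean(cds)
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    residues = []
    for index in range(0, usable, 3):
        aa = CODON_TABLE[cds[index:index + 3]]
        if aa == "*":
            break
        residues.append("M" if index == 0 else aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = "GTGAACGACG TCCCCGACGA TCACACAGAG"
print(translate(prefix))    # MNDVPDDHTE
print(gc_fraction(prefix))  # 0.6
```

The translated prefix matches the first block of the protein sequence in the record (MNDVPDDHTE). Running `gc_fraction` on the full gene sequence would give a value to compare with the reported GC content.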