Gene Ndas_4612 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4612 
Symbol 
ID9248493 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5476327 
End bp5479158 
Gene Length2832 bp 
Protein Length943 aa 
Translation table11 
GC content76% 
IMG OID 
ProductRNA polymerase, sigma-24 subunit, ECF subfamily 
Protein accessionYP_003682504 
Protein GI297563530 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0211251 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.677149 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGACG TCCCCGACGA TCACACAGAG ACTCTGGTCG GTGACGGTGA GTTGATTCAG 
TGGGTGCGGG ACGGCGACAC CGGCGCCTAC GCCACCCTCT ACGAGCGCCA TGCCGCGGCG
GCGCGCGGAC TGGCCCGGCA ACTGCTGCGC GGCGAGGCGG AGGTGGAGGA CGCCGCCGCC
GAGGCGTTCA CCCGCGTGCT CAGCGTCATC CAGCGAGGAG GGGGCCCGCA GGACTCCTTC
CGCCCCTACC TGCTCACCGC CGTCCGCAAC GCCGCCTACG ACCGGGGACG CGGGGAGAGG
CGCCAGGTCG TCACCGACGA CATGGAGAGC CTCGTCCCCG GCCAGCCCTT CGTCGATCCC
GCGCTCGAAG GGCTGGAGCG CTCCCTCATC GCCCGGGCCT TCCTCTCGCT CCCCGAGCGC
TGGCAGTCGG TGCTCTGGCA CACCGAGATC GAGGGGGCCA AACCGGCCGA GGTCGCGGAC
GTGCTCGGCA TGAAGCCGAA CGGCGTCGCC GCCCTCGCCT ACCGGGCGCG GGAGGGGCTG
CGGCAGGCCT ACCTCCAGAT GCACCTGGCG GGCGGGAACG CCGCCGAGGC CTGCCGCCCG
ACCCTCGGAC TGCTCGGCGC CCACGTCAGG GAGGGGCTGT CCAGGCGCGA CACCGCCAAG
GTCGACCGGC ACATGGACGG CTGCGCCGAC TGCCGCGCGG TCTACGCCGA GCTGACCGAC
GTCAACGTCG GCCTGCGCGG GGTCGTCCTG CCGCTGGTGG CGGGCGCGGG GGCGGCGGGC
TACCTCTCCG CCACCCCCGC CGGGGGCGCC TGGTGGGGGA GGATGTCGCG CCGCCAGCAG
CAGGCGGCGG CCGGAGGCAC GGCGGCGGCC GGTGTGGCGG TCGCGGTCGC CCTGGCCCTG
ACCAGCGCGC CGGAACCGCT TCCCGAACAG CAGCCGCCCC CGGCGGCGGC CCCGTGGGAG
CAGCCTCCGG CGCCGACCGC TCCGGACGAG CCCGAGCCCG CTCCGGACGC GCCGCGGCCC
TCTCCTCCGG CCAGCGACCG GCCGCGCCCC GACGCGGACG AGCGGCCCGA GCCCGCCGAG
CCGGTCCCGG CGGTGCCGCC CGCCGACGTG CCGGAGGAGG AGGTGGCGCA GGAGCCCGGG
CCGCGGTTCG CCGCGGGGAT CGACCCGGTC GGCTCCCTGC TCCCGGGCAG CGAGGGGATC
ATGGTCCTGG ACGTGCGCAA CATCGGCGGC GGCGCGGCCG AGGAGGTCGT CGCCCAGCTC
ACCCTGCCGC CGGGCGTGGA GATGGTCTCC TCCGGCGGTG CGGGAAACGC GCTTCCCAGG
GCGGTGGGAC ACGGCGACTG GAGCTGTTCC GCGGGCAGCG GGGGCGGCCG CTGCGCCCAC
CCGGGGATGG CGGCGGGGGA GGACGGCACC CAGTTCATCG ACGTGCGCGT GGCCCCGGAC
GCCGAGGTCG GGGTTCCGGC GACGGTGTCG GTGTCCGCCG CGGGCGTGAC CGCCGAGGCG
ACGGGGGAGC GGGGCGTGAG CGCGGAGGGC GTCACGGCCC GCTACGCCAC CGCGGGCCGG
GTACGCGCGG AGAGCGTCGG CAACGCCCTG ATGACCTGCG TCGAACCGGA GCCGAGGGGT
CGCTGGCCGT GGCCGTGGTG GGACTGGCCG TACGCCCCGG ACGTCCCCGA CCCGCGTCCG
CAGGGTCCCG GGACCGAGCC GGGCACGGAG TCCTCCCCCG CTCCGGGCGC CCCCGCGCCC
CCGAGACCGG AGCCGCCGGA AGCCGAGGCG ACCATGGAGG AGGATGCGGT CCCTGGTCGG
GAAGAGGGCG AAGGGGCTCC TGACGGGACG GTGGGCGCGC CGGACAACAA CGTGCCCACG
GAAACGGCGC GCGGACCGTT CTCCGAAGGC GTGGCGCGGC ACGGCGGCCA CGGCCCGGCG
ACGGTGGCCC ACCACGCCCC CGGGTCCCAC GGCGCCTCCG AGGCACAGAA CGCTCCCCGG
GCGCAGGACG GCCCCGGGTT CCACGGCGCT CCCGAGGCCC AGGACGGCCC CTGCGCCCGG
GCCAGGCTGC GCCAGGGGCC GCGCCTGGAC AACGACCACT GGACGATGGT CCCGCTGGAC
GCCGACGACG ACCCCTCCAC CACCTCCTCC AGTTCGGCGA CCTGGGAACT CCCCGAGGGC
GGCGGGGTGC GCTGGGCGGG GCTCTACTTC TCCGGGACCG GGACCCCCGA CGCCCCCTCC
GTCCGGGTCA GGGGACCGGG CATGACGGAC TACCGCACCG TCGAGGCCAC CAGCAACCGT
GTCGCCGAGC TGCCCGGCTA CCCCGCCTAC CAGGCGTTCG CCGAGGTCAC GGACCTGGTG
CGGGCCCAGG GCGGCGGCCA GTGGTGGGTC GGCGACGCCC CCGTGAGCGA GGGCCGCGGC
CACTACGCGG GCTGGAGCCT CGTGGTCGTG CTGGAGGACC CCCGGGTGGG CACCCGCAAC
CAGGTGATGG TCCTCGACGA CACCCGGGTC TCCTTCCACG GCGGCGGGGG CGGCCCCTTC
GCGGTGTCGG GCCTGCTGCC CGCCGCCGTA CCCGCCCGGA TCGACGTGGT GGCCTGGGAG
GGCGACCCCG ACCTGGGCGG GGACCGGGTG ACCGTGGACG GCGCGGCGGC GGAGCCGGTC
GGCGGCTACG GGCGGACGGA CAACGCGTTC ACCGGCTCGG CCCGCGGCGC GGTCGGCGAC
CCGCTCGCGT TCGGCACCGA CGTGGTCCGA TTCGACTCAG TACTTGGCCG AGAAACGGAC
ATCCGAATCC TGACCGAACA GGACGCCGTG ATGGTGGGGG CAGTGGTCCT GACGGCCCCC
ATGCGTAGTT GA
 
Protein sequence
MNDVPDDHTE TLVGDGELIQ WVRDGDTGAY ATLYERHAAA ARGLARQLLR GEAEVEDAAA 
EAFTRVLSVI QRGGGPQDSF RPYLLTAVRN AAYDRGRGER RQVVTDDMES LVPGQPFVDP
ALEGLERSLI ARAFLSLPER WQSVLWHTEI EGAKPAEVAD VLGMKPNGVA ALAYRAREGL
RQAYLQMHLA GGNAAEACRP TLGLLGAHVR EGLSRRDTAK VDRHMDGCAD CRAVYAELTD
VNVGLRGVVL PLVAGAGAAG YLSATPAGGA WWGRMSRRQQ QAAAGGTAAA GVAVAVALAL
TSAPEPLPEQ QPPPAAAPWE QPPAPTAPDE PEPAPDAPRP SPPASDRPRP DADERPEPAE
PVPAVPPADV PEEEVAQEPG PRFAAGIDPV GSLLPGSEGI MVLDVRNIGG GAAEEVVAQL
TLPPGVEMVS SGGAGNALPR AVGHGDWSCS AGSGGGRCAH PGMAAGEDGT QFIDVRVAPD
AEVGVPATVS VSAAGVTAEA TGERGVSAEG VTARYATAGR VRAESVGNAL MTCVEPEPRG
RWPWPWWDWP YAPDVPDPRP QGPGTEPGTE SSPAPGAPAP PRPEPPEAEA TMEEDAVPGR
EEGEGAPDGT VGAPDNNVPT ETARGPFSEG VARHGGHGPA TVAHHAPGSH GASEAQNAPR
AQDGPGFHGA PEAQDGPCAR ARLRQGPRLD NDHWTMVPLD ADDDPSTTSS SSATWELPEG
GGVRWAGLYF SGTGTPDAPS VRVRGPGMTD YRTVEATSNR VAELPGYPAY QAFAEVTDLV
RAQGGGQWWV GDAPVSEGRG HYAGWSLVVV LEDPRVGTRN QVMVLDDTRV SFHGGGGGPF
AVSGLLPAAV PARIDVVAWE GDPDLGGDRV TVDGAAAEPV GGYGRTDNAF TGSARGAVGD
PLAFGTDVVR FDSVLGRETD IRILTEQDAV MVGAVVLTAP MRS