Gene Ndas_5116 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5116 
Symbol 
ID9249009 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp261850 
End bp265317 
Gene Length3468 bp 
Protein Length1155 aa 
Translation table11 
GC content67% 
IMG OID 
ProductDNA-directed RNA polymerase, beta subunit 
Protein accessionYP_003683003 
Protein GI297564030 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.303948 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCAGCCT CGCGCAACGC CTCCGCTAAC GCCCTTGGTC CGCACCGCGT TTCCTTCGCC 
CGTATCCAGG AACCACTGGA GGTCCCGAAC CTCCTCGCCC TTCAGACCGA GTCGTTCGAC
TGGCTGCTGG GCAACGACAG GTGGAAGACC CGGGTTGAGA CGGCCCGAAA CGCTGGCCGC
AAGGACGTTC CGGAGCAGTC CGGTCTCGAA GAGATCTTCG AGGAGATCAG CCCGATCGAG
GACTTCTCGG GCACGATGTC CCTGTCGTTC CGCGACCACC GGTTCGAGCC GCCCAAGTAC
TCAGAGGATG AGTGCAAGGA CAGGGACATG ACCTACTCCG CCCCGATGTT CGTCACGGCG
GAGTTCATCA ACAACGACAC CGGTGAGATC AAGAGCCAGA CGGTGTTCAT GGGCGACTTC
CCGCTCATGA CCGAGACCGG CACCTTCATC ATCAACGGCA CCGAGCGCGT CGTCGTGTCC
CAGCTGGTGC GCTCCCCGGG CGTGTACTTC GACAAGTCCG TCGACAAGAC CTCGGACAAG
GACCTCTACG GCTGCAAGGT CATCCCGTCC CGCGGCGCCT GGCTGGAGTT CGAGGTCGAC
AAGCGCGACT TCGTCGGCGT CCGCATCGAC CGCAAGCGCA AGCAGGGCGT CACCGTCCTG
CTCAAGGCGC TGGGCTGGAC CACCGACCAG ATCCTGGAGC GCTTCGGCCA GTACGAGTCC
ATCCGCAACA CGCTGGAGAA GGACCCCACC GCGGGCACCG ACGACGCGCT GCTGGACATC
TACCGCAAGC TGCGCCCCGG CGAGCCGCCC ACGAAGGAGT CGGCCCAGGC GCTGCTGGAG
AACCTGTACT TCAACCCCAA GCGCTACGAC CTGGCCAAGG TCGGCCGTTA CAAGATCAAC
AAGAAGCTCG GCCTGGACAC CGACTACCGG CAGGGCACCC TCACCGAAGA GGACATCGTC
GCCACGATCG ACTACCTCGT CCGCCTGCAC GCGGGTGAGA CCGAGAAGGA GACCGTCAAC
GGTCCCCGGC CGATCGAGAC CGACGACATC GACCACTTCG GCAACCGCCG CCTGCGCACC
GTCGGCGAGC TCATCCAGAA CCAGGTCCGC CTGGGCCTGG CCCGCATGGA GCGCGTCGTC
CGCGAGCGGA TGACCACCCA GGACGTCGAG GCGATCACGC CGCAGACCCT GATCAACATC
CGTCCGGTCG TGGCCTCCAT CAAGGAGTTC TTCGGCACCT CGCAGCTGTC GCAGTTCATG
GGCCAGACCA ACCCGCTGGA GGGCCTGACG CACAAGCGCC GCCTCTCCGC GCTGGGCCCG
GGCGGCCTGT CCCGCGAGCG CGCGGGCTTC GAGGTCCGCG ACGTGCACCC CTCGCACTAC
GGCCGCATGT GCCCGATCGA GACGCCCGAG GGACCGAACA TCGGTCTGAT CGGCTCGCTC
GCCGCCTACG GCCGGGTCAA CTCCTTCGGC TTCGTGGAGA CCCCGTACCG CCGCATCATC
GACGGCAAGG TCTCCGACCA GGTCGACTAC CTGACCGCGG ACGAGGAGGA CCTCCACGTC
ATCGCGCAGG CCAACACGCC GATGAACCCC GACGGCTCCT TCGCCGAGGC CGGTGTGCTC
GTGCGCCGCA AGGGCGGTGA GTTCGAGCAG GTCAGCACGG ACGAGGTCGA CTACATGGAC
GTGTCGCCGC GCCAGATGGT GTCGGTCGCC ACCGCCATGA TCCCGTTCCT GGAGCACGAC
GACGCCAACC GCGCCCTCAT GGGCTCGAAC ATGCAGCGCC AGGCCGTGCC GCTGCTCATG
GCCGAGTCGC CGTTCGTCGG CACCGGCATG GAGTACCGCG CCGCCACCGA CGCCGGTGAG
GTCGTCCTGG CGCAGAAGGC CGGTGTCGTC GAGGACGTCA CCGCCGACTA CGTCACCGTC
ATGGCCGACG ACGGCACGCG CAAGACGTAC CGCATGGGCA AGTTCCAGCG CTCCAACCAG
GGCACCTGCT TCAACCAGCG GCCGATCGTG GCCGAGGGCC AGCGGGTCGA GGAGCGCCAG
GTCCTCGCGG ACGGCCCCTC CACCGACCAG GGTGAGATGT CGCTGGGCAA GAACCTGCTC
GTGGCGTACA TGTCCTGGGA GGGCCACAAC TACGAGGACG CGATCATCCT CTCCCAGCGC
CTGGTGCAGG ACGACGTCCT GTCCTCGATC CACATCGAGG AGCACGAGGT CGACGCCCGC
GACACCAAGC TGGGCCCGGA GGAGATCACC CGCGAGATCC CCAACGTCAG CGAGGAGGTC
CTGGCCGACC TCGACGACCG GGGCATCATC CGCATCGGCG CCGAGGTCGT GGACGGCGAC
ATCCTCGTCG GCAAGGTCAC GCCCAAGGGC GAGACCGAGC TGACCCCGGA GGAGCGCCTG
CTGCGCGCGA TCTTCGGAGA GAAGGCGCGC GAGGTCCGCG ACACCTCGCT CAAGGTGCCG
CACGGCGAGA CCGGCAAGGT CATCGGCGTC CGCGTGTTCA GCCGCGAGGA CGGCGACGAG
CTCGCCCCCG GCGTCAACGA GATGGTCCGC GTCTACGTGG CCCAGAAGCG CAAGATCACC
GACGGTGACA AGCTCGCGGG CCGCCACGGC AACAAGGGCG TCATCTCCAA GATCCTGCCC
CAGGAGGACA TGCCCTTCCT GGAGGACGGG ACGCCGGTCG ACATCATCCT CAACCCGCTG
GGCGTGCCCG GCCGTATGAA CGTCGGCCAG GTCCTGGAGG TCCACCTGGG CTGGTTGGCC
AAGAACGGCT GGCTGGTCGA GGGGGTCGAG GAGGAGTGGC AGAAGTCGCT GGAGGCCATC
GGCGCCACCG ACGTGCCGCC GGACTCCCGC GTGGCCACGC CGGTCTTCGA CGGCCTGCGC
GGCGACGAGC TCAGCGGCCT GATCAAGTCG GTGCGCCCCA ACGCGGACGG CAACCGCCTG
ATCAACGAGG ACGGCAAGGC GCGTCTGTTC GACGGCCGCA CCGGCGAGCC CTTCGCCGAG
CCGATCTCCG TCGGCTACAA GTACATCCTC AAGCTCCACC ACCTCGTGGA CGACAAGATC
CACGCGCGCT CCACCGGCCC GTACTCCATG ATCACCCAGC AGCCGCTGGG CGGTAAGGCG
CAGTTCGGCG GTCAGCGCTT CGGTGAGATG GAGGTGTGGG CGCTGGAGGC CTACGGCGCC
GCCTACGCCC TCCAGGAGCT GCTCACCATC AAGTCCGACG ACGTGGTGGG ACGGGTCAAG
GTCTACGAGG CCATCGTCAA GGGCGAGAAC ATCCCCGAGC CGGGCATCCC TGAGTCCTTC
AAGGTGCTCA TCAAGGAGAT GCAGTCGCTC TGTCTGAACG TGGAGGTGCT GTCCAGTGAC
GGTATGTCCA TCGAGATGCG GGACAGCGAC GAGGACGTCT TCCGCGCGGC GGAAGAACTG
GGAATCGACC TGGGTCGGCG GGAGCCGAGC AGTGTCGAAG AGGTCTAA
 
Protein sequence
MAASRNASAN ALGPHRVSFA RIQEPLEVPN LLALQTESFD WLLGNDRWKT RVETARNAGR 
KDVPEQSGLE EIFEEISPIE DFSGTMSLSF RDHRFEPPKY SEDECKDRDM TYSAPMFVTA
EFINNDTGEI KSQTVFMGDF PLMTETGTFI INGTERVVVS QLVRSPGVYF DKSVDKTSDK
DLYGCKVIPS RGAWLEFEVD KRDFVGVRID RKRKQGVTVL LKALGWTTDQ ILERFGQYES
IRNTLEKDPT AGTDDALLDI YRKLRPGEPP TKESAQALLE NLYFNPKRYD LAKVGRYKIN
KKLGLDTDYR QGTLTEEDIV ATIDYLVRLH AGETEKETVN GPRPIETDDI DHFGNRRLRT
VGELIQNQVR LGLARMERVV RERMTTQDVE AITPQTLINI RPVVASIKEF FGTSQLSQFM
GQTNPLEGLT HKRRLSALGP GGLSRERAGF EVRDVHPSHY GRMCPIETPE GPNIGLIGSL
AAYGRVNSFG FVETPYRRII DGKVSDQVDY LTADEEDLHV IAQANTPMNP DGSFAEAGVL
VRRKGGEFEQ VSTDEVDYMD VSPRQMVSVA TAMIPFLEHD DANRALMGSN MQRQAVPLLM
AESPFVGTGM EYRAATDAGE VVLAQKAGVV EDVTADYVTV MADDGTRKTY RMGKFQRSNQ
GTCFNQRPIV AEGQRVEERQ VLADGPSTDQ GEMSLGKNLL VAYMSWEGHN YEDAIILSQR
LVQDDVLSSI HIEEHEVDAR DTKLGPEEIT REIPNVSEEV LADLDDRGII RIGAEVVDGD
ILVGKVTPKG ETELTPEERL LRAIFGEKAR EVRDTSLKVP HGETGKVIGV RVFSREDGDE
LAPGVNEMVR VYVAQKRKIT DGDKLAGRHG NKGVISKILP QEDMPFLEDG TPVDIILNPL
GVPGRMNVGQ VLEVHLGWLA KNGWLVEGVE EEWQKSLEAI GATDVPPDSR VATPVFDGLR
GDELSGLIKS VRPNADGNRL INEDGKARLF DGRTGEPFAE PISVGYKYIL KLHHLVDDKI
HARSTGPYSM ITQQPLGGKA QFGGQRFGEM EVWALEAYGA AYALQELLTI KSDDVVGRVK
VYEAIVKGEN IPEPGIPESF KVLIKEMQSL CLNVEVLSSD GMSIEMRDSD EDVFRAAEEL
GIDLGRREPS SVEEV