Gene Ndas_4911 details

Gene Information

Locus tag: Ndas_4911
Symbol:
ID: 9248798
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 42718
End bp: 45729
Gene Length: 3012 bp
Protein Length: 1003 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: Beta-galactosidase
Protein accession: YP_003682800
Protein GI: 297563827
COG category:
COG ID:
TIGRFAM ID:
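
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 42718
end_bp = 45729
gene_length_bp = 3012
protein_length_aa = 1003

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is a whole number of codons; the final codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                  # coordinates agree with gene length
print(remainder == 0)                             # length is a multiple of three
print(expected_protein_aa == protein_length_aa)   # codons minus stop equals protein length
```

Under these assumptions the three fields are mutually consistent: 3012 bp is 1004 codons, and 1004 codons minus one stop codon is 1003 aa.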


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.114803
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.456703
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCGCCA TCCGCAAGCC CGCCTACCTG GAGGACTTCG CGCCCTCCGA GGGGACCCTC 
GCGCCCCGCG CCCGCTTCTC CTCCGACGCC CCGTCCCTGG ACCTCAACGG CACGTGGCGG
TTCCGGCTGC TCGACCGCGC GCGGGCCGAC ACCGACGGGT TCGAGGAGCC CGGCCACGAC
GACTCCGACT GGTCCGAACT GCCGGTGCCC GCCCACTGGC AGATGCACGG CCACGGCGCC
CCCGCCTACA CCAACATCTC CTACCCCTTC CCGATCGACC CCCCGTACGT GCCCGATGAC
AACCCGACCG GCGACCACCG GCGCGTCTTC GACCTTCCCC GGGAGTGGCC CGAGGGGGCG
GCGGTCCTGC GCTTCGACGG GGTCGACTCC TGCTTCCGCG CGTGGCTGAA CGGCACCGAG
CTGGGCTTCT CCACCGGGAG CCGCCTGGCC GCCGAGTTCG AGGTCGGCCA CCTGCTGCGC
CCGGGGCGCA ACACCCTGGC GGTGCGCGTC CACCAGTGGT CGGCGGCCAG CTACCTGGAG
GACCAGGACA TGTGGTGGCT GTCGGGGATC TTCCGCGACG TCACCCTCCT CGCCCGGCCC
GCCGACGGGG TCGGCGACCT CGCGGTGAGC GCCGCCTACG ACCACACCAC GGGCACGGGT
ACCCTGCGCG TGGACGTGGC CGGATCCGCC GACGCCCTGG TCAGCGTTCC CGAACTGGGG
CTGGTGGACG CCGCCGCCGG TGTGGACCAC GGGGTCGGCG CGGTCGAGCC GTGGACGGCC
GAGACGCCCC GCCTGTACGA CCTGGAGGTG CGCACACCGG GGGAACGGGT CCGGACGCGG
ACGGGCTTTC GGACCGTGGA GGTCTCCGGC GGGCTCCTGC GGGTCAACGG CCGCCCCCTG
CTGCTGCGGG GCGTGAACCG CCACGAGTGG CACCCCGACC ACGGCCGCGC CGTGCCCCGG
GAGACCATGC GCGAGGACGT GCTGCTGATG AAGCGGCACA ACGTCAACGC CGTGCGCACC
AGCCACTACC CGCCCCACCC GGACTTCCTG GACCTGTGCG ACGAGCTGGG CCTGTGGGTC
GTGGACGAGT GCGACCTGGA GACCCACGGG TTCGAGGAGG TCGGCTGGCG CGGCAACCCC
TCCGACGACC CGCGCTGGCG CGAGGCCTAC CTGGACCGGA TGCGCCGCAC CGTCGCGCGC
GACCGCAACC ACCCCAGCGT CGTCCTGTGG TCCCTGGGCA ACGAGTCCGG GACCGGGGAG
AACCTGCGGG CCATGGCGGA GTGGACCCGC GCACACGACC CCTCCCGGCC GATCCACTAC
GAGGGGGACC GAGACAGCGC CTACGTGGAC GTGTACTCGC GCATGTACGC GCCGCACGAG
GAGGTGGACG CCATCGGCCG CCGGGAGGAG GCGCCGACCG CCGACCCGGC CGCCGACGGG
CACCGGCGCG GACTGCCGTT CGTCCTGTGC GAGTACGGGC ACGCCATGGG CACCGGCCCC
GGCGGGCTGG AGGAGTACCA GCGGCTCTTC GAGGAGCACG AGCGCTGCCA GGGCGGGTTC
GTCTGGGAGT GGATCGACCA CGGGGTCCGC CGCCGCGAGC CGGACGGCAC CGCGTGGTTC
GCCTACGGCG GCGACTTCGG CGAGGCCCTG CACGACGGCA ACTTCGTCGC GGACGGCCTG
CTGCTGCCCG ACCGCACGCC CAGTCCCGGG CTGGAGGCCT ACGCCAAGAC CGTCGAACCG
GTCCGCGTGG TCCCCGACCC GGCCGCCGGG ACCGTCCGCG TCACCAACAC CTGGGACTTC
CGGGACACCG CGGGCCTGGC CTTCGTCTGG CGGGTCGAGG GGGAGGGGAT CCCCCTGGGG
GAGGGCCCCC TGGACGTGCC GGTGCTCGCC GCCGGGGAGA GCGCCGAGGC GGTGCTGCCC
GCACTGCCCG AGCCGTCCGG GGAGACCTGG CTGACCGTGT CCGTCCGGCT GGCCGAGGAC
CAGCCCTGGG CCCCGGCCGG GCACGAGCTG GCCTGGGGAC AGGCCGAGGT GGGCGCCCCC
GGGGAGCCGG GCGCGGCAGC GCCGGGGGTT CCGGGTGGGC AGGAAGGGAC GAGTGCGTCG
GAGAGGTCGG GCGCGCTGCG AGAGCCGCGG GAGCCGGGGG ACCCGGGTGC GTGGCCGGGG
GCGGAGGGAG CTGTCGTGCT TCCGGCGTCC GCCGGGCAGG ACGGCATCGT GCGGATCGGC
CCCGGCCTGC TGGACGCGGG GACCGGCGCG CTGCTCTCCG TGGGCGGCCT GCCCGTGGAC
GCGCCGGCCG TGGACCTGTG GCGCGCTCCG ACCGACAACG ACGTGGCCCG GCACGGGGAC
TCCGTGGCCG TCCCCTGGCG CGCGGCGGGG CTGCACCGGC TCACCGAGCG CCTGCTGTCC
ACCGGCTGGG AGGACGGCGC GTTCGTCGTC CGTACACGGC TGGCTCCGGC GGCCCAGGAG
TTCGGCATGC GCGCCGCCTA CCGGTGGGGC GCCGACGGGG AGCGCCTGCT CCTGACCGTG
GAGGCCGAGG CGGAGGGGGA GTGGCCCTGC CCGCTGCCGC GCTTCGGCGT GCACGTCGGC
CTGCCCGCCT CCCTGGGCGA GGCGGAGTGG TTCGGCCGCG GACCCGGGGA GGCCTACCGG
GACGTGTCCC AGGCCGCGCG GGTCGGCCGC TTCTCCAGCA CGGTCGACGG CCTCCAGACG
CCCTACCTGC GGCCCCAGGA GAACGGCAAC CGCGTGGACG CCAGGTGGCT CGCCCTGCGG
GCGGCGGACG GGACGGGCCT GCGGGTGGAG GGCGACCCGG TGTTCGACTT CACCGCGCGC
CGCTGGGACA CGGCGGCGCT CGACGCGGCC GACCACCCGC ACGAACTGGT CCCCGGCGAC
CGGATCCACC TGCACCTGGA CCAGGCGCAC CAGGGCGTGG GCTCGGCCTC GTGCGGCCCC
GGGGTGCTCC CCGCCCACCG CCTTGCGGCC GGGAGCCACC GCCTCCGGCT CGCCCTGGTC
CCGCTGGGCT GA
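
The GC content field can be recomputed from a sequence laid out in space-separated 10-base blocks like the one above. This is a minimal sketch; `gc_percent` is a hypothetical helper name, and the short example uses only the first block of the gene sequence rather than the full gene.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(dna.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence above: 8 of its 10 bases are G or C.
first_block = "GTGCCCGCCA"
print(gc_percent(first_block))  # 80.0
```

Passing the complete gene sequence to the same function gives a value that can be compared with the GC content listed in the record.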
 
Protein sequence
MPAIRKPAYL EDFAPSEGTL APRARFSSDA PSLDLNGTWR FRLLDRARAD TDGFEEPGHD 
DSDWSELPVP AHWQMHGHGA PAYTNISYPF PIDPPYVPDD NPTGDHRRVF DLPREWPEGA
AVLRFDGVDS CFRAWLNGTE LGFSTGSRLA AEFEVGHLLR PGRNTLAVRV HQWSAASYLE
DQDMWWLSGI FRDVTLLARP ADGVGDLAVS AAYDHTTGTG TLRVDVAGSA DALVSVPELG
LVDAAAGVDH GVGAVEPWTA ETPRLYDLEV RTPGERVRTR TGFRTVEVSG GLLRVNGRPL
LLRGVNRHEW HPDHGRAVPR ETMREDVLLM KRHNVNAVRT SHYPPHPDFL DLCDELGLWV
VDECDLETHG FEEVGWRGNP SDDPRWREAY LDRMRRTVAR DRNHPSVVLW SLGNESGTGE
NLRAMAEWTR AHDPSRPIHY EGDRDSAYVD VYSRMYAPHE EVDAIGRREE APTADPAADG
HRRGLPFVLC EYGHAMGTGP GGLEEYQRLF EEHERCQGGF VWEWIDHGVR RREPDGTAWF
AYGGDFGEAL HDGNFVADGL LLPDRTPSPG LEAYAKTVEP VRVVPDPAAG TVRVTNTWDF
RDTAGLAFVW RVEGEGIPLG EGPLDVPVLA AGESAEAVLP ALPEPSGETW LTVSVRLAED
QPWAPAGHEL AWGQAEVGAP GEPGAAAPGV PGGQEGTSAS ERSGALREPR EPGDPGAWPG
AEGAVVLPAS AGQDGIVRIG PGLLDAGTGA LLSVGGLPVD APAVDLWRAP TDNDVARHGD
SVAVPWRAAG LHRLTERLLS TGWEDGAFVV RTRLAPAAQE FGMRAAYRWG ADGERLLLTV
EAEAEGEWPC PLPRFGVHVG LPASLGEAEW FGRGPGEAYR DVSQAARVGR FSSTVDGLQT
PYLRPQENGN RVDARWLALR AADGTGLRVE GDPVFDFTAR RWDTAALDAA DHPHELVPGD
RIHLHLDQAH QGVGSASCGP GVLPAHRLAA GSHRLRLALV PLG
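
The gene sequence begins with GTG, yet the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is among the permitted start codons, and an initiator codon is read as methionine. The sketch below illustrates this on the first and last blocks of the gene sequence. The names `translate_cds` and `START_CODONS` are hypothetical, the codon table is the standard one written out by hand, and only a subset of the table 11 start codons is listed.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Subset of the start codons permitted by translation table 11 (assumption: enough for this sketch).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading an initiator codon as M and stopping at a stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for index in range(0, len(bases) - 2, 3):
        codon = bases[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons still initiate with methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate_cds("GTGCCCGCCA TCCGCAAGCC CGCCTACCTG"))  # MPAIRKPAYL
# Last blocks of the gene sequence, ending in the TGA stop codon.
print(translate_cds("CCGCTGGGCT GA"))  # PLG
```

Both outputs match the corresponding ends of the protein sequence above. The same function can be applied to the full gene sequence to compare against the full protein sequence.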