Gene Ndas_5560 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5560 
Symbol 
ID9249463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp756263 
End bp758266 
Gene Length2004 bp 
Protein Length667 aa 
Translation table11 
GC content76% 
IMG OID 
ProductBeta-galactosidase 
Protein accessionYP_003683445 
Protein GI297564472 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.3655 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTCTC CGCTGTGGTT CGGCGGCGAC TACAACCCCG AGCAGTGGCC GGAGGAGGTC 
CAGGCGGAGG ACGTGGAGCT GATGCGCCGC GCCGGGGTCA ACCTCGTGAC GGTCGGCGTC
TTCTCCTGGG CGCTCCTGGA GCCGAAGGAG GGCGAGTACC GGTTCGAGTG GCTGGACCGG
GTGCTGGACC GGCTCGCCGA CGCCGGGATC GGGGTCGCGC TGGCCACCCC GACCGCCTCT
CCCCCGCCGT GGTTCGGGCT CGCCCACCCC GACGCGATGC CGGTGACCGC GGACGGGACC
CGGCTCACCC ATGGCAGCCG CGACACCTAC GACGTGTGCT CCCCCGCCTA CCGGGAGGCC
TCGGTGCGCA TCGCCCGCGC GCTGGCGGAG CGCTACTCCG GGCACCCGGC GCTGCGCCTG
TGGCACGTGC ACAACGAGTA CGGGACGTGG TCGCACTCCG AGCACACCGC CGGGGCCTTC
CGCGACTGGC TGCGGGGTCG CCACGTGGAC CTGGACCGGC TCAACGCGGC GTGGACCACG
GCGTTCTGGA GCCAGCACTA CTCGGAGTGG GAGCAGATCC AGCCGCCGCG GGCGACCCAG
TACCTGCCCA ACCCGGCGCA CGTGCTGGAC TTCCGGCGGT TCCTGTCGGA CGCGATGCTC
GACCACTTCC TGTCCCAGCG CGACGTGCTG CGCGCCGTCC GGCCGGACGT GCCGGTCACG
ACCAACCTGG CCTTCGGCGA CTGGGTGCCG GTGGACCCGT GGCGGTGGGC CGAGCACCTG
GACCTGGTGG CCGTGGACGA CTACCCGGAC CGGACGGGGC AGGGCGGGGC GGAGCAGACC
GCCTTCGCCG CCGATCTGGC CCGCTCGTGG GCGGAGCGCG TGCCCGGCCC GGGTCGGCCG
TGGCTGTTGA TGGAACAGGC GGCGGGTGTG ACCTACACGG GCGAGGTCAC CCGGCCGAAG
GCCCCCGGGG AGACGGCGCG GCACAGCCTG GCGCACGTGG CGCGCGGGTC GCGGGGCGCG
ATGTTCTTCC AGTGGCGGGC GTCCCGGGGC GGCGCCGAGC AGTGGCACTC GGGGATGGTG
CCGCACGCCG GGCCGGATTC GCGGATCTTC CGCGAGGTGT GCGAACTGGG GAGCGTGCTG
CCCCGCGTGG CGGAGGCGCG CGACGCCGAC GTGGTCGCCG ACGCGGCGCT CACCTGGGAC
CCCGAGTGCT GGTGGGCGCT GGGCAGCCCG TCGCTGCCGG CGCGGGGCGT GGACTACCTG
GAGGCGGCGC GTCAGGTGCA CCGGGTGCTG TGGCGGTCGG GGCGCACGGT GGACATGGTG
CGCCCCGACC GCGACCTGCC GCGGGTGCCG CTGCTGGTGG TGCCCGCCCT GTACCTGCTC
TCGGACGAGG CGGCCGAGCG GTTGGCCCGC TACACCGAGG ACGGCGGAAC GCTGGTGGTG
ACCTTCCTGA GCGGGGCCGC CGACCCGGAC GGGACCGTGC GCACCGGCGG GTACCCGGGC
GCGCTGCGGG ACCTGCTGGG GGTGCGGGTG GAGGAGGTGC ACCCGCTGCT GCCCGGGGAC
GCGGTGGGCG TGGACCTGGG TTCGGTGCGC GAGGAGGTCA CCCTGTGGAG CGAGCACGTC
CACCTGGCGG GGGCCGAGGC GGTGGCCCAC TACGCGGGCG GCCCGCTGGA CGGGCTGCCC
GCGGTGACGC GGCGGCGCCA CGGCGCGGGC GAGGCCTGGT ACCTGTCGGC CCGGCTGTCG
GACCGGGGCC TGGCGCGGCT GCTGGCCGAG GCGGCGGGGA CGGGGCCGTT CCCGGAGCCG
GGGCTGGAGG TGGTGCGCCG GGTGGACCGG GACGGCGCGT GGGTGTTCGT CACCAACCAC
GACAACCGGC CGCGGTGGAT CGACCCGGCC CGGTTCGGGC TGGACGCGGG CGCCCGCGAC
CTGGTGTCGG GCGTTTCGGC GCAGGGGATG ACCCTGCCGG GCGGCGGGGT GGCGGTGTTG
CGCGGCCACC CCGTTGACAA TTAG
 
Protein sequence
MSSPLWFGGD YNPEQWPEEV QAEDVELMRR AGVNLVTVGV FSWALLEPKE GEYRFEWLDR 
VLDRLADAGI GVALATPTAS PPPWFGLAHP DAMPVTADGT RLTHGSRDTY DVCSPAYREA
SVRIARALAE RYSGHPALRL WHVHNEYGTW SHSEHTAGAF RDWLRGRHVD LDRLNAAWTT
AFWSQHYSEW EQIQPPRATQ YLPNPAHVLD FRRFLSDAML DHFLSQRDVL RAVRPDVPVT
TNLAFGDWVP VDPWRWAEHL DLVAVDDYPD RTGQGGAEQT AFAADLARSW AERVPGPGRP
WLLMEQAAGV TYTGEVTRPK APGETARHSL AHVARGSRGA MFFQWRASRG GAEQWHSGMV
PHAGPDSRIF REVCELGSVL PRVAEARDAD VVADAALTWD PECWWALGSP SLPARGVDYL
EAARQVHRVL WRSGRTVDMV RPDRDLPRVP LLVVPALYLL SDEAAERLAR YTEDGGTLVV
TFLSGAADPD GTVRTGGYPG ALRDLLGVRV EEVHPLLPGD AVGVDLGSVR EEVTLWSEHV
HLAGAEAVAH YAGGPLDGLP AVTRRRHGAG EAWYLSARLS DRGLARLLAE AAGTGPFPEP
GLEVVRRVDR DGAWVFVTNH DNRPRWIDPA RFGLDAGARD LVSGVSAQGM TLPGGGVAVL
RGHPVDN