Gene Ndas_1100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1100 
Symbol 
ID9244946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1350284 
End bp1351387 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content69% 
IMG OID 
ProductLuciferase-like, subgroup 
Protein accessionYP_003679048 
Protein GI297560074 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.43762 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGTTCG GGATCTTCAC CGTCGGTGAC GTGACCACCG ACCCCACCAC CGGCCGTACG 
CCGACCGAGG CCGAGCGCGT CAAGGCGATG GTGACCATCG CGCTCAAGGC CGAGGAGGTC
GGCCTGGACG TCTTCGCCAC CGGCGAGCAC CACAACCCGC CCTTCGTGGC CTCCTCACCG
ACCACGATGC TCGGCTACAT CGCGGCCAAA ACGGACAGGC TCATCCTGTC CACCTCCACC
ACGCTGATCA CCACCAACGA CCCGGTCAAG ATCGCCGAGG ACTTCGCCAT GCTCCAGCAC
CTGGCCGACG GCCGGGTGGA CCTGATGATG GGCCGCGGCA ACACCGGCCC CGTCTACCCC
TGGTTCGGCC AGGACATCCG CCAGGGCATC CCGCTGGCGC TGGAGAACTA CAACCTGCTG
CACCGGCTCT GGCGCGAGGA CGTGGTGGAC TGGGAGGGCA AGTTCCGCAC CCCCCTGCAG
GGCTTCACCT CCACCCCCCG CCCCCTGGAC GGGGTGCCCC CCTTCGTCTG GCACGGCTCC
ATCCGCAGCC CCGAGATCGC CGAGCAGGCC GCCTTCTACG GTGACGGCTT CTTCCACAAC
AACATCTTCT GGCCCGCCAC GCACACCAAG AAGCTCATCT CGCTCTACCG CCGCCGCTTC
GAGCACTACG GCCACGGCAG GGCCGAACAG GCCGTCGTCG GCCTGGGCGG ACAGGTGTTC
ATGCGCAAGA ACTCCCAGGA CGCGGTGAGG GAGTTCCGCC CCTACTTCGA CCACCACCCC
CTGATGGGCG GCGGACCGTC GCTGGAGGAG TACATGGACC AGACCCCGCT GACCGTCGGC
AGCCCCCAGC AGGTCATCGA CAGGACCCTC GCCTTCCGTG ACAGCTTCGG CCACTACCAG
CGCCAGCTGT TCAACGTCGA CGGCGTCGGG ACACCCCTGA AGACGGTCCT GGAGCAGATC
GACGTCCTCG GCGAGGAGGT CGTGCCGGTG CTGCGCGAGG AGTTCGCCGC CGGGCGGCCC
GCGCACGTGC CCGACGCGCC CACCCACGCC TCGCTGCTCT CCGCCCGCGA CACCGGAAAC
GCCTCCGCGA CAGCGACGGG CTGA
 
Protein sequence
MQFGIFTVGD VTTDPTTGRT PTEAERVKAM VTIALKAEEV GLDVFATGEH HNPPFVASSP 
TTMLGYIAAK TDRLILSTST TLITTNDPVK IAEDFAMLQH LADGRVDLMM GRGNTGPVYP
WFGQDIRQGI PLALENYNLL HRLWREDVVD WEGKFRTPLQ GFTSTPRPLD GVPPFVWHGS
IRSPEIAEQA AFYGDGFFHN NIFWPATHTK KLISLYRRRF EHYGHGRAEQ AVVGLGGQVF
MRKNSQDAVR EFRPYFDHHP LMGGGPSLEE YMDQTPLTVG SPQQVIDRTL AFRDSFGHYQ
RQLFNVDGVG TPLKTVLEQI DVLGEEVVPV LREEFAAGRP AHVPDAPTHA SLLSARDTGN
ASATATG