Gene Ndas_2468 details


Gene Information

Locus tag: Ndas_2468
Symbol:
ID: 9246318
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2929785
End bp: 2930894
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: Luciferase-like, subgroup
Protein accession: YP_003680394
Protein GI: 297561420
COG category:
COG ID:
TIGRFAM ID:
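
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based, inclusive positions and that the last codon of the CDS is a stop codon; the function names are illustrative and are not part of the source database.

```python
# Values copied from the record above. The 1-based, inclusive coordinate
# convention is an assumption; it is consistent with the reported lengths.
START_BP = 2929785
END_BP = 2930894


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1110 369
```

Both results agree with the Gene Length and Protein Length fields of the record.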


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.874778
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGTTCG GAATCTTCTC GGTGAGCGAC GTGACCACCG ACCCCACCAC CGGCCGTACG 
CCGACCGAGG CCGAGCGCGT CAAGGCGATG GTGACCATCG CGCTCAAGGC CGAGGAGGTC
GGCCTGGACG TCTTCGCCAC CGGCGAGCAC CACAACCCGC CCTTCGTGGC CTCCTCACCG
ACCACGATGC TCGGCTACAT CGCGGCCCGA ACGGACAGGC TCATCCTGTC CACCTCCACC
ACGCTGATCA CCACCAACGA CCCGGTCAAG ATCGCCGAGG ACTTCGCCAT GCTCCAGCAC
CTGGCCGACG GCCGGGTGGA CCTGATGATG GGCCGCGGCA ACACCGGCCC CGTCTACCCC
TGGTTCGGCC AGGACATCCG CCAGGGCATC CCGCTGGCGC TGGAGAACTA CAACCTGCTG
CACCGGCTCT GGCGCGAGGA CGTGGTGGAC TGGGAGGGCA AGTTCCGCAC CCCCCTGCAG
GGCTTCACCT CCACCCCCCG CCCCCTGGAC GGGGTGCCCC CCTTCGTCTG GCACGGCTCC
ATCCGCAGCC CCGAGATCGC CGAGCAGGCC GCCTTCTACG GTGACGGCTT CTTCCACAAC
AACATCTTCT GGCCCGCCAC GCACACCAAG AAGCTCATCT CGCTCTACCG CCGCCGCTTC
GAGCACTACG GCCACGGCAG GGCCGAACAG GCCGTCGTCG GCCTGGGCGG ACAGGTGTTC
ATGCGCAAGA ACTCCCAGGA CGCGGTGAGG GAGTTCCGCC CCTACTTCGA CAACGCGCCG
GTCTACGGCG GCGGCCCCTC CCTGGAGGAG TTCACCCGCG AGACCCCGCT GACCGTCGGC
AGCCCGCAGG AGGTCATCGA CCGCACGCTG ACCTTCCGCG ACTCCTTCGG CGACTACCAG
CGCCAGCTCT TCCTCATGGA CCACGCGGGC CTGCCGCTCA AGACGGTCCT GGAGCAGCTG
GACCTGCTCG GCGAGGAGGT CGTGCCGGTG CTGCGCAAGG AGTTCGCCGC CGGGCGGCCC
GCGCACGTGC CCGACGCGCC CACCCACGCC TCGCTGCTGG CCGCCAAGGG GGACGCGGAC
ACCGCCTCCG GCCGGGACGG CCAGCAGTAG
 
Protein sequence
MQFGIFSVSD VTTDPTTGRT PTEAERVKAM VTIALKAEEV GLDVFATGEH HNPPFVASSP 
TTMLGYIAAR TDRLILSTST TLITTNDPVK IAEDFAMLQH LADGRVDLMM GRGNTGPVYP
WFGQDIRQGI PLALENYNLL HRLWREDVVD WEGKFRTPLQ GFTSTPRPLD GVPPFVWHGS
IRSPEIAEQA AFYGDGFFHN NIFWPATHTK KLISLYRRRF EHYGHGRAEQ AVVGLGGQVF
MRKNSQDAVR EFRPYFDNAP VYGGGPSLEE FTRETPLTVG SPQEVIDRTL TFRDSFGDYQ
RQLFLMDHAG LPLKTVLEQL DLLGEEVVPV LRKEFAAGRP AHVPDAPTHA SLLAAKGDAD
TASGRDGQQ
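
The gene and protein sequences can be checked against each other by translating the nucleotide sequence. The sketch below is one way to do that in plain Python; the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. It uses the standard codon assignments, which translation table 11 shares for internal codons. Table 11 also allows alternative start codons that are read as M at the first position; that case is not handled here and is not needed for this gene, which starts with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence above (60 nt, 20 codons).
FIRST_ROW = "ATGCAGTTCG GAATCTTCTC GGTGAGCGAC GTGACCACCG ACCCCACCAC CGGCCGTACG"
print(translate(FIRST_ROW))  # MQFGIFSVSDVTTDPTTGRT
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows comparison with the complete protein sequence and with the reported GC content.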