Gene Ndas_5097 details


Gene Information

Locus tag: Ndas_5097
Symbol:
ID: 9248987
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 239264
End bp: 240877
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003682984
Protein GI: 297564011
COG category:
COG ID:
TIGRFAM ID:
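
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a stop codon. The record does not state either convention explicitly, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 239264
END_BP = 240877


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming a complete CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the listed Gene Length (1614 bp) and Protein Length (537 aa).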


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00280261
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.375378
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGACTCA ACCAGTGGAC GGACAACCAC GCGGACGACG GACGCGGCGC CGACGGGCGG 
CCGACGCCAC GGCACGGCCG CGCCTCGCTG CTGCGCCTGG CCGCCTGGTT CGAGGAGGCC
GACCCGGACC GCGCCGACGC CCTCGCCGCC GCCTCGTACG CCCTCCATCC GGCGCTCCAC
CTGGAGGGGA GGGTGGACGA GGGTGTCGCC GCGACCACCA GCTGGTGGCA GGCCGACGCC
GACCGCAGCA CCGTCACCGA GCACCCGGGC GCCCGACCGC CCGAGCCGGT GAGCGACCAC
CGCGCCCAGC AGGCGCGCCT GCGCGACGCC GCCGAGTCCT CCGCCCACTG GCGCCGCGCC
GGTGCCGCGC AGATCCGCTC CCTGCTGACC GAGCCCACGG GCCGCCGTGC CCGCCTGGAC
CTGTCCGGCG CCGGGATGGA GGTCCTGATG GAACTCCTCA CCGCGGCCCT GGGATCCGGC
GACGCCAGCA GGCGCCCCAC CTCCGCCGGG GACCTGGAGT TCGCGCTGCG CCTGCACGTC
GTCGCCGCGC CGGGCGCCGA CGTCACCATC CGGGGCGAGG GCGGGGAGCT GACCCTGGAG
GGGCTGCGCC TGCTGGTGAC CCCCTACGAG CAGCACAGCC CCGGTGTCCT CGACCCGCTC
CCGGAGGAGC CGGAGGAGGA CGGGGCGGAC CCCCTCGCGG CCGGTGCCCC GGAGGGAGGG
GACGCTGCCC CGGAGGGGGA GGAGGCCTCC GCCGATTCCC GGACCCCCGT GGACCCGCTC
GGCCCCGGGA CCGACGAGGA CCTCTCCGAC CCGTCCGAAG AGCCGTCCGA GGACCCGCCG
GCCCCCGCCG CCCCCGCCGC CTCCGTAAGC CCGGACGCCC CCGCTGACCC CGAGGCATCG
GTGCTCGCCT CCGTTCCGAC GGCTCCCGAG GACCCTGAGG CTCCGGCAGG CACGTCCGTG
CCCGGCGGCT CCTTGCCCCC GTTCGATCCG CGGGTCCCTG GCGCACCTGA GAGTGCGCAG
ACCCCAGACG ATCCGCTGGC CCCCGCCGAC TCTCCGCACT CTGCCGTCGC GGACCGTTCG
ACGGGCCCGC AAGCCCCTGC TGGCACTGAG GCTCCAACGG CTTCCTCCGC CCCGACAGAC
ACGAAGAACC CCGAGACCCC AGCGGCCCCT TTCGACCCTC GGCTCCCGAC TGCCTCGGGC
CGCTCGGCCA GCCCGGGCAC CCCGTCCGCG CCGTTCGCTC CCCGCAACCC CTTCGCCCCG
GCCGCCTCCG AGGACCCGGA GGCCGCACCG TCCCCGGCCG CACCCCGTAC CCCCCAGACC
CCGGCCGCGT CGACGGAACC CCTCTTCCCC CAGCCCCCGA CGGTCCCGCC GTACTCCGCG
CACTCCGACG CCCCGCCCGC CCCGGAGATC CCCGGGACGC CCGCCGCCCC GTCCTACGCG
GACATCCTGG GGATCCCGGT CCCCTCCGCG ACCCCGGAAC CCTCCCCGGC CCCCGACGCG
GCGCCCCGCC CCGAGGACAC CGAACCCACG GGTAACCTCC CTGGACCGGA CCCGCGAGCC
CCCGGTCCGG ACGACCTGTC CGCGAACCCG CCCGAGAAGC CATCGTGGGA GTGA
 
Protein sequence
MGLNQWTDNH ADDGRGADGR PTPRHGRASL LRLAAWFEEA DPDRADALAA ASYALHPALH 
LEGRVDEGVA ATTSWWQADA DRSTVTEHPG ARPPEPVSDH RAQQARLRDA AESSAHWRRA
GAAQIRSLLT EPTGRRARLD LSGAGMEVLM ELLTAALGSG DASRRPTSAG DLEFALRLHV
VAAPGADVTI RGEGGELTLE GLRLLVTPYE QHSPGVLDPL PEEPEEDGAD PLAAGAPEGG
DAAPEGEEAS ADSRTPVDPL GPGTDEDLSD PSEEPSEDPP APAAPAASVS PDAPADPEAS
VLASVPTAPE DPEAPAGTSV PGGSLPPFDP RVPGAPESAQ TPDDPLAPAD SPHSAVADRS
TGPQAPAGTE APTASSAPTD TKNPETPAAP FDPRLPTASG RSASPGTPSA PFAPRNPFAP
AASEDPEAAP SPAAPRTPQT PAASTEPLFP QPPTVPPYSA HSDAPPAPEI PGTPAAPSYA
DILGIPVPSA TPEPSPAPDA APRPEDTEPT GNLPGPDPRA PGPDDLSANP PEKPSWE
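
The protein sequence follows from the gene sequence under translation table 11, in which an initiating GTG is read as methionine. The sketch below illustrates this on the first block of the gene sequence. It builds the codon table from the compact NCBI-style amino acid string, and `START_CODONS` is deliberately only a subset of the table 11 initiators. The function name `translate_cds` is hypothetical, not part of any library.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
# Table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: only a subset of the table 11 start codons is listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a recognized first codon as M and stopping at '*'."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # initiator codon is translated as methionine
        protein.append(residue)
    return "".join(protein)


first_blocks = "GTGGGACTCA ACCAGTGGAC GGACAACCAC"
print(translate_cds(first_blocks))  # MGLNQWTDNH
```

The result matches the first ten residues of the protein sequence listed above. Passing the full gene sequence should reproduce the whole protein, provided the sequence contains only A, C, G and T.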