Gene Ndas_1159 details

Gene Information

Locus tag: Ndas_1159
Symbol:
ID: 9245009
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1413019
End bp: 1414674
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: protein of unknown function DUF187
Protein accession: YP_003679106
Protein GI: 297560132
COG category:
COG ID:
TIGRFAM ID:
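
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical; the values are copied from the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1413019
end_bp = 1414674
gene_length_bp = 1656
protein_length_aa = 551

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS is read in whole codons and the last codon is a stop codon,
# which does not contribute a residue to the protein.
codons = span // 3
coding_codons = codons - 1

print(span, codons, coding_codons)  # prints: 1656 552 551
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.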


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACCCCTG AACGACGAGC CGGCCTCTCA CCGCGCTGGC GCTCCCGGTG GGTGGCCGCG 
GCCGCCGCGT CGGGGCTCAT GATCTCCGGA TACGCCCTCG TCGGCGCGGC CCACTCCCCC
ACCGGGTCCT TTCCCGGCCC CGGCGCGGCC GGTTGCGAGG CCGCCCAGGA CAAGCGGCAG
ATGCGCGGGG CGTGGCTGAC CACGGTCGGC AACATCGACT GGCCCTCCGA GCCGGGCCTG
TCCGCCGAGG ACCAGAAGGC GGAGATGGAC CAGCGCCTGG ACGAGGCGGT GGACCTCGGC
CTGAACACCG TGTTCCTGCA CGTGCGGCCC ACCGCCGACG CCGTCTACGA GTCGGACCTG
GAGCCGTGGT CGAAGTACCT CACCGGCGAG CAGGGCGGCG ACCCCGGCTA CGACCCGCTG
GAGTACGCGG TGGCCGGAGC GCACGAGCGC GGCCTGGAGC TGCACGCCTG GTTCAACCCC
TACCGGGTCG GCATGGACTC CGACATCGAG GAGCTGGCCG AGGACCACCC GGTCAGGGAG
CACCCCGACT GGCTGGTGCG CTACGGCGGC GAGGGCTTCC TGGACCCGGG CAGGCCCGAG
GTCCAGGAGT GGGTGACCCG CGTGATCATG GACGTGGTGG AGCGCTACGA CATCGACGGC
GTGCACTTCG ACGACTTCTT CTACCCCTAC CCCAAGGACG GCGAGGAGTT CGACGACGAC
CGGACGTGGG AGGAGTACGG CGACGGCTTC GAGGACCGCG AGGACTGGCG GCGCGACAAC
GTCAACGGCT TCGTCAGCGG TGTGCACGAG CGCATCGAGG AGGCCAAGCC CTGGGTGCGC
TTCGGGATCT CCCCGTTCGG CATCTGGCGC AACGCCGAGA ACGACCCCGC CGGGTCCGAC
ACCTCGGGCC TGGAGTCCTA CGAGGCCCAG CACGCCGACA CCCGCGCCTG GATCCGGGAG
GGGATGGTCG ACTACGTCGT CCCGCAGCTG TACTGGGAGC GGGGCTTCGA CGCCGCCGAC
TACGAGGAGC TGCTGCCGTG GTGGGCCGAG CAGGTCGAGG GCACGGACGT GGACCTGTAC
GTCGGCCAGG GCGCGTACCG GGTCGGTGAC CGCAACTGGA CCGACGAGGA CGCGCTGAGC
ACCCAGCTGG ACTACTCCTC CGACCACCCC GAGGTCGACG GCGACGTCTA CTTCTCCTTC
AAGTCCCTGA CAGGCGTGGC CGAGGAGGCC TACGCCCACC TGGCCGACGA GCACTACGGC
GACCCCGCCC TGCCGCCCCT GGCGGGAGGG GACCGCGGAG GCCGGTCCCT GGCGGGCGCC
GTGGAGGACG TGACCGCCGA GGTCGCGGAC GAGCACACCG CGGTGGAGTG GGAGCGGGTG
GAGGACGCGC GCTTCTACGC CGTCTACCGC CTGGACGCGC AGGAGGCCGC GCGGGCGGAC
TCGGGCGACC CGGAGGAGTA CTGCGGCGTG CTCTCCTCCG ACAACCTCGT GGGCGTGACC
GGCGGGACCT CGCTGGAGGA CTCCGGCCAC ACCGCCGAGG ACGCCGCGAA GGCCGAGGAG
AACGGTGAGG AGTCCGGTTC CGCGTACGTG GTGACGGCGC TGGACGACTA CAGGGTCGAG
GGGCCCGTGA GCGAGGTCGC CGACCCGCGC GGCTGA
 
Protein sequence
MTPERRAGLS PRWRSRWVAA AAASGLMISG YALVGAAHSP TGSFPGPGAA GCEAAQDKRQ 
MRGAWLTTVG NIDWPSEPGL SAEDQKAEMD QRLDEAVDLG LNTVFLHVRP TADAVYESDL
EPWSKYLTGE QGGDPGYDPL EYAVAGAHER GLELHAWFNP YRVGMDSDIE ELAEDHPVRE
HPDWLVRYGG EGFLDPGRPE VQEWVTRVIM DVVERYDIDG VHFDDFFYPY PKDGEEFDDD
RTWEEYGDGF EDREDWRRDN VNGFVSGVHE RIEEAKPWVR FGISPFGIWR NAENDPAGSD
TSGLESYEAQ HADTRAWIRE GMVDYVVPQL YWERGFDAAD YEELLPWWAE QVEGTDVDLY
VGQGAYRVGD RNWTDEDALS TQLDYSSDHP EVDGDVYFSF KSLTGVAEEA YAHLADEHYG
DPALPPLAGG DRGGRSLAGA VEDVTAEVAD EHTAVEWERV EDARFYAVYR LDAQEAARAD
SGDPEEYCGV LSSDNLVGVT GGTSLEDSGH TAEDAAKAEE NGEESGSAYV VTALDDYRVE
GPVSEVADPR G
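
The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, in which TTG is one of the recognized alternative initiation codons and an initiation codon is read as methionine. The following is a minimal sketch of that translation, not the tool used to produce this record: the `translate` helper and the constant names are hypothetical, the codon table is the standard TCAG-ordered amino-acid string, and the start-codon set is the one listed by NCBI for table 11.

```python
# Standard codon table in TCAG order (amino-acid assignments of NCBI table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Initiation codons listed for table 11; at the first position they are read as Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS fragment, ignoring whitespace and stopping at the first stop codon.

    Assumption: the input begins in frame; if its first codon is a table 11
    initiation codon, it is emitted as 'M'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence from this record (60 nt, 20 codons).
first_line = "TTGACCCCTG AACGACGAGC CGGCCTCTCA CCGCGCTGGC GCTCCCGGTG GGTGGCCGCG"
print(translate(first_line))  # prints: MTPERRAGLSPRWRSRWVAA
```

The output corresponds to the first two blocks of the protein sequence above (MTPERRAGLS PRWRSRWVAA). As an internal codon TTG would be read as leucine; it yields M here only because it is the initiation codon.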