Gene Ndas_2986 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2986 
Symbol 
ID9246839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3567469 
End bp3569094 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content80% 
IMG OID 
Productprotein of unknown function DUF324 
Protein accessionYP_003680902 
Protein GI297561928 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0321062 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.494788 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGGCG AGCCAGCCCC GCCCGGGCTG CTGTGGGAGG TCACCCTGCG GCTGTGCCTG 
CTCTCCGACA CCCACGTCGG GGCGGCCCGG GCCAGGCCCC GCCACGCCGC CGGAAGCGAC
GCGGACCTGC ACGTGGACCG CGACCCGGTC ACCGGCGCGC CCCGTCTGCG CGCCACCACC
CTGGCCGGGC TGCTGCGCCA CGAACTCGCC GCCCGCACGG GCGACCCCGA CGACGTCCGC
GCCCTCATGG GCTCGGCGGA GTCCGAGGCC GCGCCGGGCG GGGAGGGGGC CGCCGCGAGC
GCGCTGGACG TGGACGACGC CCGCGCCGAA CTCCCCGAGG ACACAGCGGT AGCGGTCCGC
ACCGGCATCC GGGTCGACCC GGCCGCGGGC ACCGTCCAGC CGGGCCGCAC GTGGCGGTGG
GAGATCCTGC CCGCGGGTAC CGTGTTCACC GCCCACCTGC GCCTGCACGT GCCCGCACCG
GCCGACGAGG CCCGCCTGCT CACCCTGCTG CTCCTGGCCT GCGACGGACT CTCCGGGCCC
GGCGGGTCCG GCCCCGGCAT ACGCGTGGGC GGGCGCACCG GCCGCGGCTA CGGCGCCGTC
CGCGCCACCC ACTGGTCGGC GCGCCGCCAC GACCTCACCG ACGAGCGCGG CTGGCTCGCC
TACCACGCGC GCTCCTGGGC CCGGCGCTGG GAGGAGGGCG CCGACGCGCT CGCCGACGCC
CCCGCCGACC TGGCCGCCGC CCTGACCGGG GCCCTGCGCG CCTGCGGGCG CGGGGCCACC
GCCGCCCACG CGCTCGCCCG CGCGCACCGG CCCGACCGCC GCCACCGCGC CGAACTCCAC
CTGACCCTGG CCGTCGCCGA GCAGCCCGAT CCCACCGCCC CGCCGCCGCG CGACCCCAGG
CCCGGCCTGC TCATGGTCGG CGACGCCCCC GCCCCCGAAC GCCTGGGCGA GGCGGACCGG
GCACACCGCC ACCGCCCCGC CGTCACCGAC CCGGACAAGG CCGCCGTCCA CCCCGCCCCG
GTCCTGGGCG ACACCGCGCT GTTCGCCCTG TTCAAGAGGA TCGGCGGCCG ACTGGTCCGG
GACGCGGCCG AACACCTGGG CGCCGGGCCG GACCGGTGGC GCGACTGGCA CGACCACTGG
TGGGGCGCCG ACACCGACCG GCGCGGCCTC CCGCGCCCCG CCCGGATCCG GCTGCGCACC
GTCCCCGTGC TCACCGGCGG AGCACCCCTG ACCGCCACCC GGCTGACCGT GGACTCCCTC
TTCGGCGACG CCGTGGACGG CCGCCTGTTC ACCACCGACC TGCACTGCGG CGGCAGCGCC
GAGGCGGTCC TGGACGTGCG CGAGCCCGAC GACGCCGTCC GCGGTCTGCT CGCCCTCCTC
GTGCGGGAGC TGGCCACGGT ACCCTTCGAC ACCCTCGGCG CGGGCGCCGG AACCGGGAAC
GGGCGCCTGA CCGCCACCCG CGCCCTCCTG ACCACCCACC CCCCGGGCGG AGGGCCACCG
GACACGGTGG ACCTGCTCAC CGCCCTGTTC GCACCCGACA GCGCCGACGC GGCCACCGCC
CGCGGCTGGC TGGCCGCCCT GCACGCCGCG CTCGCCCCCG CGCCCACCAC GGAGGAGCCC
AGGTGA
 
Protein sequence
MSGEPAPPGL LWEVTLRLCL LSDTHVGAAR ARPRHAAGSD ADLHVDRDPV TGAPRLRATT 
LAGLLRHELA ARTGDPDDVR ALMGSAESEA APGGEGAAAS ALDVDDARAE LPEDTAVAVR
TGIRVDPAAG TVQPGRTWRW EILPAGTVFT AHLRLHVPAP ADEARLLTLL LLACDGLSGP
GGSGPGIRVG GRTGRGYGAV RATHWSARRH DLTDERGWLA YHARSWARRW EEGADALADA
PADLAAALTG ALRACGRGAT AAHALARAHR PDRRHRAELH LTLAVAEQPD PTAPPPRDPR
PGLLMVGDAP APERLGEADR AHRHRPAVTD PDKAAVHPAP VLGDTALFAL FKRIGGRLVR
DAAEHLGAGP DRWRDWHDHW WGADTDRRGL PRPARIRLRT VPVLTGGAPL TATRLTVDSL
FGDAVDGRLF TTDLHCGGSA EAVLDVREPD DAVRGLLALL VRELATVPFD TLGAGAGTGN
GRLTATRALL TTHPPGGGPP DTVDLLTALF APDSADAATA RGWLAALHAA LAPAPTTEEP
R