Gene Ndas_3763 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3763 
Symbol 
ID9247632 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4517921 
End bp4519669 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content77% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003681667 
Protein GI297562693 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGGGCG GGGGCGCCGC GCCGGGGATC AGCAGACGAG GACACCGGGA CGGCCGGGCC 
GACCAGGCCC GCCAGGCGTC CGGCACCGAG CGGGAGGACC AGACCGTGCG ACTGATCGTG
GAGAACAGGT TCGCGCTGTT CGGGCTGGTC TCGCTGGCTT TGGCCGCGCT GTTCGGCGTC
GCGTTCGCGA CCGGGCCGAT CACCGCGGAG CTGGGCGTCA CCGCCCCCGG GACCGTGCGG
CCCGACCACG CGGTGCGGGT GTGCCCCGCC CCGCACGAGT CGGGGGACAG CTCCGTGGCC
GCGTTCGCTC CGAGGGTGAG CCGCGACGAC GAGGGCGAGC TGTGGGCCGA GTCGGTTCCC
GAGGCCCCCG AGGAGGACGC CGAGGACACC GGGGACGACG AGACCGGTGG GGACGGCGGG
AACGAGGAGG ACTCCGGGAA CGGTGAGGCC GGTGGGGGCG GCGGCGCGGA CGGTGCCCGG
GGCGGCACCG TCGGCGAGGA GCTGACCGAG CCCGGCCGCG TGTGGAACAC CGACACCGCC
GGGACCGAAG CCCCCACCGC CGTGCGCGCC GAGGGCTCCC TGGCCTCCGG ACTCGACACC
GCCCAGACCA CCCTCTCCGA CGGCTCGGCC ACCGAGGTCC GCTGCCTCGA ACCCTCCGTC
GGAACCTGGT TCGCCCTCCC CGGAGGCGAC GGCATCGAGG GCATGCGACT CGACGCGCTC
ACCGTGCACC TGGCCAACCC CGAGGACTCC CGGGCCACGG TCAGCGTGGA CGTCTACACC
GAGGGCGGCC CCTCCTCCTC CGAGGAGAGC AGGGGCATCG CCCTGCCCGC GGGGACGGCG
ACCGAGCTGG ACCTGACCGG ACTGGTCGGC GGAACCAGCG CCGTCGGAGT CCACGTGCGC
ACGAGCACGG GACGGGTCGC GGCCTCCCTG CTCGCCGAGC ACTCCTCCGG CACCGCCGAC
TGGGTCCCGC CCACGGCCGC GCCCGCCCGG GAGCACGTGA TCCCCGGCGT TCCCGGCGGC
GACGGCCGTC GCCGCCTGCA CGTGGCCGCG CCGGGCGACG AGCCCGTCCA GGTGCGCGTG
TACACCGTCA CCCCGGCCCC GGAGGCCGGG ACGGACGAGA CGCGGGAGGA GGACGGGGAG
GCGCAGGGCG CCACCGCCGA CGACCCGCTG ACCTTCAGCG TGCCCCCGGC CGCCTCCGCC
TGGCTCAGCC TGGAGACCGT CCTGGCCGGG GAGCCCGGCG CGGTCGTGGT GCGGGCGGAC
GCGCCCGTGG TCGCCGGTGT CGCCGCCGAG GCGGTGACCG GGGAGGGGGA CGACGTCGAG
GTGGTCGAGG CGGCCCACAC CTCCGCGGTG CCGCCGCTCG GCTTCCCGCT GGACACCACC
GCGGTCCTGC CCGACGTCCC CGAGGGCGCC GACACCGAGC TGCTCCTGAC CGCCGTCGGG
GGCGACGCCA CCCTCATGGC CACCCCCATC GGCGCCGACG CCACCCAGGG CGACGCCGTG
CGCGTGCGGG TCGCCGCCGG GACCACCACC GTGTTCGGCG GGGACGACGG CTGGCAGGCG
CCTCCGGGCA CCGCGCCCGA GGACGGCTAC GCGGTCCGCC TGGAGGTCCT GGACGGTTCC
GAGCCCGTCC ACGTCGCCCG CGTGCTGCGC GGCGGCGGGG ACGGGCTGGG TGTGCTGCCG
GTGACGCCCG CGCCGGTGCG GATAGAACTC CCGGTGGTAC GCGACAGCAT GGTGGGGGCG
GTCCCCTAG
 
Protein sequence
MPGGGAAPGI SRRGHRDGRA DQARQASGTE REDQTVRLIV ENRFALFGLV SLALAALFGV 
AFATGPITAE LGVTAPGTVR PDHAVRVCPA PHESGDSSVA AFAPRVSRDD EGELWAESVP
EAPEEDAEDT GDDETGGDGG NEEDSGNGEA GGGGGADGAR GGTVGEELTE PGRVWNTDTA
GTEAPTAVRA EGSLASGLDT AQTTLSDGSA TEVRCLEPSV GTWFALPGGD GIEGMRLDAL
TVHLANPEDS RATVSVDVYT EGGPSSSEES RGIALPAGTA TELDLTGLVG GTSAVGVHVR
TSTGRVAASL LAEHSSGTAD WVPPTAAPAR EHVIPGVPGG DGRRRLHVAA PGDEPVQVRV
YTVTPAPEAG TDETREEDGE AQGATADDPL TFSVPPAASA WLSLETVLAG EPGAVVVRAD
APVVAGVAAE AVTGEGDDVE VVEAAHTSAV PPLGFPLDTT AVLPDVPEGA DTELLLTAVG
GDATLMATPI GADATQGDAV RVRVAAGTTT VFGGDDGWQA PPGTAPEDGY AVRLEVLDGS
EPVHVARVLR GGGDGLGVLP VTPAPVRIEL PVVRDSMVGA VP