Gene Ndas_3748 details


Gene Information

Locus tag: Ndas_3748
Symbol:
ID: 9247617
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4500844
End bp: 4502058
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003681652
Protein GI: 297562678
COG category:
COG ID:
TIGRFAM ID:
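
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4500844
end_bp = 4502058

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # listed in the record as 1215 bp
print(protein_length, "aa")   # listed in the record as 404 aa
```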


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGCCA CCGCTCCCCC GGCCGCGCCC CCGCGCTCCC CCGGCCGCGC GCCGTCCCGG 
GAGGACTCGC GCGTCCTGCG GGTGTGGCGC TCGGTCCGGG TTCCGGCAGC CGTGGTGGCC
GCGCTCGTCA CGGTCTCGGT GCTGCTGTCG CTGGGCAGCG AGCAGTTCCC CACCGGCCAC
CTGGAGCCCG GTTCCATCGA CCCGGACGGC ACGCGGGCGC TGGTGAACGT GCTGGAGGAG
GACCGCGACG TGCACGTGGT GCGCTCCTCC GCCGCCGCGG AGGAGGCCGT GGCCGACGCC
GGGGACGACG CCGTGCTCGC GGTCTTCCTG GACCACCGCC TGCTCCCCGA GGAGCTGGAC
TCGCTGGCCG CGCTCGACGT GGACACCGTC CTGGTGCAGC CGTCCACGCG GTCCCTGGAG
GCGTTCGCCC CCGGGGTGAC GATGACCGGC CGGGAGGAGC CCGAGGGGTT CCCCACGCCG
GAGTCCCCCT ACGCCCCCGA GTGCGGGCTG TCGGCCGCCG AGGCCGCGGG CGAGGCCTAC
GTCGCCGGTG AGCTGTACAC GGCCGGTTCC GGCGCGGACG CCGTGGGCTG CTACCCCGGT
GGCGGCGGCG ACGCCCTGGT CCGGGTGGAG CGGGACGGGG CCGCCACGAC CGTGCTGGGC
ACCGGCAGGC CGCTGACCAA CACCGCGCTC TCCGCCGGCG GCAACGCCGC GCTGGCGATG
AACCTCCTGG CCGCCGAGGA CGTGGTGTGG CTGCGCCCCG ACCCGCCCCA GCAGGAGGGC
GGCTCCGGGC TGTGGCAGCT GCTGCCGCTG GGCCTGCGCT GGTCCCTGGT GCCGCTGGTG
GCCGCGTTGG CGCTGCTCGC CCTGTGGCAG GGGCGCCGGA TGGGCGCCCT GGTGCCCGAG
TCGCTGCCCG TGGTGGTGCG CGCCTCGGAG ACCACCGAGG GGCGTGCGGG ACTGTACCAG
TCGCGCAGGG CCCGGGACCG GGTCGCGGCC GCGCTGCGGT CGGGGTTCGT GGACCGGGTC
GCACCCAAGC TCGGGCTGGG CGCGGACGCC GCGCCCGACA CGGTCGTGGC GGCGGTCGCC
TCGCGGACCG GTGACGACCC CGCCCACCTG CGGGCCCTGC TCCACCCCGG GCAGCCCGAC
CCGTACGCGG GCGACGACGA CATGCTGGTC AGGCTCGCCG ACGAACTCGA CGAGCGCGCC
CGGAGGCTCC GGTGA
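
A GC fraction like the one listed in the Gene Information fields can be computed directly from a gene sequence printed in 10-base blocks. The sketch below is one simple way to do it, assuming GC content means the fraction of G and C bases over all bases. The `gc_content` helper is hypothetical, and the record does not state how its own figure was derived or rounded.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence.

    Whitespace (spaces and newlines between 10-base blocks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Toy input for illustration only, not the full gene sequence.
print(gc_content("ATGC GGCC"))
```

To apply it to this gene, pass the full gene sequence text as a single string; the blocks and line breaks do not need to be removed first.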
 
Protein sequence
MTATAPPAAP PRSPGRAPSR EDSRVLRVWR SVRVPAAVVA ALVTVSVLLS LGSEQFPTGH 
LEPGSIDPDG TRALVNVLEE DRDVHVVRSS AAAEEAVADA GDDAVLAVFL DHRLLPEELD
SLAALDVDTV LVQPSTRSLE AFAPGVTMTG REEPEGFPTP ESPYAPECGL SAAEAAGEAY
VAGELYTAGS GADAVGCYPG GGGDALVRVE RDGAATTVLG TGRPLTNTAL SAGGNAALAM
NLLAAEDVVW LRPDPPQQEG GSGLWQLLPL GLRWSLVPLV AALALLALWQ GRRMGALVPE
SLPVVVRASE TTEGRAGLYQ SRRARDRVAA ALRSGFVDRV APKLGLGADA APDTVVAAVA
SRTGDDPAHL RALLHPGQPD PYAGDDDMLV RLADELDERA RRLR
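
The protein sequence above corresponds to the gene sequence read codon by codon. The following minimal Python sketch shows that mapping, assuming the amino-acid assignments of translation table 11, which are the same as those of the standard genetic code (the two tables differ in which codons may act as start codons). The `translate` helper and `CODON_TABLE` name are hypothetical; this is not the tool used to produce the record.

```python
from itertools import product

# Codons enumerated in TCAG order for each position, paired with the
# standard amino-acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, dropping a trailing stop codon."""
    dna = "".join(sequence.split()).upper()
    # Only complete codons are translated; leftover bases are ignored.
    protein = "".join(
        CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3)
    )
    return protein.rstrip("*")


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGACGGCCA CCGCTCCCCC GGCCGCGCCC"))  # MTATAPPAAP
```

The printed residues match the first block of the protein sequence listed above.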