Gene Ndas_4788 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4788 
Symbol 
ID9248671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5674811 
End bp5676037 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content75% 
IMG OID 
Productprotein of unknown function DUF214 
Protein accessionYP_003682678 
Protein GI297563704 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.123096 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGAGTT CGCGGGCGCG GCGGCGCGGG CCGGGCCTCC GACCGGCCAG GTTGTCCGTC 
TCCGACCGGC TGCGGACCGG GGCGAGCGGC CTGCGTGCCC GGCCGACGCG GGTGGTGCTG
TCCGCCCTGG GCATCGCCAT CGGGATCGCG GCCATGGTCG CGGTGGTGGG CGTCTCGGAG
TCCGGCAGGG CCGAGCTGGA CGCCCGGATC GGACGCCTGG GCACCAACAT GCTGACCGTC
GCCCCGGGCA GCGACCTGTT CGGCGGCACC GCCGTGCTCC CGCCCGAGGC CAAGGGCCGG
ATCGACCGTA TGCCGGAGGT GGAGCGCTCC GCCCAGGTGG AGATGGTGAA GGGGGCGGGC
GTCTACCGGA GCGACCTCGT CCCCGAGGGC GAGTCCGGGG GGATCGCCGC GTACGGCGTG
GAACCGGGCC TCCTGGACAC ACTGCGCGCC CGGGTGGACG AAGGCGTGTG GCTGAACCCG
GCCACCACCG ACCACCCGTC GGTCGTGCTG GGGCGGGACG CGGCCGCGAG GCTGGGCGTT
ACCCGGGTCA CCCCGGACAC CCTGGTGCTG GTCGGGGACG AGTACTTCGC CGTCGTCGGC
ATCCTGGACG CGGTGGAACT GGCCCCCGAG CTGGACAACG CGGTGCTGGT CGGCCAGGAG
GTGGCCGAGA GCCTGCTCGG CGCTCGCGGT GAGGCCTCCA CGATCTACGT GCGCCTGGCC
CCCGATCGGG TCGCGGACGC ACGATCGCTG GTCGGGCGCA ACGCCAACCC GGAAAACCCC
AACGAGGTGA GGGTGTCGCG CCCCTCGGAC GCGCTGGAGG CGCAGCGGGC CGCCGACCAG
ACCCTCAACG GGCTGCTGCT GGGACTGGGC GGGATCTCCC TGCTGGTGGG CGGGGTGGGG
GTGGCCAACA CCATGGTCAT CTCGGTCCTG GAACGTCGCG GGGAGATCGG GCTGCGCCGG
GCCCTGGGCG CCACACGCCG CGACATCCGG ACGCAGTTCC TGGTCGAGGC GGTCGTGCTC
TCGGCCCTGG GCGGCGCGGC CGGGAGCGTG CTCGGCGTCC TGACGACGCT GGTCTACGCG
GTCCTGCGGA GCTGGCCGTT CGCCGTGCCC TGGTGGGCGG GGGCCGGGGC CCTGGCGGCG
ACGGTCGTCA TCGGTGCGGT GGCTGGCCTG GTGCCCGCGC TGCGCGCGGC GGCGCAGCAC
CCGACCGAGG CGCTCGGTTC CGCCTGA
 
Protein sequence
MRSSRARRRG PGLRPARLSV SDRLRTGASG LRARPTRVVL SALGIAIGIA AMVAVVGVSE 
SGRAELDARI GRLGTNMLTV APGSDLFGGT AVLPPEAKGR IDRMPEVERS AQVEMVKGAG
VYRSDLVPEG ESGGIAAYGV EPGLLDTLRA RVDEGVWLNP ATTDHPSVVL GRDAAARLGV
TRVTPDTLVL VGDEYFAVVG ILDAVELAPE LDNAVLVGQE VAESLLGARG EASTIYVRLA
PDRVADARSL VGRNANPENP NEVRVSRPSD ALEAQRAADQ TLNGLLLGLG GISLLVGGVG
VANTMVISVL ERRGEIGLRR ALGATRRDIR TQFLVEAVVL SALGGAAGSV LGVLTTLVYA
VLRSWPFAVP WWAGAGALAA TVVIGAVAGL VPALRAAAQH PTEALGSA