Gene Ndas_4221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4221 
Symbol 
ID9248095 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5040381 
End bp5041421 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content71% 
IMG OID 
ProductAlcohol dehydrogenase GroES domain protein 
Protein accessionYP_003682119 
Protein GI297563145 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.41128 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGTCG CCCGCTTCTA CGCCCCCGGT GACATCCGAC TGGAGCAGGC CCCCGAGCCG 
ACCGCCGGAC CCGGACAGCT CAAGATCGCC GTCGTCAACT GCTCCACGTG CGGCACCGAC
GTGAAGATCT CCCGGCACGG ACACCACCAC ATCCGTCCGC CCCGCGTGAT CGGCCACGAG
ATCGCCGGGC GGATCGTCGA GGTGGGCGAG GGCGTCACCG GCTGGGCCGA GGGCGACCGC
GTCCAGGTCA TCGCCGCCAT CCCCTGCGGC ACCTGCGTGG AGTGCTCCGA CGGCCGGTTC
ACCGTGTGCT CGCGCCAGGA GTCCATGGGT TACCACTACG ACGGCGGCTT CGCCGAGTAC
ATGATCATCC CCGAGTCGGT CCTGGCCGTG GACGGGGTCA ACCGCGTCCC CGACAACATC
GACCTGGCCG AGGCCTCCGT CGCCGAGCCG CTGGCGTGCG TGCTCAACGG CCAGGAGATC
GCCGGAGTCG GCGAGGGCGA CACGGTCGTG GTCATGGGCG CCGGGCCGAT CGGCTGCCTG
CACGTCCGGC TGGCCCGTGC GCGCGGCGCC GCGAAGGTCT ACCTGGTGGA CCTCAACCGG
GGCCGCCTGG ACATGTCCGC CGACATCGTC CAGCCCGACG CGTCGATCTG CGGCGCCGAG
ACCGACGCCG TGGAGGAGGT GCTCCGCCTG ACCGACGGCC GGGGCGCCGA CGTCGTCATC
ACCGCCGCCG CCTCCGGGCG CGCCCAGGAG GACGCGCTGC GCATGGTCTC GCGCAGCGGC
CGGATCAGCT TCTTCGGCGG CCTGCCCAAG GACGCGCCGA TCATCCAGCT GGACTCCAAC
GCCGTGCACT ACCGGGAGAT CTCGATCTTC GGCGCCAACG GCTCCAGCCC CGAGCACAAC
CGCCGCGCCC TGGAGCTGAT CTCCTCCGGC GCCGTGCCGG TGGCGGACCT GATCACCGAG
CGGATGTCCC TGTCCGACGT GCACAAGGCC ATCGAGACGG TGGCCTCGGG CACCGCGATC
AAGGTGACCA TCCAGCCGTA G
 
Protein sequence
MLVARFYAPG DIRLEQAPEP TAGPGQLKIA VVNCSTCGTD VKISRHGHHH IRPPRVIGHE 
IAGRIVEVGE GVTGWAEGDR VQVIAAIPCG TCVECSDGRF TVCSRQESMG YHYDGGFAEY
MIIPESVLAV DGVNRVPDNI DLAEASVAEP LACVLNGQEI AGVGEGDTVV VMGAGPIGCL
HVRLARARGA AKVYLVDLNR GRLDMSADIV QPDASICGAE TDAVEEVLRL TDGRGADVVI
TAAASGRAQE DALRMVSRSG RISFFGGLPK DAPIIQLDSN AVHYREISIF GANGSSPEHN
RRALELISSG AVPVADLITE RMSLSDVHKA IETVASGTAI KVTIQP