Gene Ndas_2247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2247 
Symbol 
ID9246097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2689943 
End bp2691037 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content76% 
IMG OID 
ProductAlcohol dehydrogenase zinc-binding domain protein 
Protein accessionYP_003680175 
Protein GI297561201 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.792853 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACGA CCGTCAGGGC CGCCGTCTTC TCCGAGCGCG GCGCCCCTCC CCGGATCCGC 
GACCTGGTCC TGCCCGACCC CGGCCCCGGC CAGGTGCGGG TGCGCCTGGC AGCGGCCGGG
GTGTGCCACT CCGACCTGTC CCTGTCCAAC GGCACCCTGG CGCAGAAGTG GCCCGCGGTG
CTGGGCCACG AGGGTGCCGG AACCGTCGAC GCCGTCGGCG AGGGCGTCAC GGAGGTGGTC
CCCGGCCAGC AGGTGATCCT GAACTGGGCC CCCTCGTGCC GCGAGTGCTG GTTCTGCCGC
CAGGGCGAAC CTCACCTGTG CGAGCACGCG CTGGACCGCA CCGTACTGCC CTACGCGGAG
CTCGCCGACG GCACGCCCGT CTACCCCGGC CTGGGCTGCG GCGCGTTCGC CGAGGCCACC
GTGGTGCCCG CCTCCGCCGT CGTGCCGCTG CCCGATGGGA TCGACCCGGC GGTGGCCGCC
GTGCTGGGCT GCGCGGTGCT CACCGGCTGG GGCGCGGTCC ACAACTCCGC GGGCGTGCGC
GAGGGCCAGT CGGCCGTGGT GCTGGGCCTG GGCGGGGTGG GCCTGTCGGT GCTCCAGGCC
GCGCGTCTGG CCGGGGCCGA CCCGGTGGTC GCGGTGGACG TCTCCCCCGC CAAGGAGGAG
CTGGCCCGTT CACTGGGCGC CACCGAGTTC CTGCTCGCCG ACGAGACCCT GGTCAGGGCC
GTGCGCGCGC TGACGGGCAG GCGCGGCGCC GACCACGCCT TCGAGGTGGT GGGGTCGGCG
AAGGCGGTGC GCTCGGCCTG GGACGTGACC CGGCGCGGCG GCACGGTCAC GGTGGTGGGC
GTGGGCAGGG TGGACGACGA GGTGTCCTTC AACGCGCTGG AGCTGTTCCA CCAGGCGCGC
ACGCTGCGCG GGTGCGTGTA CGGCTCCAGC GACCCGGAGC GCGACGTCCC GCTCATCGCC
GAGCGGGTGC GTTCGGGGGA GCTGAAGCTG GCGGCGATGG TCACCGACGA GATCCCGCTC
GAAGGCGTGC CCGAGGCCTT CGAGCGCATG GCCCGGGGCA GGGGCGGCCG GTCGCTGGTG
CGCTTCGGGG CCTGA
 
Protein sequence
MSTTVRAAVF SERGAPPRIR DLVLPDPGPG QVRVRLAAAG VCHSDLSLSN GTLAQKWPAV 
LGHEGAGTVD AVGEGVTEVV PGQQVILNWA PSCRECWFCR QGEPHLCEHA LDRTVLPYAE
LADGTPVYPG LGCGAFAEAT VVPASAVVPL PDGIDPAVAA VLGCAVLTGW GAVHNSAGVR
EGQSAVVLGL GGVGLSVLQA ARLAGADPVV AVDVSPAKEE LARSLGATEF LLADETLVRA
VRALTGRRGA DHAFEVVGSA KAVRSAWDVT RRGGTVTVVG VGRVDDEVSF NALELFHQAR
TLRGCVYGSS DPERDVPLIA ERVRSGELKL AAMVTDEIPL EGVPEAFERM ARGRGGRSLV
RFGA