Gene Namu_1805 details

Gene Information

Locus tag: Namu_1805
Symbol:
ID: 8447410
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 1980562
End bp: 1981560
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 73%
IMG OID: 645040934
Product: zinc-binding alcohol dehydrogenase family protein
Protein accession: YP_003201184
Protein GI: 258652028
COG category: [R] General function prediction only
COG ID: [COG1064] Zn-dependent alcohol dehydrogenases
TIGRFAM ID: [TIGR02822] zinc-binding alcohol dehydrogenase family protein
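
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 1980562
END_BP = 1981560


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))                  # 999
print(protein_length(gene_length(START_BP, END_BP)))  # 332
```

Under these assumptions both results agree with the Gene Length (999 bp) and Protein Length (332 aa) fields.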


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value: 0.352435
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0674921
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGGCGT GGGTGGTGCA ACAGTGCGCG CCGATCGACC AGGGGCCGCT GCGGCGGATC 
GAGCGGCCGG CCCCGGTACC CGGACCCGGT CAGGTCCGGG TAGCCGTGTC GTGCTGCGGG
GTGTGTCGCA CCGACCTGCA CCTGGCCGAA GGCGATCTGC CGCCCCGGCG ACCCGAGGTC
ACTCCCGGCC ATGAGGTCGT CGGCCGGGTG GACGCGCTCG GCCCCGGAGC CACCCGATTC
GCCGTCGGCG AGCGGGTGGG CGTGGCCTGG CTCGCCCGGA CCGATCAGAC CTGTCGCTAC
TGCCGCCGGG GTGATGAGAA CCTCTGCGCG GAGCCGACCT TCACCGGCTG GGACGTCGAC
GGTGGCTACG CCGACCAGTG TCTGGTCGAC GAGCGGTTCG CCTACCGCTT GCCCGAGCAG
GTCTCGGACG AGCAGGCCGC TCCCCTGCTG TGCGCCGGCA TCATCGGCTA CCGCGCCCTG
CGGGTCGCAC AGGTCCCGGT CGGTGGACGT CTGGGCATCT ACGGTTTCGG CGGAAGTGCC
CATTTAACCG CGCAGATCGC CCTGCAGCTG GGCCTGCGGG TGCACGTGCT GACCCGGGGC
GAGCACAACC GGGCCCTGGC CCGCGAGCTG GGCGCCGACT CCGTGGCAGA CGCGACCGAC
GAGCCGCCGG AGCCGCTGGA CGGGGCGATC CTGTTCGCGC CGGCCGGTGA CCTGGTCCCG
GTGGCCCTGC GCGCCTTGGA CTCCGGGGCG ACCCTGGCCG TCGCCGGCAT CTGGTTGTCC
GACATTCCCG CCCTGAACTA TCAGCGGGAG CTGTTCCGGG AACGGCGACT GCGCAGCGTC
ACGGCCAACA CTCGACGCGA CGGTGAGGAG TTCCTCCGGC TCAGCGCCCG CTTCGGGATC
ACGGCCACCA CCCACCGGTA TCCAATGGCC CAGGCCCCGG CCGCGCTGGC CGATCTGGCC
CACGGGCGGT TCGGCGGGGC GGCGGTGCTG TACCACTGA
 
Protein sequence
MQAWVVQQCA PIDQGPLRRI ERPAPVPGPG QVRVAVSCCG VCRTDLHLAE GDLPPRRPEV 
TPGHEVVGRV DALGPGATRF AVGERVGVAW LARTDQTCRY CRRGDENLCA EPTFTGWDVD
GGYADQCLVD ERFAYRLPEQ VSDEQAAPLL CAGIIGYRAL RVAQVPVGGR LGIYGFGGSA
HLTAQIALQL GLRVHVLTRG EHNRALAREL GADSVADATD EPPEPLDGAI LFAPAGDLVP
VALRALDSGA TLAVAGIWLS DIPALNYQRE LFRERRLRSV TANTRRDGEE FLRLSARFGI
TATTHRYPMA QAPAALADLA HGRFGGAAVL YH
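
The sequences above are printed in space-separated blocks of ten, so they need cleaning before use. The sketch below is an illustration, not part of the original record. It strips the whitespace, computes GC content, and translates a CDS with the standard codon assignments, which translation table 11 shares with table 1 (the two differ only in permitted start codons). All function names are hypothetical, and the code assumes an uppercase, unambiguous A/C/G/T sequence.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order (same for tables 1 and 11).
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 21 bases and last 9 bases of the gene sequence above.
print(translate("ATGCAGGCGT GGGTGGTGCA A"))  # MQAWVVQ
print(translate("TACCACTGA"))                # YH
```

Applied to the full gene sequence, `translate` can be compared against the listed protein sequence and `gc_content` against the reported 73% GC content.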