Gene Namu_3688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3688 
Symbol 
ID8449307 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4046223 
End bp4047230 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content70% 
IMG OID645042752 
Productshort chain dehydrogenase 
Protein accessionYP_003202988 
Protein GI258653832 
COG category[R] General function prediction only 
COG ID[COG4221] Short-chain alcohol dehydrogenase of unknown specificity 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.000592094 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0665627 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCCAC AGGTCGTAGT GGTCACCGGG GCCAGCGGCG GTATCGGCCG CGCGGTCGCC 
TCGGCGTTCG GCGCCCGCGG GGCCCGCGTC GCGATGCTGG CGCGCGGCGA GAGCGGGCTG
ACGGGCGCCG CCCAAGATGT GCGTGCCGGC GGCGGCACCG CGCTGCCCAT CCCGACGGAC
GTGGCCGACC AGGCGCAGGT TTTTTCGGCC GCCGACCGCG TCGAAAGCGA GCTCGGCCCC
ATCGATGTCT GGGTGAATGT CGCTTTCACC TCGGTGTTCG CGCCCTTCGC GAAGATCCAA
CCCGACGAAT ACCGGCGGGT GACCGAGGTG AGCTATCTGG GATACGTCTA CGGCACCATG
GCCGCGCTAC AGAACATGAA ACCCCGCGAC CGGGGCACCA TCGTGCAGGT CGGGTCCGCG
CTGGCCTACC GCGGCATTCC CTTACAGACG GCGTACTGCG GCGCTAAACA CGCGATCCAG
GGCTTTCACG AGGCGCTGCG CTGCGAACTA CTGCATGACA AGTCGAACGT GCACGTGACG
ATGGTGCAGA TGCCCGCGGT GAACACCCCG CAGTTCTCCT GGGTGCTGTC CCGGCTACCC
CACCACGCCC AACCCGTCCC GCCGATCTAC CAGCCCGAGG TCGCCGCCCG CGGCGTCCTG
TACGCGGCCG ACCACCCGAA GCGGCGGGAA TACTGGGTCG GCGCCAGCAC CGTCGGCACC
CTGGCCGCCA ACGCCATCGC CCCGGGACTG CTGGACCGCT ACCTGGGCAA AACCGGGTTC
TCCTCCCAAC AGACCAAGCA GAGGCAACCC CCCGACGCGC CGGCGAACCT GTGGAAACCG
GCCGACGGAC CCGACGGCAG GGACTTCGGC ACACACGGCA TCTTCGACGA CCGAGCCAAG
AACTCCGCAC CGCAACTGTG GGCGTCGCAC CACCACGGCC TGCTCGCCGC CACGGCGAGC
GGTGCGCTGG CCGGCGCCGC GGCCCTGATG CTGGTCCGCC GCAGATGA
 
Protein sequence
MTPQVVVVTG ASGGIGRAVA SAFGARGARV AMLARGESGL TGAAQDVRAG GGTALPIPTD 
VADQAQVFSA ADRVESELGP IDVWVNVAFT SVFAPFAKIQ PDEYRRVTEV SYLGYVYGTM
AALQNMKPRD RGTIVQVGSA LAYRGIPLQT AYCGAKHAIQ GFHEALRCEL LHDKSNVHVT
MVQMPAVNTP QFSWVLSRLP HHAQPVPPIY QPEVAARGVL YAADHPKRRE YWVGASTVGT
LAANAIAPGL LDRYLGKTGF SSQQTKQRQP PDAPANLWKP ADGPDGRDFG THGIFDDRAK
NSAPQLWASH HHGLLAATAS GALAGAAALM LVRRR