Gene Namu_1595 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1595 
Symbol 
ID8447193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1757181 
End bp1758230 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content71% 
IMG OID645040722 
Productaldo/keto reductase 
Protein accessionYP_003200979 
Protein GI258651823 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.14436 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAGTT CACCGTTCCC CGCCCCCGTC TACCTCGCCG ACGACGCCCG CTACGACTCG 
ATGCCCTACC GCCGCACCGG CCGCAGCGGC CTGCTGCTGC CCGCCGTCTC GCTGGGCCTG
TGGCAGAACT TCGGCGACGT CAACCCGCTG GATACCCAAC GGGCGGTGCT GCGCCGGGCC
TTCGACCTCG GCGTCACCCA CTTCGACCTG GCCAACAACT ACGGCCCGCC GTACGGATCG
GCCGAGACCA ACTTCGGCGC GCTGATGGCC CAGGACTTCC GGCCGTACCG GGACGAGCTG
GTCATCTCCA CCAAGGCCGG CTACGACATG TGGCCCGGGC CCTACGGCGA CCGCGGCTCG
CGCAAGTACC TGCTGGCCTC GCTGGATCAG TCGTTGCGGC GGATGGGTCT GGACTACGTC
GACATCTTCT ACTCGCACCG CGCCGACCCG GACACGCCGC TGGAGGAGAC GATGGGCGCG
CTGCACACCG CGGTCGCCTC CGGGCGGGCG CTGTACGCGG GCATCTCGTC CTACTCGCCG
GAGCGGACCC GGCGGGCCGC GGAGATCCTG GCCGACCTGG GCACGCCGCT GCTGATCCAC
CAGCCGTCGT ACTCGATGCT CAACCGCTGG ATCGAGGAGG GCCTGCTGGA CACCCTGGGC
GAGCTGGGCG TCGGCTGCAT CGCGTTCTCC CCGTTGGCCC AGGGCATGCT CACCGGCAAG
TACCTGGACG GGGTGCCGGA GGACTCGCGC GCCGCGCGGG GCGGTTCGCT CTCGCCCGAG
CTGCTCACCG ACGAGGCGCT GAGCCACATC CGCGCGCTCA ACGGCATCGC CGCCGACCGC
GGTCAGTCGC TCGCCCAGCT GGCCCTGTCC TGGGCGCTTC GGGACGAGCG GGTGACCTCG
GTGCTGATCG GCGCGTCGTC GGTGCGCCAG CTGGAGGACA ACCTGGCGGC GACGGCGAAC
CTGGCCTTCG ACGACGAGGA GCTGGACCGG ATCGGGGCGC ACGCCGTCGA CACCGGCATC
AACCTGTGGA GCCCGTCCAG CGCGGTGTGA
 
Protein sequence
MASSPFPAPV YLADDARYDS MPYRRTGRSG LLLPAVSLGL WQNFGDVNPL DTQRAVLRRA 
FDLGVTHFDL ANNYGPPYGS AETNFGALMA QDFRPYRDEL VISTKAGYDM WPGPYGDRGS
RKYLLASLDQ SLRRMGLDYV DIFYSHRADP DTPLEETMGA LHTAVASGRA LYAGISSYSP
ERTRRAAEIL ADLGTPLLIH QPSYSMLNRW IEEGLLDTLG ELGVGCIAFS PLAQGMLTGK
YLDGVPEDSR AARGGSLSPE LLTDEALSHI RALNGIAADR GQSLAQLALS WALRDERVTS
VLIGASSVRQ LEDNLAATAN LAFDDEELDR IGAHAVDTGI NLWSPSSAV