Gene Namu_3811 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3811 
Symbol 
ID8449430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4182577 
End bp4183653 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content72% 
IMG OID645042861 
Productoxidoreductase domain protein 
Protein accessionYP_003203097 
Protein GI258653941 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.247991 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0981542 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATCCC CCCTTCGTAT CGCAGTGCTC GGCGGTGGCC GGATGGGCCA GAGTCACGCC 
CGCCAGATCC TGGCCAATCC CGACACCGAA CTGGTCGCGA TCATCGACCC GGCCACCGAC
CAGCCGGCCC GGCAGTTCGG AGTCGCCCAT TTCCCGGATC ACCCGAGCCT GCTGGCGCAG
GCCCGGCCGG ACGCGGTGAT CGTCGCCACC CCGAACGACC TGCACGTGCC GACCGCCCTG
GACTGCCTGG CCGCCGGCGT GCCGGCGCTG GTGGAAAAGC CGGTCGGGGT GAACCCGCAG
GAGGTCGACG AGCTCGCCGC CGCGGTTCAG ACCACCGGGG TTCCGGTCCT GGTCGGGCAC
CACCGGCGGC ATCACCCGGT GATCGGCGCG GCCAAGCAGT ACATCGCCTC GGGCGAGCTG
GGCCAGCTCG TCGCGATCAA CGCACTGTGG CTGACCCGCA AGCCCGCCGA CTACTTCGAT
ACCTGGCGCT CGGCCGCCGG GGCCGGCGTT CTGCTGATCA ACCTGGTGCA CGACATCGAC
GTGCTCCGGT ACATGTGCGG CGAGATCACC TCCGTGGTCG CCCTGACCAG CTCCGCGGCA
CGGGGATTGG TCGTCGAGGA CACCGCCAGC CTGACCCTGC AGTTCGCCGG CGGAGCTCTA
GGCAGCATCA TCGGCTCGGA TGCCGCGGTG GCCCCCTGGG GCTGGGACAA GAACTCCGGC
GACGACCCCT ACTTCGCCCA GGAGCCGGAC CAACCTTGCT TCATGATCGC CGGTACCCGG
GGCTCCATCC AGGTCCCACA GCTGGCCACC TGGTCCTACC AGGGCCAGGC CGACTGGACG
GCCCCGCTCA CCCGCGACCA GGTGCCGTTG CCGGCCGGCG GAGCGCTGGA CCGGCAGCTC
GCCCACTTCG TGCGGGTCGC TCGCGGGGAG GTGCCGCCGT TGGTGTCCGT GCGCGATGCC
GGCCGCACCA TCGCGGTCGT CGATGCCTGC CACCGGGCCG CCCGGACCGG ACAGCGGGTC
GACGTCACCG AGACCGCCGA CCGGCTGACC GCACCCCCCC TGCAGGCCGC CCGATGA
 
Protein sequence
MGSPLRIAVL GGGRMGQSHA RQILANPDTE LVAIIDPATD QPARQFGVAH FPDHPSLLAQ 
ARPDAVIVAT PNDLHVPTAL DCLAAGVPAL VEKPVGVNPQ EVDELAAAVQ TTGVPVLVGH
HRRHHPVIGA AKQYIASGEL GQLVAINALW LTRKPADYFD TWRSAAGAGV LLINLVHDID
VLRYMCGEIT SVVALTSSAA RGLVVEDTAS LTLQFAGGAL GSIIGSDAAV APWGWDKNSG
DDPYFAQEPD QPCFMIAGTR GSIQVPQLAT WSYQGQADWT APLTRDQVPL PAGGALDRQL
AHFVRVARGE VPPLVSVRDA GRTIAVVDAC HRAARTGQRV DVTETADRLT APPLQAAR