Gene Namu_1596 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1596 
Symbol 
ID8447194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1758255 
End bp1759655 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content74% 
IMG OID645040723 
Producthypothetical protein 
Protein accessionYP_003200980 
Protein GI258651824 
COG category[S] Function unknown 
COG ID[COG5282] Uncharacterized conserved protein 
TIGRFAM ID[TIGR03624] putative hydrolase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.552209 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.160274 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGTAACG TGGGCCGCAT GACGAATCTG CCCTTCGGAT TCAGCCCATC CGGAGATGAC 
GACCCCGACG GCAAGCCGGG CCAGGGCCCG GGGCCAGGCG GTTTCGACCT CGGCCAGCTC
GGCTCGATGC TCTCCCAGCT CGGCCAGATG ATGTCGCAGG CGAACGCCTC AGGTGCCTCC
ACCGGCCCGG TCAACTACGA CCTGGCGCGC CGGCTGGCCA CCTCCCAGTT GCCCGCCTCC
CACCCGGCCT CCGCGGTCGA CGTGAACAAG GTGGTCGAGG CGATCAAGCT GGCCGAAGTC
TGGCTGGACG GCGCGACCGC GCTGCCCGCC GGGGCCCGCA CGGCCACCGC CTGGACGCCC
CGCCAGTGGG TGGACGCGAC CATGCCGGCC TGGGAGAAGC TGTGCTCCCC CATCGCCGAG
CAGGTCTCCC GCGCCTGGGT GGACGGGCTG CCCGAGCAGG CCAAGGCCCA GGCCGGACCG
CTGCTGGCGA TGATGGGCTC GATGGGCGGC ATGGCCTTCG GCTCGCAGCT GGGCCAGGGG
CTGGCCCAGC TGGCCACCGA GGTGCTCACC TCCACCGACG TCGGCATCCC GCTCGGACCC
GAGGGCACCG CCGCGCTGCT GCCCCGCTCG ATCAGCGAGT TCGGCGCCGG GCTGAACCTG
CCCGAGGACC AAGTGCGGCT GTACCTGGCC GTCCGCGAGG CGGCCCACCA CCGGCTGTAC
GCGGGCACCC CGTGGCTGCG GGACCGGGTG GTCGCGCTGA TCAACGACTA CGCGCGGGCC
ATCTCGGTCG ACTTCTCCGC GGTCGAGCAG CTGGCCTCCA ACATCGACCC GTCCGACCCG
GCCAGCATCG AGGCGGCGCT GGGCCAGGGC ATGTTCGAGC CGACCATCAC CCCCGGCCAG
CAGGCCGCGA TGGCCGAGCT GGAGACCCTG CTCGCGCTGG TCGAGGGCTG GGTGGACACG
GTCGTCGCCG ACGCGGTCGG CGAACGGCTG CCGGGGGCCA ACGCGCTGCG CGAGACGCTG
CGCCGGCGCC GGGCCACCGG TGGGCCGGCC GAGCAGACCT TCGCCACCCT CATCGGCCTG
GAGCTGCGGC CCCGGCGGCT GCGCGCCGCC GCCGAGCTGT GGCAGGCCGT GGGCGAGTCC
CGCGGCACGG ACGGCCGGGA CGCCCTGTGG GCCGACCCGG GCCTGCTGCC CTCGGGCACC
GACCTGGACG ACCCGAAGGG CTTCGTCGAA CGCGACAAGC AGTTCACCGA GCTGCTCGCC
GGGTTGGACG ACATCGAGAC CCAGCTGCTG GGCAAGCCGG ACGACGCGAC CGACCCGGCC
GATTCAGCCG CTCCGGCTGA TTCCCGGGAC CCGGCCGGCG ACGACGGCAA CCCCGACGAG
CAGCCGCCGC GCCCGGTCTG A
 
Protein sequence
MRNVGRMTNL PFGFSPSGDD DPDGKPGQGP GPGGFDLGQL GSMLSQLGQM MSQANASGAS 
TGPVNYDLAR RLATSQLPAS HPASAVDVNK VVEAIKLAEV WLDGATALPA GARTATAWTP
RQWVDATMPA WEKLCSPIAE QVSRAWVDGL PEQAKAQAGP LLAMMGSMGG MAFGSQLGQG
LAQLATEVLT STDVGIPLGP EGTAALLPRS ISEFGAGLNL PEDQVRLYLA VREAAHHRLY
AGTPWLRDRV VALINDYARA ISVDFSAVEQ LASNIDPSDP ASIEAALGQG MFEPTITPGQ
QAAMAELETL LALVEGWVDT VVADAVGERL PGANALRETL RRRRATGGPA EQTFATLIGL
ELRPRRLRAA AELWQAVGES RGTDGRDALW ADPGLLPSGT DLDDPKGFVE RDKQFTELLA
GLDDIETQLL GKPDDATDPA DSAAPADSRD PAGDDGNPDE QPPRPV