Gene Namu_0234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_0234 
Symbol 
ID8445814 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp263062 
End bp264339 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content72% 
IMG OID645039379 
Productprotein of unknown function DUF21 
Protein accessionYP_003199654 
Protein GI258650498 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones70 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCATCT GGCTGAACCT GCTGGTGGTG CTGCTTCTGC TGATCATCGG CGGCTTGTTC 
ACGGCGACCG AACTCGCCCT GGTCTCCCTG CGCCCGGGTG AACTGGCCGA CCTGCGGGCG
CAGGGACCGC GCGGCGCCCG GGTGGCCCGG CTGGCCGGCC AGCCAACCCG GTTCCTCGGG
GCCGTGCAGA TCGGCTCCAT CGTGGCCGGG TTCTTCGCCG CCGCCTACGC CACCGCGACC
CTGGCCGAAC CACTCGGTGC GGCGATGGGC CGGGCGGGAC TGGGCGAGGA CGGCGGGGAG
ACCCTGGCCG TGCTGATCGT CACCCTGGCG ACCACCTTCC TGGCCCTGAT CATGGCCGAG
CTGACCCCGC GCCGGTACGC GATGCAGCGC CCGCAATCCG TGGCCGCCCT GCTCGGTCCG
ATCCTGGACC GGCTGGCCAC GCTGCTGCGA CCGGTCATCT GGCTGTTGGA GAAGTGCACG
AACGGCCTGT TGCGGCTGTT GCGGGTCGAT CCGAAGGATT CCCGCGCGGA GATGAGCGTG
GAGGAGGTGC GCGAGCTGGT CCTGGCCCAT GAGGAGGTCC CGGACCAGGA GAAACAGATC
ATCCGGCAGG TGTTCGCGGC CGGCGAGCGG ACCATCCGGC AGGCCATGGT CCCCCGGGCC
GCGATCGACT TCCTGTCCAC CGCGGCCACC GGCGCGCAGG CCCGGCGGGC CGCCTGGGAG
CACACCCACA CCCGGTACCC GGTGCTGGAC GAGGCCGGCC AGGTGGCCGG GTTCCTGCAC
GTGCGGGATC TGTTCGCGCC CGAACTGGAT CCCGGCGCCC CGATCCGGGA CCTGGTCCGG
CCGATCAGCG CCTACCCGCC GAACAAGAAG TTGTTGGCCG TGCTGCGCGA GATGCAGACC
GGGGCCGAGA ACATCGCCGC GGTCGTGGAC GAGTACGGCC AGCTCAAGGG CATGGTCACC
CTCGAGGACG TCGTCGAGGA ACTCGTCGGC GAGATGTACG ACGAGTACGA CCGCATCCCC
ACCGCCACCC CCGGGGACGC CACCGTGGTC GACGGCCTGA CCGGCCTGTC CGACTTCGGC
CGCCGGCTCG GTTTCGAGCT GCCGGCCGGC CGGTACGACA CGGTCGGCGG GTACCTGCAG
GCCGCCCTGG ACCGGACGCC CCGCGCCGGG GACGCGGTCG AGGTCGCCGG CCACCGGCTC
ACGGTCTCCT CGGTCGCCGG CTGGCGGGTC GGTCAGGTCA CCGTCGAACC GCTGTCGACG
CCGGTTCAAC CGGACTGA
 
Protein sequence
MGIWLNLLVV LLLLIIGGLF TATELALVSL RPGELADLRA QGPRGARVAR LAGQPTRFLG 
AVQIGSIVAG FFAAAYATAT LAEPLGAAMG RAGLGEDGGE TLAVLIVTLA TTFLALIMAE
LTPRRYAMQR PQSVAALLGP ILDRLATLLR PVIWLLEKCT NGLLRLLRVD PKDSRAEMSV
EEVRELVLAH EEVPDQEKQI IRQVFAAGER TIRQAMVPRA AIDFLSTAAT GAQARRAAWE
HTHTRYPVLD EAGQVAGFLH VRDLFAPELD PGAPIRDLVR PISAYPPNKK LLAVLREMQT
GAENIAAVVD EYGQLKGMVT LEDVVEELVG EMYDEYDRIP TATPGDATVV DGLTGLSDFG
RRLGFELPAG RYDTVGGYLQ AALDRTPRAG DAVEVAGHRL TVSSVAGWRV GQVTVEPLST
PVQPD