Gene Namu_0232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_0232 
Symbol 
ID8445812 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp261317 
End bp262627 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content71% 
IMG OID645039377 
Productprotein of unknown function DUF1254 
Protein accessionYP_003199652 
Protein GI258650496 
COG category[S] Function unknown 
COG ID[COG5361] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones73 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCATTC TGGCCAGTGA TCTGCGCACC GTGAGCCACG AGGCGTACAT CTATCTGTAC 
CCGTTGGTGA CCATGGAGGT GACCCGCCAG CAGCTGGCGA ACATTCCCCC GGGCACCCGC
GAGGGCTTCG GGCCGCCCAA CCAGTTCCAC CACCTGCGCA CGTTCCCCGA CGCCGACTTC
CGGGCCGTGG TGCGGCCCAA TTTCGACACC CTCTACTCGT CGGCCTGGCT CGACCTGACC
CGCGGCCCGG TCGTCGTCGA GGTGCCCGAC AGCGACGACC GCTACTACAT GATGCCGATG
CTGGACATGT GGACCGACGT GTTCGCCAAC CCCGGCAAGC GGACCACGGG CACGGGCGCC
CAGACGTTCG TGGTGACCGC TCCGGGGTAC ACCGGCGACC TGCCGGCCGG CGCCACCCCG
ATCGCGGCGC CGACCCCGCA CGTGTGGGTC ATCGGCCGGA CCCAGACCAA CGGGCCGGCC
GACTACGCCG CGGTGAACGC GTTCCAGGAC GGGCTGCGGA TCACCGAGCT GGCCGGCCCG
AGCACGTTCG TCATCGACGA GTCGGTGGAC ACCACCACCG AGCCGTTGCG CCTGGTCGAC
GCGATGAGCG CGGCCGAGTT CTTCACGCTC GCCGCGCGCA CGATGGCGGT GAACCCGCCG
CACCGGTCGG ACTTCTCGCA GGTGGCCCGG ATGTCCCTGC TCGGTGTGGC CCCCGGCGTC
GACTTCGACG CCGGCCGGTT CGACGCCGCC GAACTGGCCG AGATCGAGGC CGGCGCGAAG
GCCGCCCAGG CGGCGCTGCA CGCCGGCGTC GCCCGGCTGG CCGCACCGGT GAACGGGTGG
ACGATGCTCA CCGACACGAT GGGCGTCTAC GGCAACGAGT ACTTCCGGCG GGCCGTGATC
ACCCTGGTCG GCCTGGGCGC CAACCCGGCG CAGGATGCGG TCTACCCGCT GCTGGTGGCC
GACGCGGACG GCAAGCCGAC GGTGGGCGAC CACGACTACG TCATCCACTT CGACGCCGAT
CAGCTGCCAC CGGCCCAGGC GTTCTGGTCG ATCACGATGT ACGACGCCGA GGGCTTCCAG
GCGCCCAACG AGCTGGACCG GTTCGCCATC GGCGACCGGG ACCCGCTGGT CTTCAATCCG
GACGGTTCGC TGGACATCTA CATGCAGCAC GGCGATCCGG GACCGCACCG GCGGGCGAAC
TGGCTGCCGG CGCCGACCGG TCCGGTGGGC ATCACCATGC GGCTCTACGC GCCGGCCCCG
GCGGTGCTCG ACGGCACCTG GCATCCGCCC GCCGTGCGCC GGGTGAAGTA G
 
Protein sequence
MSILASDLRT VSHEAYIYLY PLVTMEVTRQ QLANIPPGTR EGFGPPNQFH HLRTFPDADF 
RAVVRPNFDT LYSSAWLDLT RGPVVVEVPD SDDRYYMMPM LDMWTDVFAN PGKRTTGTGA
QTFVVTAPGY TGDLPAGATP IAAPTPHVWV IGRTQTNGPA DYAAVNAFQD GLRITELAGP
STFVIDESVD TTTEPLRLVD AMSAAEFFTL AARTMAVNPP HRSDFSQVAR MSLLGVAPGV
DFDAGRFDAA ELAEIEAGAK AAQAALHAGV ARLAAPVNGW TMLTDTMGVY GNEYFRRAVI
TLVGLGANPA QDAVYPLLVA DADGKPTVGD HDYVIHFDAD QLPPAQAFWS ITMYDAEGFQ
APNELDRFAI GDRDPLVFNP DGSLDIYMQH GDPGPHRRAN WLPAPTGPVG ITMRLYAPAP
AVLDGTWHPP AVRRVK