Gene Namu_2102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2102 
Symbol 
ID8447713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2320625 
End bp2321521 
Gene Length897 bp 
Protein Length298 aa 
Translation table11 
GC content74% 
IMG OID645041225 
Productmodification methylase, HemK family 
Protein accessionYP_003201469 
Protein GI258652313 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2890] Methylase of polypeptide chain release factors 
TIGRFAM ID[TIGR00536] HemK family putative methylases
[TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0088252 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0697027 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGGC ATTCGCTGCG CGCGGCCATC CTGGACGCCA CCCGCACCCT GGAGGCGGCG 
GGGGTGGCCA GCGCGGACGT CGACGCGCAG GAACTGGCCG CGCATCTGAT GGGCGTGCCG
CGAACCCGGC TGGGCCTGAC CCCGCTGGTC GAGCAGTCCT TCCTGACCGA CTACCAGGCC
CTGGTGGAGC GCCGTGCGCA GCGGATCCCG CTGCAGCACC TGACCGGGTC GGTCCAGCTG
GGCCGGGCCA CCGTTGCGGT CGGACCCGGG GTGTTCGTCC CCCGCCCGGA GACCGAGTCG
CTGCTGGTCT GGGCCCTGCA TGCGATCGCC GCCGTGGAAC GGCCGGTCGT GGTCGACCTG
TGCACCGGCA GCGGAGTGCT GGCCCTGGCC ATCGCCGCCG AACGGCCCGA CGCCCGGGTG
ATCGGAGTGG AACGCTCCTC CGCCGCCCTG GCCTGGGCCC GGCGCAACGT GACCAACGCC
GGGGCCGGCC GGACCAAAGT CGAGCTGCGC GGAGGGGACA TCTTCGACGA GCGGTTGCTG
GTCGACCTGG AGGGTCTGGC CGACCTGGTC ACCGCCAACC CGCCCTACGT GCCCGAGGGC
ACCGCGGTCG AACCCGAGGT GGCTGACCAC GATCCGCCCG AGGCGGTGTT CGCCGGACCG
GACGGGCTGG CGGTCATCCG GCCGCTGCTC TCGGTGGCCG CGAGCCTGCT CAAGCTCGGG
GGAGTGCTGG CCATCGAGCA CGACGACAGC CACGGCGAGA CGGTGCCCGC GTTGCTCCGG
TCGCGGCGGG TGCTCACCGA CGTCGAGGAC CACTCCGACC TGGCCGGCCG CCCGCGGTTC
GTCACCGCCA CCCGGGTGCG GATGACGACG GGCGCCGGGA AGACTGGCAC ACCGTGA
 
Protein sequence
MSRHSLRAAI LDATRTLEAA GVASADVDAQ ELAAHLMGVP RTRLGLTPLV EQSFLTDYQA 
LVERRAQRIP LQHLTGSVQL GRATVAVGPG VFVPRPETES LLVWALHAIA AVERPVVVDL
CTGSGVLALA IAAERPDARV IGVERSSAAL AWARRNVTNA GAGRTKVELR GGDIFDERLL
VDLEGLADLV TANPPYVPEG TAVEPEVADH DPPEAVFAGP DGLAVIRPLL SVAASLLKLG
GVLAIEHDDS HGETVPALLR SRRVLTDVED HSDLAGRPRF VTATRVRMTT GAGKTGTP