Gene Namu_3092 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3092 
Symbol 
ID8448706 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3410245 
End bp3411663 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content72% 
IMG OID645042174 
Productaminoglycoside phosphotransferase 
Protein accessionYP_003202415 
Protein GI258653259 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis 
TIGRFAM ID[TIGR02457] trehalose synthase-fused probable maltokinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000153102 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000269489 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGACTCAGC AGCCCGACCA TGTCGGCGCG CCCGATAGCG ACCTCATTTC CCTGCTCGCG 
TACTACCTGC CGGCCCAGCG CTGGTTCGCC GCCAAGGGCC AGACCCTGAC GGACGAGAAC
TACCGGATCG TGCGCCGCAC GGTCATCGCC CGTGACGTGT CCGCGGCGGA CGTCGAACAG
GTGCTGCTGG AGGTCACCAC CCCCACCCGG CAGGACCTGT ACCAGCTGTG GGTGGCCTGG
ACCGATCACG TGCCGGACCG GGTGGCGCAC GCCGTCATCG GCACCTACGA CGGCCGGACT
GCCTACGACG CGCTGGCCGA CGTCAACGTC ACCGCCCAGA TGCTGTCCGC GGTCGACGAG
GGCCGCGACT TCGGGTCCGG GGTCAGCGCC CGCCGCGAGC CGGACGCCGT GATCGACATC
GGCGCGATGG GCCTGGTGAT CGGCGCCGAG CAGTCCAACA CCTCGATCGT GTACGGACAC
TCGGCCATCC TGAAGGTGTT TCGCCGGCTG GATCCGGGGC CGAACCCGGA CGCCGAGGTG
CACCGGGCCC TGCACGCGGT CGGCTCCTCG CACATCGCGC AGCCGTTCGG CGAGATCGTC
GGCCGGCTGG CCGAGGGGGC CGGGGAGGCG GAGACGACGC TGGCCCTGCT CACCGAGTTC
TTCGCCAACA GCGCCGACGG CTGGGCGATG GCCACCGCCT CGGTGCGCGA CCTGATGAGC
GAGGGCGACC TGCGGGCCGA CGAGGTGGGC GGTGACTTCG CCGCCGAGTC CTACCGGCTG
GGCGAGGCGG TCGCCGCGGT GCACGCCGAC CTGGTCCGGG CGTTCGGCAG CACCGTGGCC
ACCGGGGCGG AACTGGGCCA GATCCTGGAC GCGATGCGGG CCGAGGCCGA CGCCGCGGCC
GATCAGGTGC CCGCGCTGGC CGAGCACCGC GAGGCGATCC AGGCCGCGTT CGACCGGGCC
GGCGACCACG CGGCCGGGCT GAACCTGCAG CGCATCCACG GCGACCTGCA CCTGGGTCAG
GTGCTGCGCA CCCTGGACGG CTGGAAGATC ATCGACTTCG AGGGTGAACC GGCCAAGCCG
CTGGCCTTCC GCCGGGGCAT GCACAGCCCG CTCAGGGACG TCGCCGGCAT GTTGCGCTCG
TTCGACTACG CCGCCCGCGG CCAGACGATC AACCCGCACA CCGACGCGCA GCACCGCTAC
CGGGCGGCCG AGTGGGCGAC CCGCAACCGT CGGTCGTTCT GCGACGGCTA TGCCGCCGCC
GCGGGCACCG ACCCCCGACA CGCCGACAGC CTGCTGCGCG TGTTCGAGCT CGACAAGGCG
ATTTACGAAG TTGTCTACGA ACATGGACAC CGGCCGCTGT GGGAAGCCAT TCCGCTGCAG
GCCGTTGTCG CCCTCATCAA CCCCGGAGGA ACCGCGTGA
 
Protein sequence
MTQQPDHVGA PDSDLISLLA YYLPAQRWFA AKGQTLTDEN YRIVRRTVIA RDVSAADVEQ 
VLLEVTTPTR QDLYQLWVAW TDHVPDRVAH AVIGTYDGRT AYDALADVNV TAQMLSAVDE
GRDFGSGVSA RREPDAVIDI GAMGLVIGAE QSNTSIVYGH SAILKVFRRL DPGPNPDAEV
HRALHAVGSS HIAQPFGEIV GRLAEGAGEA ETTLALLTEF FANSADGWAM ATASVRDLMS
EGDLRADEVG GDFAAESYRL GEAVAAVHAD LVRAFGSTVA TGAELGQILD AMRAEADAAA
DQVPALAEHR EAIQAAFDRA GDHAAGLNLQ RIHGDLHLGQ VLRTLDGWKI IDFEGEPAKP
LAFRRGMHSP LRDVAGMLRS FDYAARGQTI NPHTDAQHRY RAAEWATRNR RSFCDGYAAA
AGTDPRHADS LLRVFELDKA IYEVVYEHGH RPLWEAIPLQ AVVALINPGG TA