Gene Namu_5384 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_5384 
Symbol 
ID8451017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp6022527 
End bp6023606 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content67% 
IMG OID645044414 
Productinositol 1-phosphate synthase 
Protein accessionYP_003204636 
Protein GI258655480 
COG category[I] Lipid transport and metabolism 
COG ID[COG1260] Myo-inositol-1-phosphate synthase 
TIGRFAM ID[TIGR03450] inositol 1-phosphate synthase, Actinobacterial type 


Plasmid Coverage information

Num covering plasmid clones90 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGCA TCAGAGTGGC CATCGTCGGC GTCGGGAACT GCGCGTCGTC GCTGGTCCAG 
GGCGTCGAGT ACTACAAGGA CGCGGACCCG AAGTCGACCG TGCCCGGCCT CATGCACGTC
CAGTTCGGCG ATTACCACGT GCGCGACATC GAGTTCGTGG CCGCGTTCGA CGTGGACGCC
AAGAAGGTCG GCTTCGACCT GTCCGAGGCG ATCAACGCCA GCGAGAACAA CACCATCAAG
ATCGCCGACG TGCCGCCGAC CGGTGTCGCC GTCCAGCGCG GCGTCACCCA CGACGGTCTG
GGTCGCTACT ACCTGGAGAC CATCACCGAG TCCCCGGCCG AGCCGGTCGA CGTGGTGGCC
GCGCTGCGCG AGGCCCGGGT CGACGTCGTC GTCTGCTACC TGCCGGTCGG CTCCGAGGAC
GCCGTCCGCT TCTACGCCCA GTGCGCCATC GACGCCGGCT GCGCCTTCGT CAACGCGCTG
CCGGTGTTCA TCGCCAGCAC CCCCGAGTGG GCCGAGAAGT TCCGGGCGGC CGGGCTGCCC
ATCGTCGGTG ACGACATCAA GTCGCAGATC GGCGCGACCA TCACCCACCG GGTGCTGGCC
AAGCTGTTCG AGGACCGCGG CGTCATCCTG GACCGCACGA TGCAGCTCAA CGTCGGCGGC
AACATGGACT TCAAGAACAT GCTCGAGCGC GACCGGCTGG AGTCCAAGAA GATCTCCAAG
ACCCAGGCCG TCACCTCCCA GGTCGACCGG GACATGGGTG CCCGCAACGT GCACATCGGG
CCGAGCGACT ACGTGCCGTG GCTGGACGAC CGCAAGTGGG CCTACGTGCG TCTCGAGGGC
CGCGCGTTCG GTGACGCCCC GCTGAACATG GAGTACAAGC TCGAGGTCTG GGACTCGCCG
AACTCGGCCG GCGTCATCAT CGACGCGGTC CGCGCGGCCA AGATCGCCAA GGACCGCGGC
GTGGGCGGCC CGATCCTGTC CGCCTCGTCC TACTTCATGA AGTCCCCGCC GGTGCAGTAC
CCGGACGACC AGGCCCGGGA CAACGTGGAG AAGTTCATCG CCGGCGAGAT CGACTTCTGA
 
Protein sequence
MSSIRVAIVG VGNCASSLVQ GVEYYKDADP KSTVPGLMHV QFGDYHVRDI EFVAAFDVDA 
KKVGFDLSEA INASENNTIK IADVPPTGVA VQRGVTHDGL GRYYLETITE SPAEPVDVVA
ALREARVDVV VCYLPVGSED AVRFYAQCAI DAGCAFVNAL PVFIASTPEW AEKFRAAGLP
IVGDDIKSQI GATITHRVLA KLFEDRGVIL DRTMQLNVGG NMDFKNMLER DRLESKKISK
TQAVTSQVDR DMGARNVHIG PSDYVPWLDD RKWAYVRLEG RAFGDAPLNM EYKLEVWDSP
NSAGVIIDAV RAAKIAKDRG VGGPILSASS YFMKSPPVQY PDDQARDNVE KFIAGEIDF