Gene Noca_4678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4678 
Symbol 
ID4598222 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4958562 
End bp4959647 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content66% 
IMG OID639779287 
Productmyo-inositol-1-phosphate synthase 
Protein accessionYP_925860 
Protein GI119718895 
COG category[I] Lipid transport and metabolism 
COG ID[COG1260] Myo-inositol-1-phosphate synthase 
TIGRFAM ID[TIGR03450] inositol 1-phosphate synthase, Actinobacterial type 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTTCGG TTCGAGTAGC AATCGTGGGA GTCGGCAACT GCGCCTCCTC CCTGGTCCAG 
GGTGTGCACT ACTACCGGGA CGCCGACCCG ACGGGAACGG TGCCCGGGCT GATGCACGTG
ACCTTCGGCG AGTACCACGT CAAGGACGTG GAGTTCGTCG CGGCGTTCGA CGTCGACGAC
AAGAAGGTCG GCAAGGACCT CTCCGAGGCG ATCAACGCCT CCGAGAACAA CACCATCAAG
ATCGCCGAGG TCCCCACGCT CGGCATCGAC GTCCAGCGCG GTCACACCCT CGACGGCCTC
GGCAAGTACT ACCGCCAGAC CATCGAGGAG TCGGCCGCCG AGCCGGTCGA CGTGGTCCGG
GTCCTCAAGG ACACCCAGGC CGACGTGCTC GTCTCCTACC TCCCGGTGGG CTCCGAGGAG
GCCGACAAGT TCTACGCCCA GTGCGCGATC GACGCCGGCG TGGCCTTCGT CAACGCCCTC
CCCGTCTTCA TCGCCTCCGA CCCGGTCTGG GCCAAGAAGT TCGAGGACGC CGGCGTCCCG
ATCGTCGGTG ACGACATCAA GTCGCAGGTG GGCGCCACCA TCACCCACCG CGTGATCGCG
AAGCTCTTCG AGGACCGCGG CGTCGCGCTG GACCGCACCT ACCAGCTCAA CGTCGGCGGC
AACATGGACT TCAAGAACAT GCTCGAGCGC GAGCGCCTGG AGTCCAAGAA GGTCTCCAAG
ACCCAGTCCG TGACGTCCAA CCTCAAGGGC GAGCTGGCCG GCAAGGTCGC CGACCGCAAC
GTGCACATCG GCCCGTCGGA CTACGTCCAG TGGCTCGACG ACCGCAAGTG GGCCTACGTC
CGCCTCGAGG GTCGCGCGTT CGGTGACGTG CCGCTGAACA TGGAGTACAA GCTCGAGGTC
TGGGACTCCC CGAACTCGGC CGGCATCATC ATCGACGCGA TCCGCGCCGC GAAGATCGCC
AAGGACCGTG GCCTCGGCGG CCCGATCATC TCGGCGTCGT CGTACCTGAT GAAGTCCCCG
CCGGTGCAGC TCCCCGACGA CGAGGGTCGC CGCCGCGTCG AGGCCTTCAT CAAGGGCGAA
GAGTGA
 
Protein sequence
MGSVRVAIVG VGNCASSLVQ GVHYYRDADP TGTVPGLMHV TFGEYHVKDV EFVAAFDVDD 
KKVGKDLSEA INASENNTIK IAEVPTLGID VQRGHTLDGL GKYYRQTIEE SAAEPVDVVR
VLKDTQADVL VSYLPVGSEE ADKFYAQCAI DAGVAFVNAL PVFIASDPVW AKKFEDAGVP
IVGDDIKSQV GATITHRVIA KLFEDRGVAL DRTYQLNVGG NMDFKNMLER ERLESKKVSK
TQSVTSNLKG ELAGKVADRN VHIGPSDYVQ WLDDRKWAYV RLEGRAFGDV PLNMEYKLEV
WDSPNSAGII IDAIRAAKIA KDRGLGGPII SASSYLMKSP PVQLPDDEGR RRVEAFIKGE
E