Gene Ndas_4846 details


Gene Information

Locus tag: Ndas_4846
Symbol:
ID: 9248732
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5742810
End bp: 5743901
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: inositol 1-phosphate synthase
Protein accession: YP_003682735
Protein GI: 297563761
COG category:
COG ID:
TIGRFAM ID:
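
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields.
START_BP = 5742810
END_BP = 5743901


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(length_bp):
    """Residue count, assuming the final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1092
print(protein_length(length_bp))  # 363
```

Under these assumptions both results agree with the reported Gene Length (1092 bp) and Protein Length (363 aa).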


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTTCGG TACGTGTAGC CGTCGTGGGC GTCGGCAACT GTGCGGCGTC GCTCGTCCAG 
GGCGTGCACT ACTACAAGGA CGCCAACCCC GAGTCCCGGG TACCAGGCCT GATGCACGTG
CAGTTCGGCC CGTACCACGT GCGCGACATC GAGTTCGTCG CAGCCTTCGA CGTGGACGCC
AAGAAGGTCG GCCACGACCT CGCCGACGCC ATCACGGCCA GCGAGAACAA CACCGTCAAG
ATCTGCGACG TGCCGCCGAC CGGTGTCACC GTCATGCGCG GACCGACCTA CGACGGGCTC
GGCAAGTACT ACCGCGAGGT CATCCAGGAG TCCCCCGAGG ACGCGGTGGA CGTGGTCGCG
GCGCTCAAGG CCAGCAAGGC CGACGTGCTC GTGTCCTACC TCCCGGTGGG CTCGGAGGAG
GCGGGCAAGT TCTACGCCCA GTGCGCGATC GACGCGGGCG TGGCCTTCGT CAACGCCCTG
CCGGTGTTCA TCGCCTCCGA CCCCGAGTGG GCCGAGAAGT TCACCAGAGC GGGTGTGCCG
ATCATCGGCG ACGACATCAA GTCGCAGATC GGCGCGACCA TCACCCACCG CGTGCTGTCC
AAGCTGTTCG AGGACCGCGG CGTGATCGTG GACCGCACGT ACCAGCTCAA CTTCGGCGGC
AACATGGACT TCAAGAACAT GTTGGAGCGC GACCGCCTGG AGTCCAAGAA GATCTCCAAG
ACCCAGTCCG TCACCTCCCA GATCCCGCAC GAGCTGAAGG CAGGCTCGGT GCACATCGGC
CCGTCGGACC ACGTGCCGTG GCTGGACGAC CGCAAGTGGG CCTACATCCG CCTTGAGGGG
CGCGCGTTCG GCGACGTGCC GCTGAACCTG GAGTACAAGC TGGAGGTCTG GGACTCCCCC
AACTCCGCGG GCATCATCAT CGACGCGGTC CGCGCCGCCA AGATCGCCAA GGACCGCGGC
ATGGGCGGCC CGATCCTGGC CCCGTCCTCC TACTTCATGA AGTCCCCGCC CGAGCAGTAC
AGCGACGCCG AGGCGCACGA GAAGGTCGAG CAGTTCATCG CCTCGGGCCA GCACGACGGC
GCCGACGAGT AG
 
Protein sequence
MGSVRVAVVG VGNCAASLVQ GVHYYKDANP ESRVPGLMHV QFGPYHVRDI EFVAAFDVDA 
KKVGHDLADA ITASENNTVK ICDVPPTGVT VMRGPTYDGL GKYYREVIQE SPEDAVDVVA
ALKASKADVL VSYLPVGSEE AGKFYAQCAI DAGVAFVNAL PVFIASDPEW AEKFTRAGVP
IIGDDIKSQI GATITHRVLS KLFEDRGVIV DRTYQLNFGG NMDFKNMLER DRLESKKISK
TQSVTSQIPH ELKAGSVHIG PSDHVPWLDD RKWAYIRLEG RAFGDVPLNL EYKLEVWDSP
NSAGIIIDAV RAAKIAKDRG MGGPILAPSS YFMKSPPEQY SDAEAHEKVE QFIASGQHDG
ADE
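
As a rough illustration of how the protein sequence follows from the gene sequence, the sketch below translates the first and last lines of the gene sequence and provides a helper for GC fraction. It relies on the fact that translation table 11 assigns sense codons the same way as the standard genetic code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate` and `gc_fraction` are illustrative and not part of the record or of any database tool.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGGTTCGG TACGTGTAGC CGTCGTGGGC GTCGGCAACT GTGCGGCGTC GCTCGTCCAG"
last_line = "GCCGACGAGT AG"

print(translate(first_line))  # MGSVRVAVVGVGNCAASLVQ
print(translate(last_line))   # ADE
```

The two fragments translate to the first 20 and last 3 residues of the listed protein sequence. Passing the full 1092 bp gene sequence to `gc_fraction` would give a value to compare against the reported GC content of 67%.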