Gene Cagg_1794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1794 
Symbol 
ID7267706 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2200684 
End bp2201769 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content57% 
IMG OID643566634 
ProductMyo-inositol-1-phosphate synthase 
Protein accessionYP_002463129 
Protein GI219848696 
COG category[I] Lipid transport and metabolism 
COG ID[COG1260] Myo-inositol-1-phosphate synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0509898 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.352645 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTAAGA AAATCCGTGT TGCGATTATT GGCGTTGGTA ATTGCGCCTC TTCATTGGTG 
CAAGGAGTTC ATTACTACCG CAATGCCCGC GATGGCGATG ACATCCCCGG CCTGATGCAC
GTCAATCTTG GCGGCTACCA CATTGGCGAC ATTGAGTTTT CGGCGGCGTT CGACATTGCC
GACACCAAGG TCGATCGCGA TCTAGCCGAG GCGATCTTCG CCGAACCAAA CAACACCTAC
CGCTTCGCCG ACGTGCCGAA GCTGGGCGTC CCCGTCTCGC GCGGGATGAC GCACGACGGC
ATCGGTAAGT ATCTGAGCAC GGTGATCCGT AAGTCGAAGC GCGATACGGA TGATATTGTG
CGCATTCTGC GCGACACCGG CACCGATGTG GTCGTCAATT TTCTCCCTGT CGGCAGTGAG
ATGGCGACGA AATGGTACGT CGAGCAGGTC CTCGATGCCG GTTGTGCCTT TATTAACTGC
ATTCCAGTCT TCATCGCCAG CCAAGAGTAT TGGCGGCGGC GGTTTGAAGA GAAGGGTCTC
CCGATTATCG GCGACGATAT CAAGAGCCAA GTCGGCGCGA CCATTACCCA TCGTGTGCTG
ACCACCCTCT TCAAAGAGCG CGGCGTGCGC CTTGACCGCA CCTACCAACT GAACTTTGGC
GGCAATACCG ATTTTCTCAA TATGCTTGAG CGCGAGCGGC TCGAGAGCAA GAAGATCAGC
AAGACGAATG CGGTCACGTC ACAGTTGGGC TACGAACTGC CGGCAGAGAA TGTTCACGTC
GGGCCAAGCG ACTACGTACC GTGGCTCGGC GACCGCAAGT GGTGCTATAT CCGTATGGAA
GGCACTACCT TCGGCGATGT ACCGCTTAAC CTCGAACTGA AATTGGAGGT GTGGGATTCG
CCGAACTCGG CGGGAGTGGT GATCGACGCC ATTCGCTGCG CTAAGTTAGC CCTCGACCGG
GGGATCGGTG GCGCACTCTA CGGACCGAGT AGTTACTTTA TGAAGACGCC ACCGCGCCAG
TTTACCGATT ACGAGGCGCG GGAACTCACC GAACGCTTTA TTCGCGGTGA GGCGGGAGCA
AAGTAA
 
Protein sequence
MSKKIRVAII GVGNCASSLV QGVHYYRNAR DGDDIPGLMH VNLGGYHIGD IEFSAAFDIA 
DTKVDRDLAE AIFAEPNNTY RFADVPKLGV PVSRGMTHDG IGKYLSTVIR KSKRDTDDIV
RILRDTGTDV VVNFLPVGSE MATKWYVEQV LDAGCAFINC IPVFIASQEY WRRRFEEKGL
PIIGDDIKSQ VGATITHRVL TTLFKERGVR LDRTYQLNFG GNTDFLNMLE RERLESKKIS
KTNAVTSQLG YELPAENVHV GPSDYVPWLG DRKWCYIRME GTTFGDVPLN LELKLEVWDS
PNSAGVVIDA IRCAKLALDR GIGGALYGPS SYFMKTPPRQ FTDYEARELT ERFIRGEAGA
K