Gene Anae109_4291 details


Gene Information

Locus tag: Anae109_4291
Symbol:
ID: 5376121
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 5031071
End bp: 5032375
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 71%
IMG OID: 640845819
Product: inositol-3-phosphate synthase
Protein accession: YP_001381453
Protein GI: 153007128
COG category: [I] Lipid transport and metabolism
COG ID: [COG1260] Myo-inositol-1-phosphate synthase
TIGRFAM ID:
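
The Start bp, End bp, Gene Length and Protein Length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(5031071, 5032375)
print(length, protein_length_aa(length))  # prints: 1305 434
```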


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.025414
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value: 0.498741
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGACCG GTATGGGGGT GAAGCCCGTG CGCAAGGGCG AGAAGCTCGC CGTCCTTCTG 
CCCGGGATGG GTGCCGTCGC GACGACCGCC GTCGCCGGCG CGATCGCGGT GCGGCGGAAG
CTCGCGTCCC CGGTCGGCTC GCTCACGCAG CTCGGCCACC TCATCGACGC CCGGGGCGAG
GCCGGTCCGC GCGTCGCCGA CGTACTCCCG CTCGCGAGCC TCGACGACCT CGTGTTCGGC
GGATGGGACC CGATCCCCGA CGACGCGTAC GCCGCCGCCC TCCGCGCGCG GGTGCTCGGC
CGCGAGCACC TCGACCCGAT CAAGGACGAG CTCGAGGCCA TCAAGCCCAT GCCCGCGGTG
TTCGACACCG AGTGGGTCCG CCGCCTCGAC GGGCCGAACG TGAAGAAGGG CACCCTCCGC
CAGAAGGCCG ACGCCCTCCA GGCGGACATC CGCGGCTTCC TGCAGAAGCA CGGCTGCGCG
CGCGGCGTGA TGGTGTGGAC CGGCTCCACT GAGGTCTACG CGCAGGTCGG GCCGGCGCAC
CAGTCGCTCA AGGCGTTCGA GGCCGCGCTC GACAAGGACG ATGCGTCCAT CGCGCCGTCG
ATGATCTACG CGTACGCGGC GATCAGGTGC GGCGTCCCGT TCCTGAACGG CGCGCCGAAC
CTGTCGCAGG ACACCCCGGC CCTCCAGGAG CTGGCCGAGC GCGAGGGCGT CGTCACCGGC
GGCAAGGACT TCAAGACCGG GCAGACGCTG ATGAAGACGA CGATCGCGCC CATGCTCCGG
GCGCGGCTCC TCGGCCTCGA CGGGTGGTTC TCGACGAACA TCCTCGGCAA CCGCGACGGC
GAGGTGCTGG ACGACGCCGC GAGCTTCAAG ACGAAGGAGG TCTCGAAGCT GGGCGTGCTC
GATCAGATCC TCGATCCGCG GCGCCTCCCC GAGCTCTACG GGGACATGGA TCACAAGGTC
ACGATCCACT ACTACCCGCC GCGCGGCGAC AACAAGGAAG GCTGGGACGC CATCGATCTC
GTCGGGTGGC TCGGCTACCC CATGCAGATC AAGGTGAACT TCCAGTGCCG CGACTCGATC
CTCGCGGCGC CGCTCGTGCT CGATCTCGCG CTCCTCGCCG ACCTCGGCCA GCGCGCGGGT
GAGCGCGGCG CGCAGGAGTG GCTGTCGTTC TTCTTCAAGA GCCCGGTCGT CAACCCCGGC
CACTCGCAGG TGCACGACCT CTTCCAGCAG CAGGCGAACC TCCACGCGCA GCTCCGCCGC
TACGCGGAGG CCGCCGCGGC CGCCACGCAG AGCGCGGTCG GCTAG
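
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any per-base statistic such as the GC content field can be recomputed. The following is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example runs on the first line of the sequence only.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as displayed (blocks of ten bases).
first_line = "ATGAAGACCG GTATGGGGGT GAAGCCCGTG CGCAAGGGCG AGAAGCTCGC CGTCCTTCTG"
print(f"GC fraction of the first 60 bases: {gc_fraction(clean_sequence(first_line)):.2f}")
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the GC content reported in the Gene Information section.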
 
Protein sequence
MKTGMGVKPV RKGEKLAVLL PGMGAVATTA VAGAIAVRRK LASPVGSLTQ LGHLIDARGE 
AGPRVADVLP LASLDDLVFG GWDPIPDDAY AAALRARVLG REHLDPIKDE LEAIKPMPAV
FDTEWVRRLD GPNVKKGTLR QKADALQADI RGFLQKHGCA RGVMVWTGST EVYAQVGPAH
QSLKAFEAAL DKDDASIAPS MIYAYAAIRC GVPFLNGAPN LSQDTPALQE LAEREGVVTG
GKDFKTGQTL MKTTIAPMLR ARLLGLDGWF STNILGNRDG EVLDDAASFK TKEVSKLGVL
DQILDPRRLP ELYGDMDHKV TIHYYPPRGD NKEGWDAIDL VGWLGYPMQI KVNFQCRDSI
LAAPLVLDLA LLADLGQRAG ERGAQEWLSF FFKSPVVNPG HSQVHDLFQQ QANLHAQLRR
YAEAAAAATQ SAVG
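
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the compact amino-acid string NCBI publishes for translation table 11 (its codon assignments match the standard code) and stops at the first stop codon. It is a simplified illustration: `translate_cds` is a hypothetical name, alternative start codons are not handled, and ambiguous bases raise a `KeyError`.

```python
from itertools import product

# NCBI ordering: bases T, C, A, G with the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(text: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()  # drop the ten-base block spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGAAGACCG GTATGGGGGT GAAGCCCGTG CGCAAGGGCG AGAAGCTCGC CGTCCTTCTG"
print(translate_cds(first_line))  # prints: MKTGMGVKPVRKGEKLAVLL
```

The output matches the first two ten-residue blocks of the protein sequence shown above.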