Gene Haur_3367 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3367 
Symbol 
ID5736909 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4246753 
End bp4247898 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content49% 
IMG OID641280514 
Productmyo-inositol-1-phosphate synthase 
Protein accessionYP_001546131 
Protein GI159899884 
COG category[I] Lipid transport and metabolism 
COG ID[COG1260] Myo-inositol-1-phosphate synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00885843 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTTTA ACTGGAGTAC AATTTTTCCT TGGATAGAGC TTTCGTTCAG CCTTTTAAGG 
AGTTGTGTCG TTTTGAGTAA GAAAATTCGC GTAGCCATTA TCGGCGTGGG CAACTGTGCC
TCGTCGCTCG TCCAAGGGAT TGAGTTTTAC AAAAATGCCA ACGAGGACGA TCACGTACCA
GGTTTGATGC ACGTCAACCT TGGCGGTTAT CACGTTCGCG ATATCGAATT TACCGCTGGC
TTCGATATTA ATATTACCAA GGTCGGCAAG GATTTATCTG AGGCGATTTT TGCTGAACCC
AACAACACCT ATAAATTTTC CGAAGTTCCA CATTTGAACG TGCCTGTCTA TCGCGGCATG
ACCCACGATG GTTTGGGCAA ATATCTTTCA CAAGTGATCG AAAAAGCGCC TGGCTCGACC
GCTGATATTG TGAAAATTCT GCGCGACACC AAAACCGATG TTGTGATCAG CTACTTGCCA
GTTGGCTCAG AAATGGCCAC CAAATGGTAT GTTGAGCAAA TTCTCGAAGC TGGTTGCGCC
TTCATCAACT GCGTACCCGT CTTTATTGCC AGCCAAGAAT ACTGGCGCAA ACGCTTCGAA
GAAAAGAACT TGCCCATTAT CGGCGACGAT ATTAAATCGC AAGTTGGCGC AACCATCGTC
CACCGCGTGC TGACCAATTT GTTCGAGCAA CGGGGCGTGC GCTTGGATCG CACCTATCAA
TTGAATTTCG GCGGCAACAC CGATTTCTAC AACATGCTTG AACGCGAACG CTTGGAATCG
AAAAAGATCT CCAAGACCAA CGCTGTTACC AGCCAATTGC CCTATGCCTT GCCCGCCGAT
AGCGTGCACG TTGGCCCAAG CGATTATGTG CCTTGGTTGA CCGACCGCAA GTGGTGTTAC
ATCCGCATGG AAGGCACAAC CTTTGGCAAT GTGCCATTGA ACGCCGAAGT TAAGCTCGAA
GTTTGGGATT CGCCTAACTC GGCTGGCGTG GTGATCGATG CAATTCGCTG TGCGAAGTTG
GCGCTTGATC GTGGGGTTGG TGGCGCATTG TACGCACCTT CATCTTACTT TATGAAAACC
CCACCTGAAC AATACACCGA TGACGAAGCC CATCGCCGCA CCGAAAGCTT TATCGCTGGC
GAATAG
 
Protein sequence
MDFNWSTIFP WIELSFSLLR SCVVLSKKIR VAIIGVGNCA SSLVQGIEFY KNANEDDHVP 
GLMHVNLGGY HVRDIEFTAG FDINITKVGK DLSEAIFAEP NNTYKFSEVP HLNVPVYRGM
THDGLGKYLS QVIEKAPGST ADIVKILRDT KTDVVISYLP VGSEMATKWY VEQILEAGCA
FINCVPVFIA SQEYWRKRFE EKNLPIIGDD IKSQVGATIV HRVLTNLFEQ RGVRLDRTYQ
LNFGGNTDFY NMLERERLES KKISKTNAVT SQLPYALPAD SVHVGPSDYV PWLTDRKWCY
IRMEGTTFGN VPLNAEVKLE VWDSPNSAGV VIDAIRCAKL ALDRGVGGAL YAPSSYFMKT
PPEQYTDDEA HRRTESFIAG E