Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3367 |
Symbol | |
ID | 5736909 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4246753 |
End bp | 4247898 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641280514 |
Product | myo-inositol-1-phosphate synthase |
Protein accession | YP_001546131 |
Protein GI | 159899884 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1260] Myo-inositol-1-phosphate synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00885843 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTTTA ACTGGAGTAC AATTTTTCCT TGGATAGAGC TTTCGTTCAG CCTTTTAAGG AGTTGTGTCG TTTTGAGTAA GAAAATTCGC GTAGCCATTA TCGGCGTGGG CAACTGTGCC TCGTCGCTCG TCCAAGGGAT TGAGTTTTAC AAAAATGCCA ACGAGGACGA TCACGTACCA GGTTTGATGC ACGTCAACCT TGGCGGTTAT CACGTTCGCG ATATCGAATT TACCGCTGGC TTCGATATTA ATATTACCAA GGTCGGCAAG GATTTATCTG AGGCGATTTT TGCTGAACCC AACAACACCT ATAAATTTTC CGAAGTTCCA CATTTGAACG TGCCTGTCTA TCGCGGCATG ACCCACGATG GTTTGGGCAA ATATCTTTCA CAAGTGATCG AAAAAGCGCC TGGCTCGACC GCTGATATTG TGAAAATTCT GCGCGACACC AAAACCGATG TTGTGATCAG CTACTTGCCA GTTGGCTCAG AAATGGCCAC CAAATGGTAT GTTGAGCAAA TTCTCGAAGC TGGTTGCGCC TTCATCAACT GCGTACCCGT CTTTATTGCC AGCCAAGAAT ACTGGCGCAA ACGCTTCGAA GAAAAGAACT TGCCCATTAT CGGCGACGAT ATTAAATCGC AAGTTGGCGC AACCATCGTC CACCGCGTGC TGACCAATTT GTTCGAGCAA CGGGGCGTGC GCTTGGATCG CACCTATCAA TTGAATTTCG GCGGCAACAC CGATTTCTAC AACATGCTTG AACGCGAACG CTTGGAATCG AAAAAGATCT CCAAGACCAA CGCTGTTACC AGCCAATTGC CCTATGCCTT GCCCGCCGAT AGCGTGCACG TTGGCCCAAG CGATTATGTG CCTTGGTTGA CCGACCGCAA GTGGTGTTAC ATCCGCATGG AAGGCACAAC CTTTGGCAAT GTGCCATTGA ACGCCGAAGT TAAGCTCGAA GTTTGGGATT CGCCTAACTC GGCTGGCGTG GTGATCGATG CAATTCGCTG TGCGAAGTTG GCGCTTGATC GTGGGGTTGG TGGCGCATTG TACGCACCTT CATCTTACTT TATGAAAACC CCACCTGAAC AATACACCGA TGACGAAGCC CATCGCCGCA CCGAAAGCTT TATCGCTGGC GAATAG
|
Protein sequence | MDFNWSTIFP WIELSFSLLR SCVVLSKKIR VAIIGVGNCA SSLVQGIEFY KNANEDDHVP GLMHVNLGGY HVRDIEFTAG FDINITKVGK DLSEAIFAEP NNTYKFSEVP HLNVPVYRGM THDGLGKYLS QVIEKAPGST ADIVKILRDT KTDVVISYLP VGSEMATKWY VEQILEAGCA FINCVPVFIA SQEYWRKRFE EKNLPIIGDD IKSQVGATIV HRVLTNLFEQ RGVRLDRTYQ LNFGGNTDFY NMLERERLES KKISKTNAVT SQLPYALPAD SVHVGPSDYV PWLTDRKWCY IRMEGTTFGN VPLNAEVKLE VWDSPNSAGV VIDAIRCAKL ALDRGVGGAL YAPSSYFMKT PPEQYTDDEA HRRTESFIAG E
|
| |