Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0851 |
Symbol | |
ID | 5732752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 960823 |
End bp | 962082 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277983 |
Product | glycine hydroxymethyltransferase |
Protein accession | YP_001543627 |
Protein GI | 159897380 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0112] Glycine/serine hydroxymethyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0407045 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTTAA TGGATGTATT GCGCCAACAA GATCCCGATT TGGCGCAAGC TATCGACTCT GAAGCTGAAC GCCAACGCCA TGGGATTGAG TTGATCGCCT CTGAAAACTA TGTCAGCAGT GCTGTATTAG CTGCCCAAGG TTCGGTGTTG ACCAATAAAT ATGCTGAGGG TTATCCGCGC AAACGCTACT ATGGCGGCTG CGAGTTTGTT GATGTCGCCG AAGATTTGGC GATCAAACGC GCCAAGCAAC TCTTTGGTGC TGAACATGTC AACGTGCAAC CCCACTCGGG AGCGCAAGCC AATATGGCGG TGCAACTGGC CACGCTTGAG CATGGCGATC GCGTCTTGGG CATGAGTTTG GCGCATGGCG GCCACTTAAC CCATGGCCAT CCACTCAACT TCTCGGGCAA ATCGTATGAG ATCCACGGCT ATGGCGTTGA TCGCGAAACC GAACAGATCG ATTATGAAGA AGTTGCCGAG ATTGCCCACA AAACCCAGCC TAAGATGATT ATCTGTGGAG CCAGTGCCTA TCCTCGCAAC ATCAATTTCG ATTTGCTGCG CACGATCGCC GATAATGTTG GGGCGATTTT GATGGCCGAT ATTGCCCACA TTGCAGGCCT TGTCGCCGCA GGTTTACACC CATCGCCGAT CGGCGTAGCT CAATATGTCA CCACCACCAC CCACAAAACC CTGCGCGGCC CGCGTGGCGG CATGATTATG TGCAGTGCTG AGCATGGCAA AAATATCGAT AAAACCGTAT TTCCAGGCGT GCAAGGTGGG CCATTGATGC ATGTAATCGC GGCGAAGGCC GTGGCATTTG GCGAAGCCTT GCAACCCGAA TATCGCGACT ATATGCGACG GGTCGTCGAA AATGCCAAGG TTTTGGCCGA AGCCTTGACC AACGAAGGCT TGCGAATCGT CAGCGGCGGC ACCGATAATC ACCTATTGCT GGTCGATTTG ACTCCAGTGA ATGCCACAGG CAAAGACGCA GAAAAAGCCC TTGACCACGC TGGTATCACC GTCAACAAAA ACGCCATTCC CTTCGATCCC AAGCCACCAA TGACAGCCAG CGGCTTGCGC TTTGGTACGC CTGCTGCGAC GACTCGCGGC TTTGGGCCAA ACGAAATGCG CCAAATTGCG GTTTGGGTCG GCCAAATCGT GCGTGAATTG GGCAACAAGA GCCTGCAAGC CAAAATTGCT GGCGAAGTTC GTGAATTGTG CGCGGCCTTC CCAGTACCAG GTCAACCAGA ATACGTCTAA
|
Protein sequence | MSLMDVLRQQ DPDLAQAIDS EAERQRHGIE LIASENYVSS AVLAAQGSVL TNKYAEGYPR KRYYGGCEFV DVAEDLAIKR AKQLFGAEHV NVQPHSGAQA NMAVQLATLE HGDRVLGMSL AHGGHLTHGH PLNFSGKSYE IHGYGVDRET EQIDYEEVAE IAHKTQPKMI ICGASAYPRN INFDLLRTIA DNVGAILMAD IAHIAGLVAA GLHPSPIGVA QYVTTTTHKT LRGPRGGMIM CSAEHGKNID KTVFPGVQGG PLMHVIAAKA VAFGEALQPE YRDYMRRVVE NAKVLAEALT NEGLRIVSGG TDNHLLLVDL TPVNATGKDA EKALDHAGIT VNKNAIPFDP KPPMTASGLR FGTPAATTRG FGPNEMRQIA VWVGQIVREL GNKSLQAKIA GEVRELCAAF PVPGQPEYV
|
| |