Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4145 |
Symbol | |
ID | 8744773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 413716 |
End bp | 414966 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 646514695 |
Product | Glycine hydroxymethyltransferase |
Protein accession | YP_003405642 |
Protein GI | 284167364 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0112] Glycine/serine hydroxymethyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.021739 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGACA GCAGTCAGCT CGCAGACACC GACGAGCGAC TTTACGAGGC GATTAGCGCC GAGGAACGGC GTCAGGAAGA CAACCTGGAG ATGATCGCAT CGGAGAATCA CGTCTCGAAG GCCGTCCTCG AGGCCCAGGG GAGCGTCCTC ACTAACAAGT ACGCGGAGGG ATATCCGGGT GCACGCTACT ACGGCGGGTG CGAACACGTC GACGAGGCCG AGAATCTCGC GATCGAACGG GCGAAGGAAC TGTTCGGCGG CGACCACGTG AACGTCCAGC CCCACAGCGG GACGCAGGCG AACATGGGCG TCTACTTCGC CATGCTCGAT CCCGGCGACA AGATCCTCTC GTTGAACCTC AACCACGGGG GGCACCTTTC GCACGGCCAT CACGTGAACT TCTCCGGCCA GCTCTACGAG GTCGAGCAGT ACGGCGTCGA CCCCGAGACG GGATACGTCG ATTACGACGA ACTGGAGCAG AAGGCGCTGG ACTTCGAACC GGACGTGATC GTCAGCGGGT CCTCGGCCTA CCCGCGTGAG TTCGAGTACG AACGGATCAG TTCCATCGCG GCGGACGTGG ACGCCTATCA CCTCGCGGAC ATCGCGCACG TCACGGGGCT CATCGCGGCC GACGTTCACG CGAACCCGGT CGGCGTCGCG GACTTCGTTA CGGGCAGTAC GCACAAGACG ATCCGGGCGG GTCGCGGGGG GATGATTATC ACGGGAGAAG AGTACGCCGA TGACATCGAC AGCGCCATCT TCCCCGGCAG CCAGGGCGGC CCGTTGATGC ACAACATCGC CGGCAAAGCG GCAGGGTTCG GAGAAGCCCT CCAACCGGAG TTCAGAGAGT ACGCCGAGCA AATAGCCGCG AACGCCAAAA CGCTCGCCGA CGCGTTCAGC GAGCGCGGGC TCTCGCTGGT CAGCGGCGGA ACCGACAAAC ACCTCGTCCT CATCGACCTC CGCGATTCAC ATCCGGACCT GACCGGCGAG GAGGCCGAAA ACGCGCTCGA AGCGGTCGGG ATCACCGTCA ACAAGAACAC GGTCCCCGGC GAGAGTCGCT CCCCGTTCGT GACCAGCGGG ATCCGCGTCG GGACGCCGGC CCTGACCACG CGCGGCTTCA CCGAGTCGGC GATGGAAGAG GTGGCGAACC TCATCGTCGA CGTCCTCGAC GAACCGGACG ACGGTGACGT CGCCCAGCGG GTCGAAGCCA GAGTCGACGA ACTGACCGAC GAGTTCCCCA TCTACGACTA A
|
Protein sequence | MSDSSQLADT DERLYEAISA EERRQEDNLE MIASENHVSK AVLEAQGSVL TNKYAEGYPG ARYYGGCEHV DEAENLAIER AKELFGGDHV NVQPHSGTQA NMGVYFAMLD PGDKILSLNL NHGGHLSHGH HVNFSGQLYE VEQYGVDPET GYVDYDELEQ KALDFEPDVI VSGSSAYPRE FEYERISSIA ADVDAYHLAD IAHVTGLIAA DVHANPVGVA DFVTGSTHKT IRAGRGGMII TGEEYADDID SAIFPGSQGG PLMHNIAGKA AGFGEALQPE FREYAEQIAA NAKTLADAFS ERGLSLVSGG TDKHLVLIDL RDSHPDLTGE EAENALEAVG ITVNKNTVPG ESRSPFVTSG IRVGTPALTT RGFTESAMEE VANLIVDVLD EPDDGDVAQR VEARVDELTD EFPIYD
|
| |