Gene Information |
Locus tag | Haur_0952 |
Symbol | |
ID | 5732838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1092070 |
End bp | 1093317 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641278084 |
Product | threonine dehydratase |
Protein accession | YP_001543728 |
Protein GI | 159897481 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1171] Threonine dehydratase |
TIGRFAM ID | [TIGR01127] threonine dehydratase, medium form |
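The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 1092070
END_BP = 1093317
GENE_LENGTH_BP = 1248
PROTEIN_LENGTH_AA = 415


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP))   # 1248
    print(protein_length(GENE_LENGTH_BP))  # 415
```

Under these assumptions both derived values agree with the Gene Length and Protein Length rows.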
| |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000323756 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAGAAA AGTCTATGCC AACGATTGAT GACATTTATG CCGCAGCCCA TGTTTTAGGC TCGATTATCA CCCAAACGCC GCTCTTGCCA GCTGAACAAC TGAGCCAAGA GCTTGGTGGC CAAATTATTT ACAAAGCCGA AAATACTCAA CGTGCTGGTT CGTTCAAAGT GCGTGGAGCG TATACCAAAA TCAATTCGCT CTCCGATGAA GAAAAAGCCC GTGGCGTAAT TACCCATTCG GCAGGCAACC ATGCCCAAGG CGTGGCCCTC GCTGCTCAAT TGAATGGCAT TAAAGCTACC GTCGTGATGC CGGAATTTGC CCCATTGGCC AAAATTACGT CGGCCCAACG TATGGGTGCA GAGGTTATTT TGCATGGAGC TTCGTTTGAT GATGCTGGGT CGTATGCCCG CGAACTGCAA GCCCAAACTG GCGCAACCTA TGTCCATGCC TTCGACGATC CCTTTACAAT TGCTGGCCAA GGCACGCTCG GCTTAGAAAT TGCCGACCAA CTGCCCGACC AAGGCGGCAC GGTCGTCGTA CCAATCGGCG GTGGCGGGAT GATGGCAGGG ATTGCCCTGG CCTTGCGTTC GCTGCGCCCC AATGTGCGAT TGATTGGGGT GCAGGCAGCA GGCTGCCCAT CGATGATCGC CTCGCAGCAA GCAGGCAAGC CAATTGCTGT GCCCCATGCC GCGACCATCT GTGATGGAAT TGCGGTCAAA CGCCCAGGCG AATTGACCTT GCCGATTATC AATCAATTAG TTGATGATAT TGTAACGGTT GATGACGATG CAGCAGCGCG GGGCTTAGTG CATATTTTGC AATATAGCCG CATGGTGGTC GAGGGAGCAG GAGCAGTTGG CGTGGCCGCC TTGCTTGAAG GCGCAATTCG CTTGCGACCA AATGAGCCAA CGTTGGTAGT GCTCAGCGGT GGCAATATCG ATGGCAACTT CCTTGCTCGA ATTATTGAGC AAGTTTTGGT CAAACAAGGT CGCTATTTAC GCGTTCGGAC TAGTGTTCCT GATCGTCCGG GAAATCTCGC TCCCTTAGTT AATGCGATTG CCCAGGCTGG GGCGAATGTG ATCGATATTA GCCATCGGCG GGCAGTGTGG CAACTCCCGC TTGATCGGGT GGGAATAGAG ATGATTCTCG AAGTGCGCGA TGAAGCGCAT GGCCAATCTA TCATTGACAT GTTGGAAACA CACGGCTATC ACATCGAGCG TTTTGGCCAG CGTGTGTGGC CGGTGTAA
|
Protein sequence | MVEKSMPTID DIYAAAHVLG SIITQTPLLP AEQLSQELGG QIIYKAENTQ RAGSFKVRGA YTKINSLSDE EKARGVITHS AGNHAQGVAL AAQLNGIKAT VVMPEFAPLA KITSAQRMGA EVILHGASFD DAGSYARELQ AQTGATYVHA FDDPFTIAGQ GTLGLEIADQ LPDQGGTVVV PIGGGGMMAG IALALRSLRP NVRLIGVQAA GCPSMIASQQ AGKPIAVPHA ATICDGIAVK RPGELTLPII NQLVDDIVTV DDDAAARGLV HILQYSRMVV EGAGAVGVAA LLEGAIRLRP NEPTLVVLSG GNIDGNFLAR IIEQVLVKQG RYLRVRTSVP DRPGNLAPLV NAIAQAGANV IDISHRRAVW QLPLDRVGIE MILEVRDEAH GQSIIDMLET HGYHIERFGQ RVWPV
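The relationship between the two sequence rows can be illustrated with a small translation sketch. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in the set of permitted start codons), so a standard codon table is used here. The helper name `translate` is hypothetical, and only the first 30 and last 18 nucleotides of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence.
    print(translate("ATGGTAGAAA AGTCTATGCC AACGATTGAT"))  # MVEKSMPTID
    # Last 18 bases of the gene sequence, ending in the TAA stop codon.
    print(translate("CGTGTGTGGC CGGTGTAA"))  # RVWPV
```

The two fragments reproduce the first ten and last five residues of the protein sequence row. Passing the full gene sequence to the same function would be the natural next step for checking the whole record.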