Gene Information |
Locus tag | Haur_1036 |
Symbol | |
ID | 5732940 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1182413 |
End bp | 1183447 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641278171 |
Product | threonine aldolase |
Protein accession | YP_001543812 |
Protein GI | 159897565 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2008] Threonine aldolase |
TIGRFAM ID | |
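The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the helper names `span_length` and `expected_protein_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 1182413
end_bp = 1183447
gene_length_bp = 1035
protein_length_aa = 344


def span_length(start: int, end: int) -> int:
    """Length of a closed, 1-based interval [start, end] (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1035
print(expected_protein_length(gene_length_bp))  # 344
```

Both values agree with the Gene Length and Protein Length fields of the record.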
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGATT TGCGCAGCGA TACCGTTACG AAGCCAAGTT TGGCCATGCG TGAGGCCATG CATCATGCCG AGGTTGGTGA CGATGTGTTT GGTGATGATC CAACGGTCAA TCAATTACAG CGCTATGCCG CCGATCTGGT TGGCAAAGAG GCGGCGATTT TTGTGCCAAG CGGCACGATG GGCAATTTGG CGGCAATTTT GGCCCATGCT GGGCGTGGTC AAGAACTGTT GCTCGGTGAT GAATCGCATA TTTATCATTA TGAGGCGGGT GGCGCTTCAG CCTTGGGTGG CTTGGTGTTT CATCCCATCC CAACCAATGC CCAAGGCGAG CTGGATTTAG CGGCTTTGAA TGCGGCAGTA CGCCCAGCCT ACGATGCCCA TGCGGCTCAG GCAGGCCTGG TGTGTCTAGA AAATAGTCAC AATCGCTGTG GTGGCACAGT GCTTTCGTTA GAATATTTGG CGCAAGTACA GCAATGGGCC AGCAGCCAAA ACTTGCCAGT GCATATGGAT GGCGCTCGGG TGTTCAATGC AGCGGTGGCG CTCGGCGTGC CAGCTAGCAC CATCACCAAG CATGTCGATA GCGTGCAATT TTGCTTATCC AAAGGCTTGG GTGCACCAAT TGGCTCGATT GTGGCTGGCT CAGGCGAGTT TATCAAAAAG GTGCATCGCT GGCGCAAAAT GCTTGGCGGC GGCATGCGCC AAGTTGGCGT AGTCGCAGCG GCGGGCATGA TTGCGCTCAA CGAAGGCCGC GAACGCTTGA TCGATGACCA TGTAAATGCC AAAATGCTGG CCGAAGCACT CAGTCAATTG CCTCAAATTG AGCTTGATTT GGCTTCGGTG CAAACCAATA TTATCGTTTT TGGCTTGCGT GATAGCACGT TTAGCCCTGA ACAATTGGTT GAACGTTTAC GCCAAGCAGG GGTTTTGATC GTGCCATTCA AAGGCCGTTT ACGGGCTGTA ACCCATGTTG ATGTTAATCG TGAGCAATGC ACCGAGGCCT CCAGCATCAT CGCCGAGGTG CTGCAAAGCG CTTAA
Protein sequence | MIDLRSDTVT KPSLAMREAM HHAEVGDDVF GDDPTVNQLQ RYAADLVGKE AAIFVPSGTM GNLAAILAHA GRGQELLLGD ESHIYHYEAG GASALGGLVF HPIPTNAQGE LDLAALNAAV RPAYDAHAAQ AGLVCLENSH NRCGGTVLSL EYLAQVQQWA SSQNLPVHMD GARVFNAAVA LGVPASTITK HVDSVQFCLS KGLGAPIGSI VAGSGEFIKK VHRWRKMLGG GMRQVGVVAA AGMIALNEGR ERLIDDHVNA KMLAEALSQL PQIELDLASV QTNIIVFGLR DSTFSPEQLV ERLRQAGVLI VPFKGRLRAV THVDVNREQC TEASSIIAEV LQSA
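The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the GC content field could be recomputed from the gene sequence; `clean_sequence` and `gc_percent` are hypothetical helper names, and the record's own rounding rule for the reported percentage is not stated, so none is assumed here.

```python
def clean_sequence(seq: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    bases = clean_sequence(seq)
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First three blocks of the gene sequence, as printed in the record.
fragment = "ATGATCGATT TGCGCAGCGA TACCGTTACG"
print(len(clean_sequence(fragment)))  # 30
print(gc_percent(fragment))
```

Passing the full Gene sequence value instead of `fragment` gives a figure that can be compared with the GC content field.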