Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1044 |
Symbol | |
ID | 5732948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1191596 |
End bp | 1192603 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641278179 |
Product | alpha/beta hydrolase fold |
Protein accession | YP_001543820 |
Protein GI | 159897573 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATATAA AACCACAAAA ACGCTCGCTG TTACGACGGA TTGGGCGCTG GCTCGCTTGG CTTGGCCTGT TGATTGTTGG TTTGCTGATC GGCGGTTGGG CGTTTCAGCG CTGGGCCAGC CAGCGTGATC GCCAACAATT TTTGCCAGCC GAGCAGCAAA TTATGCTCAA TGGCCATGCG ATGCGACTGA TTTGTATGGG CAGCGGCAGC CCAACGATCG TGCTCGAATC TGGCTTAGGC GATGGTGCTG ATGTTTGGGG CTTAGTCCAA CCTGCCTTAG CCGAGCAATA TCGGGTTTGT GCCTATGATC GAGTTGGCAT GGGCTGGAGT GCAGCGGTAG CCAACAAGGC TGATCGGGCT TCGATTGCCC AAACCTTGCA TGAACTGCTG AGCCAAGCCA ACGTATCAGC GCCATATGTA TTGGTTGGCC ATTCGGCTGG TGGTTTGTAT GTGCGCGAAT ATGCCCAGCG CTACCCTGAG CAAGTTATTG GTTTGGTGCT GGTCGATTCA TCGCACGAAC AACAACGCCA ACGTCAACCA CAGCTTGCTG AAGATCCATT TGCAATCATG CGTCAGTCGA TGCAAGCCTG TGATGCCTTA GCGCCATTCG GAATTATTCG GCTGACAAAG CTGTTTGAGC AATCGCAATC GACCTATGCC AAACTTCCAC AACCAGCTCA AGCCTCGATT GCAGCTAGCC AATACCAAAC GAGCACCTGT AGCGCGATGG ATGCGGCCTT GGCAGCAATC ACCCAAGATC TGAATCAAGC CCAAGCTCCG CAATCGCTAA AGGATCTCCC GTTGGTGGTA TTAACCCGTG GGATTGCTGA TAGCACCATG CCAGCGGAAT TTGAACAGAC GTGGGATAGC TTGCAACAAG AATTAGCTCA GCTTTCGAGC AACAGCCAAC ATCATATAGC TGAAACCAGT GGTCATTACA TTCATCTTGA TCAACCAGCG TTGGTGATCG AGGCAGTTGA ATGGGTAATC AGCCAACAAG CTAAATAG
|
Protein sequence | MNIKPQKRSL LRRIGRWLAW LGLLIVGLLI GGWAFQRWAS QRDRQQFLPA EQQIMLNGHA MRLICMGSGS PTIVLESGLG DGADVWGLVQ PALAEQYRVC AYDRVGMGWS AAVANKADRA SIAQTLHELL SQANVSAPYV LVGHSAGGLY VREYAQRYPE QVIGLVLVDS SHEQQRQRQP QLAEDPFAIM RQSMQACDAL APFGIIRLTK LFEQSQSTYA KLPQPAQASI AASQYQTSTC SAMDAALAAI TQDLNQAQAP QSLKDLPLVV LTRGIADSTM PAEFEQTWDS LQQELAQLSS NSQHHIAETS GHYIHLDQPA LVIEAVEWVI SQQAK
|
| |