Gene Information |
Locus tag | Haur_1643 |
Symbol | |
ID | 5733527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1900091 |
End bp | 1901485 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641278782 |
Product | beta-lactamase domain-containing protein |
Protein accession | YP_001544414 |
Protein GI | 159898167 |
COG category | [P] Inorganic ion transport and metabolism; [R] General function prediction only |
COG ID | [COG0491] Zn-dependent hydrolases, including glyoxylases; [COG0607] Rhodanese-related sulfurtransferase |
TIGRFAM ID | |
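
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the numbers are consistent with it) and that the coding sequence ends with a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information record above.
START_BP = 1900091
END_BP = 1901485
GENE_LENGTH_BP = 1395
PROTEIN_LENGTH_AA = 464


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of codons minus one, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP))   # 1395
    print(protein_length(GENE_LENGTH_BP))  # 464
```

Both results agree with the Gene Length and Protein Length fields. Because the strand is `-`, the coordinates refer to the replicon's forward strand while the gene is read from the opposite strand; the length arithmetic is unaffected.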
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTTTCC AACGGTTGTA TCACGATGGT TTAGCCCAAG CAAGTTATTT AGTCGGTTGC CAAAAAACTG GCGAAGCTTT GGTCATTGAT CCCCATCGTG ATAGTGCAGT GTATTTGAAT GCAGCGCAAG CGGCTGGTTT GCGGATTAGC GCAATCACCG AAACCCATAT TCACGCCGAT TTTGTCTCGG GCAGCCGCGA GCTGGCGCAA ATTACGGGCG CAAAACTGTA TCTTTCGGCA GCAGGCACAG CAGCTTGGCA ATATGCTTTT GCCAGCAGCA ACCCCAATGT GCAGTTGATC AACGATGGCG ATAGCTGGAT GGTTGGCAAT TTGAAGATCG AGGTCTTGCA TACCCCAGGC CATACGCCAG AACATGTGAT TTTTATGCTC ACCGACACGC CTGCTGGCGA TGAACCAATG GGTATTTTCA CAGGCGATTT GTTGTTTGCT GGCGATGTTG GCCGCCCTGA CTTGCTCGAA AAAGCCGCTG GGATTCAAGG CACCAGCGAG GCCGGAGCCA AAGATTTGTT TAAATCGTTG CAGCGCATCG CTTCCTTGCC CGATTTCTTG CAAGTTTGGC CAGGCCATGG TTCGGGCAGC GCTTGTGGCA AAGCACTGGG AGCTGTGCCG CAAAGTACCT TGGGCTACGA AAAGAAAGTT AATTGGGCCT TCAAACAGCA GAACGAGGCT GATTTTGTGG CCGCAGTGCT CGATGGTCAG CCAACGCCAC CGCCCTATTT CGCCGCGATG AAGCGCATCA ACCGCGATGG TCCTGAGTTG CGCCCGCAAC GTTTGCCAGC CCTCAGCCTT GCCGAAATTA AGCTTGATAA TCGGTTGGTT GTTGATTTAC GTTCGGCCAA TGATTTTGCT CAAGCCAGCA TTCCAGGCGC ACTCAGCCTG ACCAGCGGCG CGATGCTCAA TCGCTGGGCC GGTTGGTTTG TGCCGGTCGC AAGCAAGGTT GTGTTAATTG GCACTGCCGA AGCGGCCAAA ACTGCCCAAA CCGAGCTAAG CATGATCGGC ATCGACCAAA TCGAAGGCTA TATCACGCCC GATATGCTAA CCGAATGGCT CAAAACCAAC CCAAGCCAAA GCTATCAACG GGTGCCAGCC GCCAATTTGA GCGAACAACT TGACCAGGCG TTTGTGCTCG ATGTGCGCAC GCCCGAAGAA TATGCCAACG GTCACGGAGC TAAGGCGGTC AATATTCCAT TGAATGAATT ACCCAAGCGC CTGGCCGAAA TTCCCAATGA TCAACCGTTG ATCGTGCATT GTCAGGCGGG TGGGCGTTCG CCAATTGCCA TGAGTTTGCT CAAACCACAT TTCAGCCAAG CCATGGTTGA GATGAGCGAC GGCTGGAATG GTTGGTATCA ATTGAAAAAG GAGCGTCACG TATGA
|
Protein sequence | MFFQRLYHDG LAQASYLVGC QKTGEALVID PHRDSAVYLN AAQAAGLRIS AITETHIHAD FVSGSRELAQ ITGAKLYLSA AGTAAWQYAF ASSNPNVQLI NDGDSWMVGN LKIEVLHTPG HTPEHVIFML TDTPAGDEPM GIFTGDLLFA GDVGRPDLLE KAAGIQGTSE AGAKDLFKSL QRIASLPDFL QVWPGHGSGS ACGKALGAVP QSTLGYEKKV NWAFKQQNEA DFVAAVLDGQ PTPPPYFAAM KRINRDGPEL RPQRLPALSL AEIKLDNRLV VDLRSANDFA QASIPGALSL TSGAMLNRWA GWFVPVASKV VLIGTAEAAK TAQTELSMIG IDQIEGYITP DMLTEWLKTN PSQSYQRVPA ANLSEQLDQA FVLDVRTPEE YANGHGAKAV NIPLNELPKR LAEIPNDQPL IVHCQAGGRS PIAMSLLKPH FSQAMVEMSD GWNGWYQLKK ERHV
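
The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute GC content, and translate the coding sequence. It is illustrative only: the helper names are hypothetical, and the codon table is the standard one, which is assumed here to match translation table 11 for codon-to-amino-acid assignments (table 11 is understood to differ mainly in the set of permitted start codons). Only the first three blocks of the gene sequence are used in the example.

```python
# Standard codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between display blocks and uppercase."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence shown above.
    prefix = clean_sequence("ATGTTTTTCC AACGGTTGTA TCACGATGGT")
    print(translate(prefix))  # MFFQRLYHDG
    print(round(gc_percent(prefix), 1))
```

The translated prefix matches the first block of the protein sequence. Running `gc_percent` on the full cleaned gene sequence would give a value to compare with the GC content field in the record.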