Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4831 |
Symbol | |
ID | 5736676 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 6159692 |
End bp | 6160849 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641281996 |
Product | carboxylate-amine ligase |
Protein accession | YP_001547589 |
Protein GI | 159901342 |
COG category | [S] Function unknown |
COG ID | [COG2170] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02048] glutamate--cysteine ligase, cyanobacterial, putative [TIGR02050] uncharacterized enzyme |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.476623 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATATC GTGATCCGGG GCATCCGCAT TTTCCTTTTA CGATTGGGAT CGAAGAAGAA TATCAGATTA TCGATCCTGA AACCCGCGAA TTAAAATCGT ACATCACCCA AATTCTCGAC GAGGGTCAGT TGATTTTACG TGAACAGATG AAGCCCGAAA TGCATCAATC GATTGTCGAA GTGGGCACGC ATGTCTGTCG CACGGTCGAG GAAGCTCGCG CCGAAATTAT TCGTTTGCGT GGCGCAATTG GCAGTTTGGC CGCGAGCAAA GGCTTGCGAA TTGCCGCCGC TGGCACACAC CCATTTTCAT CGTGGCAAAA GCAGGATATT TATCCCCATG AGCGCTATTA TGGCGTGATT GAGGAGATGC AAGAGGCAGC GCGGCGCTTG TTGATTTTTG GCATGCATGT GCACATCGGC ATGCCCGATA ACGAAACCTG CATCGAAATT ATGAATGTGG CGCGGTACTT TCTGCCGCAT TTGTTGGCGC TTTCCACTTC ATCGCCCTTT TGGATGGGGC GCAAAACTGG CTTTCAATCG TATCGCTCGA TCATCTTTAC CAACTTCCCA CGTACTGGCA TTCCCGACAC CTTCCAATCG TATGCGGAAT TTGAGCAATA TATCAACATT TTGCTCAAAA CTCATAGCAT CGACAATGGC AAAAAAGTCT GGTGGGATGC GCGGCCACAC CCGATGTTTG GCACGCTAGA AGTGCGAATT TGCGATATTG CCACCAAAGT TGACGAAGCA ATTATGATTG CTGGCTTGGT TCAAGCGATT TTTGTCAAAA TTTACAGCTT GTTTCGCCAA AACCAAACTT TCCGCGTCTA TAGTCGTGCC TTGATCAACG AAAACAAATG GCGGGCGGCA CGTTATGGCA TGGGTGGCAA ACTGATTGAT TTTGGCCGCC GCGAAGAATT ATCTGCCCAT GATCTGATGG CCGAATTACG CGAATTTGTT GATGATGTGG TTGATGACCT TGGCTCACGG GCGGCGGTCG ATTATATTGA CCAAGTGCTC AAGCATGGCA CCAGCGCCGA ACGCCAACTG CGTACCTACG AAGAAACTGG CGATATCAAA GCAGTTGTCG ATCAGCTGAT TCGTGAAACC ATGGAAGGTG TGCCGCTCGA TCAGGCAACT CAAGTAGTCA GCGGATAA
|
Protein sequence | MEYRDPGHPH FPFTIGIEEE YQIIDPETRE LKSYITQILD EGQLILREQM KPEMHQSIVE VGTHVCRTVE EARAEIIRLR GAIGSLAASK GLRIAAAGTH PFSSWQKQDI YPHERYYGVI EEMQEAARRL LIFGMHVHIG MPDNETCIEI MNVARYFLPH LLALSTSSPF WMGRKTGFQS YRSIIFTNFP RTGIPDTFQS YAEFEQYINI LLKTHSIDNG KKVWWDARPH PMFGTLEVRI CDIATKVDEA IMIAGLVQAI FVKIYSLFRQ NQTFRVYSRA LINENKWRAA RYGMGGKLID FGRREELSAH DLMAELREFV DDVVDDLGSR AAVDYIDQVL KHGTSAERQL RTYEETGDIK AVVDQLIRET MEGVPLDQAT QVVSG
|
| |