Gene Information |
Locus tag | Haur_4174 |
Symbol | |
ID | 5736035 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5321859 |
End bp | 5322962 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281328 |
Product | hypothetical protein |
Protein accession | YP_001546934 |
Protein GI | 159900687 |
COG category | [S] Function unknown |
COG ID | [COG2718] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02877] sporulation protein YhbH |
|
|
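The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes 1-based inclusive coordinates and that the stop codon is counted in the gene length but not in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above (Haur_4174, minus strand).
length = gene_length_bp(5321859, 5322962)
residues = protein_length_aa(length)
print(length, residues)  # 1104 367
```

The strand does not affect these counts; it only determines which end of the interval holds the start codon.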
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACGAG TTGAACGTGA TCTCAATCGC TTCCGCCAAA TTGTGCGGGG CAAAATTAAG AAAGATCTGC GCAAATATAT TTCCCATGGT GAGATGATTG GCCGCCAAGG CAAACGCTTG GTCTCGATTC CTGTGCCCTC AATCGACTTA CCGCGCTTTC GCTTTGGCAA AAATCAAGGT GGGGTGGGGC AGGGCGAGGG CGATAACGGC GATCCGATCG GGCGTGGCGA TGGCTCGCAG CCAGGCGCGG GCGAGGCAGG CGAGGATTCA GGCCAACATT CGCTTGATGT CGAAATGACC ATCGAAGAAC TGGCGCAGTT GTTGGGCGAA GAACTGCAAT TGCCCAATAT TCAACCCAAA GGTAGCAAGA ATATTATCTC GCAAAAAGAT AAATATTCGG GCATTACCAA GGTTGGCCCA AACTCGCTGC GCCACTTCAA ACGCTCCTTC CGCGAAGCCC TCAAACGCCA AGTCGCCTCT GGTCAGTACG ATCCTGATGA TCCGGTGGTG GTGCCAATTC GCGATGACAT GCGCTTTCGC ACCTGGAAAA CCACCGAAAT GCCCGAAAGC AACGCCGTGA TCATCTACAG CATGGATGTT TCTGGCTCGA TGGGCCAGGA GCAAAAGGAG CTGGTACGGG TAACCGCATT TTGGATCGAA ACGTGGCTCC GTTCCCAATA CAAACATCTC GAAATGCGCT ACATCGTGCA CGACGCAGCG GCCAAAGAAG TGACCCAAGA CGTTTTCTAT CGCATTCGCG AGGGTGGCGG AACCAAAATT TCGTCGGCCT ATCGCTTGGT GTTGAAACTG ATCGAGGAGC ATTATTCGCC CGACGAGTGG AATATTTATC CCTTCCATTT TTCCGATGGC GATAACTGGG GCGGTGGCGA TACCCGTGAG TGTGTTGAGC TACTGCGTAA CCAAATTTTG CCCAAAGTCA ATCTTTTCTG TTACGGCCAA GTGCGTTCAT TGTATGGATC AGGGCGTTTT GCCCATGATC TGAACGAAAA TTTTCGAGCC AATGAAACCA TGGTCGTTTC CGAAATTGCC GATCGCGACG ATATTTACAA TGCAATCAAA GCCTTTTTAG GCAAAGGCAA ATAG
|
Protein sequence | MQRVERDLNR FRQIVRGKIK KDLRKYISHG EMIGRQGKRL VSIPVPSIDL PRFRFGKNQG GVGQGEGDNG DPIGRGDGSQ PGAGEAGEDS GQHSLDVEMT IEELAQLLGE ELQLPNIQPK GSKNIISQKD KYSGITKVGP NSLRHFKRSF REALKRQVAS GQYDPDDPVV VPIRDDMRFR TWKTTEMPES NAVIIYSMDV SGSMGQEQKE LVRVTAFWIE TWLRSQYKHL EMRYIVHDAA AKEVTQDVFY RIREGGGTKI SSAYRLVLKL IEEHYSPDEW NIYPFHFSDG DNWGGGDTRE CVELLRNQIL PKVNLFCYGQ VRSLYGSGRF AHDLNENFRA NETMVVSEIA DRDDIYNAIK AFLGKGK
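The relationship between the two sequences above can be sketched in a few lines of Python. The gene sequence is listed in coding orientation (it begins with ATG and ends with TAG), so no reverse complement is needed. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons it permits, so the sketch builds the standard codon map. The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and only short excerpts of the record are used for illustration.

```python
from itertools import product

# Codon map in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-base block spacing used in the record and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and last three blocks of the gene sequence above.
print(translate("ATGCAACGAG TTGAACGTGA TCTCAATCGC"))  # MQRVERDLNR
print(translate("GCCTTTTTAG GCAAAGGCAA ATAG"))        # AFLGKGK
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison with the protein sequence and GC content fields in the record.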