Gene Information |
Locus tag | Haur_0847 |
Symbol | |
ID | 5732748 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 956074 |
End bp | 957081 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641277979 |
Product | hypothetical protein |
Protein accession | YP_001543623 |
Protein GI | 159897376 |
COG category | [S] Function unknown |
COG ID | [COG4034] Uncharacterized protein conserved in archaea |
TIGRFAM ID | |
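The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS ending in one stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len // 3 - 1


# Values taken from the record for Haur_0847.
length = gene_length(956074, 957081)
print(length)                  # 1008
print(protein_length(length))  # 335
```

Both results agree with the Gene Length (1008 bp) and Protein Length (335 aa) rows of the record.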
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.724519 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCTTA ATTTGCCAAT TCTCGACCAA TTGCAAGGCC GTCGTTCCAT CTTGATTGCA GGCATGGGCG GCGGTTTTGA TATTTTCTGT GGTTTGCCGA TTTTTCTGCA ATTACGCCAA CTAGGCTTTA ATGTGCATTT GGCCAATTTT AGTTTTAGCG ATCTTGAAGC CGCTAGCAAT CTACAGTATC TTGGCGATAC GGTCTATGGT GTGCCGAGCC TGCCGCGTGA CCCTGAGTTC GTGCCCAGTC ATAGCCCAAT TGATGCTACC CAAGTGGCGT TGTTTCAGCG CTACAGTCCG TTTATCTATT TTCCTGAGTT GTATTTGGCC GAGTGGTTTC GCCAAACTTA TGGCGAGCCG CTGACGGTTT GGGCTTTTTT GCGCAGCGGG GTTAAGCCAC TCTTGGCTAG CTATCGCCGC TTGGTCCAGC ATCTCTCAAT CGATGCAATT ATCTTGATTG ATGGTGGGGT TGATGCGTTG GTGCAGGGCG ACGAGGCCGA AATTGGCACG TTGGTTGAAG ATACAATTTC CTTAGCTGCG GTCAATCAGT TGAACGAGGT TCCAACCAAA ATTGTGACCT GCATTGGCTT GGGCGCTGAA CGCGATATGC ATTATCCACA TATTTTTCAG AATATTGCCA GCTTGGCTGA ATCCGAAATT TTGCTAGGTA CGTGTAGCCT GATCAAACAG ATGCCAGTTT ATCAAGCCTA CGCTGCGGCG GTGCATTGGA CGCAATCGCG AATTTATCAA GACCCTAGCG TGATCAACAG CTCAATTATT TCAGCGGTCG AGGGCTATTT TGGCAATTAT CATCTGACCA GCAAAACTGA GGGCAGCAAG CTCGATATCA ATCCATTGAT GAGTATCTAT TGGTTTTTTG ATTTGGTGGG CTTGGCTGAG CAGCATTTGT TGCTAGAGCC GCTGTTGGAG ACTGAACATT TACGCGAGGT AGTGCAGTGG GTGCATACGC TGCGGGCAAA TTTGCCAATT CGCCCGCGCT TACGTTAA
|
Protein sequence | MQLNLPILDQ LQGRRSILIA GMGGGFDIFC GLPIFLQLRQ LGFNVHLANF SFSDLEAASN LQYLGDTVYG VPSLPRDPEF VPSHSPIDAT QVALFQRYSP FIYFPELYLA EWFRQTYGEP LTVWAFLRSG VKPLLASYRR LVQHLSIDAI ILIDGGVDAL VQGDEAEIGT LVEDTISLAA VNQLNEVPTK IVTCIGLGAE RDMHYPHIFQ NIASLAESEI LLGTCSLIKQ MPVYQAYAAA VHWTQSRIYQ DPSVINSSII SAVEGYFGNY HLTSKTEGSK LDINPLMSIY WFFDLVGLAE QHLLLEPLLE TEHLREVVQW VHTLRANLPI RPRLR
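The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned, its GC fraction computed, and its codons translated is given below. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and only the first 30 nucleotides of the gene sequence are used as sample input. The codon table is the standard one; translation table 11 uses the same codon-to-amino-acid assignments and differs only in its permitted start codons.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three blocks of the gene sequence field, copied from the record.
prefix = "ATGCAGCTTA ATTTGCCAAT TCTCGACCAA"
print(translate(prefix))    # MQLNLPILDQ
print(gc_fraction(prefix))  # 0.4
```

The translated prefix matches the first ten residues of the protein sequence field. Applying `gc_fraction` to the full gene sequence would be the way to compare against the reported GC content.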
|
| |