Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3701 |
Symbol | |
ID | 5735550 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4654885 |
End bp | 4656069 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641280853 |
Product | protein of unknown function Met10 |
Protein accession | YP_001546465 |
Protein GI | 159900218 |
COG category | [R] General function prediction only |
COG ID | [COG1092] Predicted SAM-dependent methyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGCGGT TTATTCTCAA ACCTGATCGT GATAAATCAA TTCGCCAACG CCATCCATGG GTTTTCAGCG GGGCGGTTGC CCGTCGTTCT GGCCCAGTGC ATCCTGGCGA AACCGTCGAT ATTATTAGCT CCGAGGGGGT GTGGTTGGCG CGGGGCACTG CCAGCACCCA CTCGCAAATG GTCGCCCGCC TGTGGACGTG GCAACGCGAC GAGGAGCTAG ACCCCAGTTT TATTCGCCAA CGGATCGAAC GAGCAATCAA AGGGCGCGAA CAGCTCCATA ATGATCCCCA AACCAATGCT TATCGCGTAA TTTTTAGCGA AAGCGATGGC TTGCCAGGCT TAATCGTCGA TCGTTTTGCT GATTGGCTAG TGGTGCAATT GCTGACTCAA GGTGCAGTTG CTCACACGGC GATTATCGTC GCAGCCTTGG CCGAATTGAT GCCAGTCCGC GGGATTTACG AGCGCTCTGA TGTTGAAATT CGTGCTCGTG AAGGCTTGGG TGAGGCTGAA GGCCTGCTTT GGGGCGAAGC ACCACCCGAT CAATTAACTA TTCGCGAAAA TGGCTACGAG TTTGTGCTCG ATTTGGCGGG CGGCCAAAAA ACTGGCTGGT ACGTTGATCA ACGCATCAAT CGCCAGCGCG TGGCCCAATA TGCCAAGGAT GCCGAAGTGC TGGGGGTTTT CTCCTATACT GGCGGGTTTG AGGTGCTAGC GGCGGGCGCT GGCGCTCAAT CAATCACAGC GGTTGATAGT TCGGCGGCGG CCTTGCGTGG CTTACATACC AACCTCAGCC AAAATGGGCT TAACACTCCA GTTACGGCGG TCGAAGGCGA TGCCTTCAAG ATTTTGCGCC AACTACGCGA AGAAGGCAAA CAGTACGATC TGATTGTGCT TGACCCACCC AAGTTTGCCC ACTCGCAGAG CCAAATTGAT CGAGCAACGC GTGGCTACAA AGATATCAAT ATGCAAGCCT TCCACTTGCT ACGACCAGGC GGCGTGTTGG CAACCTTCTC ATGTTCAGGG CTGATTAGCG CCGATCTCTT TCAAAAAGTG GTGTTTGGCG CAGCCTTGGA TGCTGGTCGC GATGCCCAAA TTATCGACAC ACTCACCCAA GGTGGTGATC ACCCTGTCTT GTTGACCTTC CCAGAGGCGG CTTATCTCAA GGGATTAATC TGCCGCGTCT GGTAG
|
Protein sequence | MLRFILKPDR DKSIRQRHPW VFSGAVARRS GPVHPGETVD IISSEGVWLA RGTASTHSQM VARLWTWQRD EELDPSFIRQ RIERAIKGRE QLHNDPQTNA YRVIFSESDG LPGLIVDRFA DWLVVQLLTQ GAVAHTAIIV AALAELMPVR GIYERSDVEI RAREGLGEAE GLLWGEAPPD QLTIRENGYE FVLDLAGGQK TGWYVDQRIN RQRVAQYAKD AEVLGVFSYT GGFEVLAAGA GAQSITAVDS SAAALRGLHT NLSQNGLNTP VTAVEGDAFK ILRQLREEGK QYDLIVLDPP KFAHSQSQID RATRGYKDIN MQAFHLLRPG GVLATFSCSG LISADLFQKV VFGAALDAGR DAQIIDTLTQ GGDHPVLLTF PEAAYLKGLI CRVW
|
| |