Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1188 |
Symbol | |
ID | 5733081 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1364922 |
End bp | 1366547 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641278328 |
Product | FHA domain-containing protein |
Protein accession | YP_001543964 |
Protein GI | 159897717 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [T] Signal transduction mechanisms |
COG ID | [COG1404] Subtilisin-like serine proteases [COG1716] FOG: FHA domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00110568 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCGTC GCATTGCACT TGTTTTTTGT TTGATCTTGC TGGTCGGCAC GCCAGTCTAT GCCCAAAATG CCTTAACTGG CCCACCCAAC GACCCCGACG CAAATAAACA ATGGGTCATT CGCCAACTTG GTTTAGCGTG TGCATGGGAG CATTTGACGG GTAACTCCGA TGTTACTGTG GCCGTGATCG ACTCTGGCGT TGATATGAAC CACCCCGATT TGGTCGATGT CTTGCGCACC GACGGCTTTG ATGCTGTCGA TGGCGATGAT GATCCATCGG ATGAGAATGG CCACGGAACC CATGTTTCAG GCATTATTGC CGCCACCATC AACAACAGTA AAGGAATTGC AGGGGTTGCC GGTGGGGGCA CGCGCATTTT GCCAATTCGG GTGATGGCTG CCGATGGTTC GGGCACGAAT CAAGATATTA TTGCGGGCAT TCGCTATGCC GTTAGCAAAA ATGTCCAGAT CATCAATATG AGCCTTGGCT CGATGTTACC GCTCGATAGC GAAGATATTG TTGAGGCAAT CAAAGAGGCC GATGCCGCTG GCGTACTGGT GATTATCGCC GCTGGTAATT CGTTTGTGCC CTTGCCCAAC TTCGCCTTTG GGGTTGAAGA ATTTGCCATG GTCGTGGCCG CAACTGATCC TGATGATCGC AAAACCGATT TTTCGAACTA TGGTAAGTGG ATTTCGGTTT CGGCACCAGG GGCAGGCATT TACTCAACAA TGCCCACCTA CGATGTGTAT ATGACCAGTC AGTTGCCTGC TGAAGAACGC TTCAACAAAA ACTACGATCA AATGAGCGGC ACATCGCAAG CTACGCCAGT GGTTGCTGGC TTGGCAGCCT TGCTATTTGC CCAACACCCC GATTGGAATG CCGACCAAGT ACGGGCTGAA ATCGAAAAAA CTGCTGATGA TATTTCAGCG CAAAATCCGA TTAAGCGCTA TGGCCCAATC ACCTATTTCG AGCCAAGCAA CCTTGGCAAA GGCCGTGTCA ATGCTTGCAA TGCCTTGGGT GGCCCAGTTA AAGGTAGTGC TGCAGCAGTT GGCGGCCCTG AAGGAAAATC CAATTTAATC ATGCTGCTTG GGTTTAGCGC GGTCATGTTG GTAGTTTTGG GCGGATTAAC CGTCTTTTTA ATTCGCCGTC GCAAGCACAA CGCCCGCCCT GCGGCGATTC CCGCTGGTGG TTTTGCCCCA CCGCCACCAA TTCATTATGA TCAAGCAATG GCCCAACAGC AACGCAACCT TGCCTCGCAA CCAATCGTCG GATCAACGCC GCCATTGAAC CAGAGTTATG CCCCACCCAG CGCGGCTCCG GCAGGCGGGC CAGCGTGGGG CAAATTAACG GTTGTCCGGG GAGCCGAATT GAACAAGTTC TATTTGCTGC GTGAAAATCA AATTTTTATT GGCCGTGAAG CCTCGCTTGC CGTGGCAATT AGCGGTGATA GCACCGTTTC ACGTCGCCAT ACGATCATCT ATCGTGATCC TCGTGGCATT GAAATTGAAG ATGCAGGCAG CAGTCATGGC ACCAAAATCA ACGGCATTCC AGTACAAGGC CGCCAACTGG TTCGACCAGG CGATGTGATC GAAGTTGGGC AGACCCATTT ACGTTTTGAG GGTTAA
|
Protein sequence | MSRRIALVFC LILLVGTPVY AQNALTGPPN DPDANKQWVI RQLGLACAWE HLTGNSDVTV AVIDSGVDMN HPDLVDVLRT DGFDAVDGDD DPSDENGHGT HVSGIIAATI NNSKGIAGVA GGGTRILPIR VMAADGSGTN QDIIAGIRYA VSKNVQIINM SLGSMLPLDS EDIVEAIKEA DAAGVLVIIA AGNSFVPLPN FAFGVEEFAM VVAATDPDDR KTDFSNYGKW ISVSAPGAGI YSTMPTYDVY MTSQLPAEER FNKNYDQMSG TSQATPVVAG LAALLFAQHP DWNADQVRAE IEKTADDISA QNPIKRYGPI TYFEPSNLGK GRVNACNALG GPVKGSAAAV GGPEGKSNLI MLLGFSAVML VVLGGLTVFL IRRRKHNARP AAIPAGGFAP PPPIHYDQAM AQQQRNLASQ PIVGSTPPLN QSYAPPSAAP AGGPAWGKLT VVRGAELNKF YLLRENQIFI GREASLAVAI SGDSTVSRRH TIIYRDPRGI EIEDAGSSHG TKINGIPVQG RQLVRPGDVI EVGQTHLRFE G
|
| |