Gene Information |
Locus tag | Haur_0900 |
Symbol | |
ID | 5732801 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1031727 |
End bp | 1033388 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641278032 |
Product | FHA domain-containing protein |
Protein accession | YP_001543676 |
Protein GI | 159897429 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG1716] FOG: FHA domain |
TIGRFAM ID | |
|
|
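The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption that the reported lengths are consistent with), and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical, chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Gene Information table for Haur_0900.
length = gene_length_bp(1031727, 1033388)
print(length)                     # 1662
print(protein_length_aa(length))  # 553
```

Both results agree with the Gene Length (1662 bp) and Protein Length (553 aa) fields.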
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTGTC CAAGCTGTGG ACATACGAAT GATGGTGGCA ACCGTTTTTG TGAATATTGT GGTGCACGAC TCGATCCGTC GATGAATCAA GAAGCAACCC AAATCGGGGC AGTTCCCAAT TTACATACGG ATCAAAGTTA CGATGCGCCA ACCATGTTTG TTCCAGCCGA TCAGGCTCCT CCTGCGCCAC CAGCCCAAGC GGAGCCAGCA GTCGCCAGTG CGCCAGCCGC AAGCTCGCTC AACTGCGCTG AATGTGGCTA TATCAATCAA CCAGGCGACC GCTATTGCGA TCAGTGTGGA GCCTCACTCG ATGCCGCTCC TGTCGCTGTT GTTCCAGTGA CTACGCCTGC GCCAGTTGCC AGTGAAGCGC TTACGCCGCC CGATGGTGTG CCAGCCGTAC CAGTCGCCGA GCCACATTTG GCTGAATTAA CCAATGTTGC GCCAGCTGAG GAGTTGCCAA CCGTGCCGAT TGATGATCAA CAGCCAGTTT CGACTCCTGT GGCCGAAGCC GAACCAGTTG CGCCAGCCAT TGAAGAACCA GTGGTTGCAC CTGTAGCCGA GGCTGAACCA GTGGTGCCTG CGGTTGAAGA ACCAATTTCA ACCCCTGTGG CCGAGGCCGA ACCAGTGGTT GCTCCGGTAG CTGAAGCCGC CCCAGCAGTT GACGAAGAAG TCGTAGCAGC CGAACGCACT GCTTTATCAG CAGCGGTGAT TGAGCAAGAA GATAATCTGG TGATGTTCGA GCAAATGGCT AATCGGTATG CTGGCCGCGC CTTGCCTGCG CATATCGCCG CTGGCATCGA AGAAACCAAG GCTAGTCTGG CCGAAGCACA AGCTAATTTG GCGGCGTTTG ATCAAGCCCA AGTTGTAGCC AAAGCTGCTG CTGAAGCCGC TGCTCAGGCT GCCGCCGACG CTGCGGCTGC GGCTGCTGCT CAGCCCGATC CAGAAGAGGT AGCTCGTTTA GAAGCAGCAA TCACCGAACA TCAAGATAAC TTGGCGATGT TTGAGCAGAT GTCAGCTCGT TATGCTGGCC GTGCTTTGCC AGCCCATATC GCTGCTGGCT TGGAAGAAAG CAAACATGCC TTGGCTGAGG CCGAAGCTGA ATTAGCCGCC TTGCTTGGTG GTGCACCCGC TGCACCTGCT GCTGCGCCAA TTCCTTCAGC TCCGGTCAAT ACCTATGATG CGCCAACGGT TGCTGCGGCT GCCCCAGCTG AACCAGCGCC AGCTCCGGTT GTGCCTGCCG AACCAGTGCC AGCTCCAGTA GTTGAGGCTG TGCCAGCATG GGCTGCCCCA ACGCCTGCCG AGCCAGTCGC TGCTCCGATT GCTCCACCAG CGCAAGTTAC CCCGCATTTG GTGGTTGCTG GCAGCCAAGT GGTGCTCAAC TTGCCAACCG ATAAGCAAAT TTATGTGATT GGCCGTGAAG ATCCGATTAG CGGGATTTAT CCTGAGGTCG ATTTGACCAA TCATGGCGGC GAAGGCGGTG GGGTCAGCCG TCAGCATGCC CGCTTGCACA ATACTGGCGG CAATTGGACC TTGGAAGATT TGAATAGCAC CAACTATTCC AAAGTCAACG GCCAAAAATT GGCTCCGCAT GCGCCAGCTC CGGTCAACCA TGGCGATCAA CTGCAATTTG GCAAAGTTGT TGTGACTTTG CATTTGCATT AA
|
Protein sequence | MKCPSCGHTN DGGNRFCEYC GARLDPSMNQ EATQIGAVPN LHTDQSYDAP TMFVPADQAP PAPPAQAEPA VASAPAASSL NCAECGYINQ PGDRYCDQCG ASLDAAPVAV VPVTTPAPVA SEALTPPDGV PAVPVAEPHL AELTNVAPAE ELPTVPIDDQ QPVSTPVAEA EPVAPAIEEP VVAPVAEAEP VVPAVEEPIS TPVAEAEPVV APVAEAAPAV DEEVVAAERT ALSAAVIEQE DNLVMFEQMA NRYAGRALPA HIAAGIEETK ASLAEAQANL AAFDQAQVVA KAAAEAAAQA AADAAAAAAA QPDPEEVARL EAAITEHQDN LAMFEQMSAR YAGRALPAHI AAGLEESKHA LAEAEAELAA LLGGAPAAPA AAPIPSAPVN TYDAPTVAAA APAEPAPAPV VPAEPVPAPV VEAVPAWAAP TPAEPVAAPI APPAQVTPHL VVAGSQVVLN LPTDKQIYVI GREDPISGIY PEVDLTNHGG EGGGVSRQHA RLHNTGGNWT LEDLNSTNYS KVNGQKLAPH APAPVNHGDQ LQFGKVVVTL HLH
|
| |
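The gene and protein sequences above are printed in space-separated blocks of 10, so they need cleaning before use. The following is a minimal sketch, not the database's own method: it strips the whitespace, computes GC content, and translates codons using the standard codon assignments, which translation table 11 shares for non-start codons. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and only short excerpts of the record are used as examples.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); amino acid assignments are
# those of the standard code, which table 11 uses for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last 12 bases of the gene sequence in this record.
print(translate("ATGAAGTGTC CAAGCTGTGG ACATACGAAT"))  # MKCPSCGHTN
print(translate("CATTTGCATT AA"))                     # HLH
```

The two translated excerpts match the start (MKCPSCGHTN) and end (HLH) of the protein sequence listed in the record. Passing the full gene sequence to `gc_percent` gives a value to compare against the reported GC content field.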