Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1229 |
Symbol | |
ID | 5733122 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1422206 |
End bp | 1424233 |
Gene Length | 2028 bp |
Protein Length | 675 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278369 |
Product | PKD domain-containing protein |
Protein accession | YP_001544005 |
Protein GI | 159897758 |
COG category | [S] Function unknown |
COG ID | [COG5276] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACACAC ACAATGGACG TTGGTCGCGT TCGTTAGCGG TCGGTTTTGC GGTGTTGGCG GCGTTTGTGT TAGCGCCAAT TGCTTCGATT TCGGCCCATA GCAATCATGA TCATAAGATT TATGTTGCAC CATTTGGCAA AGATGAGGGT GATTGTTCGA AGATTTGGCA GCCCTGTGCG ACGATCGATT ATGCGATTAG CCGCGGGACT GGCAAGGGTG ATGAAATTCG GGTTGCTGCT GGCGATTATA GCTACGATGC TCAGGCGGTT GGCTTATTGC TGAGCCGCCA GATTCCGGTG CACGGTGGTT TTAGCGCTCG TGATAATTTC ATGCAGCAGG ATACCAAGGC CAATCCAACC TATATCGCTG GGATTCCTAG CCGCTACCGC GAACGCTTGG CGGCAATTGG CTTTGCGCGG GTCGAAGATC GCGATGGTCG CAATACCCCA GAACTTGAGC GGATGCAAAT TCCAACTGAA TTGGAAACGC CAAGTGCTGC TCAGCCTTGT ACCAATGGTA TGGCTGGCAT CTTTGAGTGC AACGGCATCG ACTTCTTGGG CAGCGTGCCG CTCTCAAGTT TTCGGGTAAA TGGTAGTGGA GCCACCGATG GCAGCAACTT GTGGGGCCAC GTCGATCTCA ATACCGATCG CGAATATGCG ATTATTGGGG TTAACAATGG CACTGGCGTA GTTGATGTGA CTGATCCCAC CAATCCGGTG GTGATTGGCA CGGTGCCAGG TAATAATTCG CAATGGCGCG AAGTTAAGGT CTATCAATAT TTCAACCAAG CCCAAAATCG CTGGAACGCC TATGCCTATC TTTCAACCGA AAATCGTACC CAAGGGTTGC AGATCGTTGA TCTCAATCAC TTGAGTGATC CAACGCCTTC GGTTAGTTTG GCGGCAACCT ATACCGCCGA TTTTGGCTCA TCGCACACTG TCTATATTAA GAATGTTGAT TTCAGCACCA ACGTAGCGTT GCCTGGTAAA ACTGCGGCGT TGTATATGAA CGGGGTCGGC AAAAACGGCA GTCGCTTGAG TTCAGGTGTG TTCCGCGCCT TTGATATCAG CAATCCGTTG ACTCCGCAGA TTATTGATAG TGGCGTGCCG GATACCAATA TTAGCTACAC CCACGACGAT ACCAGCATGA TCATTACCGA TTCGCGGACT TCGGCATGTG CGCCTGGTCA CCAAGCCTCG TGCGAAATAA TGTTTGATTT CAGCGAATCG TCGGTCGAAA TTTGGGATAT TACCGATTCG GCTGCCCCAT TCCATATTAG TTCACGGCCT TATTCGGGCA GTGGCTACAC CCACTCAGGC TGGTATAGCG ACGATAAAAT GTACGTCTTT ATCCAAGATG AATTGGATGA ACAAAATTTT GGCCACAACA CTCGTGTGCG CACGATGGAT ATTCATGATC TGGATAATCC AACCATCAGC GCGACGTGGG ATGGCCCAAC TCGGGCAATT GACCATAACG GCTACACGAT TGGCAACAAA TATTATATGT CGAACTATTT GCGTGGTTTG ACGATTCTTG ACATAACCAA TCCAAATAAC ATCCAAGAAG CCGCTTTCTT TGATACCTAT CCTGGTAGCA ATTCGGCCAG TTTTGATGGA GCTTGGGGGG TTTATCCCTA CTTGCCAAGC GGCACCTTGA TGATCAGCGA TATTTCACGC GGCTTGATTT TGGTGCGCGA ACCAACTAGT ACCCCTGATC AAGCGATTGC TGGCTTGGAA GCAACCAACG ATGGCCCAAC CGTTGCTGGC GAGGCGACCA ATTTTGATGC AGCAATTCGG GCTGGTACAA ACGTGACCTA TGCGTGGGAT TTTGGTGATG GTACAGCTGT GGTAACTTCA ACCAACACCA CGATGAGTCA TACCTATCCA AACGTTGGTA ATTATACGGT TGAGTTAACT GCCAGCAATG GTACTAATTC GCAAACAGCT ACAACAACCG TGGTGGTGCA AGCGCCACCA CAAACCGAAT GGAAAATTTG GCTGCCCTTT GCGATTCGGG CGGAATAA
|
Protein sequence | MHTHNGRWSR SLAVGFAVLA AFVLAPIASI SAHSNHDHKI YVAPFGKDEG DCSKIWQPCA TIDYAISRGT GKGDEIRVAA GDYSYDAQAV GLLLSRQIPV HGGFSARDNF MQQDTKANPT YIAGIPSRYR ERLAAIGFAR VEDRDGRNTP ELERMQIPTE LETPSAAQPC TNGMAGIFEC NGIDFLGSVP LSSFRVNGSG ATDGSNLWGH VDLNTDREYA IIGVNNGTGV VDVTDPTNPV VIGTVPGNNS QWREVKVYQY FNQAQNRWNA YAYLSTENRT QGLQIVDLNH LSDPTPSVSL AATYTADFGS SHTVYIKNVD FSTNVALPGK TAALYMNGVG KNGSRLSSGV FRAFDISNPL TPQIIDSGVP DTNISYTHDD TSMIITDSRT SACAPGHQAS CEIMFDFSES SVEIWDITDS AAPFHISSRP YSGSGYTHSG WYSDDKMYVF IQDELDEQNF GHNTRVRTMD IHDLDNPTIS ATWDGPTRAI DHNGYTIGNK YYMSNYLRGL TILDITNPNN IQEAAFFDTY PGSNSASFDG AWGVYPYLPS GTLMISDISR GLILVREPTS TPDQAIAGLE ATNDGPTVAG EATNFDAAIR AGTNVTYAWD FGDGTAVVTS TNTTMSHTYP NVGNYTVELT ASNGTNSQTA TTTVVVQAPP QTEWKIWLPF AIRAE
|
| |