Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2584 |
Symbol | |
ID | 5734462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3317588 |
End bp | 3318574 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279724 |
Product | hypothetical protein |
Protein accession | YP_001545350 |
Protein GI | 159899103 |
COG category | [S] Function unknown |
COG ID | [COG4842] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000192335 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCAAG ACACCATTCA GGCCAAGTAC GATGAGTTGG ATCAAGTTGC CAGCCGTTTT GGCAAAGCCG CTGACACCAC CACAGCAATT TTCAAATCGC TTGAAAGTAT GGCCAATCGG CTTGAGGGTG GTGATTGGCA GGGCGATGCT CAAAAGGCAT TTAGTGCTGA ATTTCGGGGC GAGGTTCAAC CAGCCTTTAA CCGCCTTCAC ACTTTTTTCC AACAAGCACA AAGCACAACC TTAGAGATCA AAAAAGTCTT CCAACAAGCT GAAGAAGAAG CGGCTGCCCT TTTTCGTGGC GATGTCTTTG GCGGTGGTGC TAATGGTAAT GGTAGCGGCA ATGGGACTGC TGGCAATGGC AATGGCAGTG GCAGTGGCAG TGGCAGTGGC AATACTGGCG ATGGTGGCAC AGGTACAGGC GGCTCAGGCA GTGGCCCAAC CTTCAAGGGT AAAGTTAAAG TTCACACCTA TGATTACAAA ACCAAAACTG GTGAAACCAA GCCGCAACTC AAATTAGGCG TTAGCGGCGC ACTCCTTGAA GACAAAGGCG ATCTGATTAA GGTTGATGGG CCGATTCCTT ACACCCTCAA AGGCAAATAC CAAGTTGGTT ATGGCGAAGC TGGGATCGGC TTAGGGGTCA ATGGCGATAA GAAATTTACG ATTGGGCCAT ACGTCGAAGG AACTGTTGCC AAAGGTGAAT TAACCAATGT TTATGGCGAC AAAAACTTTG GCTATACCGA AACCCTCGAA GGCAAAGCCC TCTCAGTTGA AGGTTTTGCT GGCCTCAAAG ATGGCTCAGT TGGCGCAACC ATCGGCGGCA CGCTCGTCTC AGTTCAAGCC ACCAAAGGCT TGAATGTTGC TGGGGTCAAC GTTGGGGTTA CCGCCGAAGT TGGCCTCAAG GCCGAGCTTG GCTTTAGCAT CGGTAAAGAA ACCAAAATTA AATTGCCATT TGTCTCGTTT GGCTTCTCGT TTGGTGGAGC CAAATAA
|
Protein sequence | MAQDTIQAKY DELDQVASRF GKAADTTTAI FKSLESMANR LEGGDWQGDA QKAFSAEFRG EVQPAFNRLH TFFQQAQSTT LEIKKVFQQA EEEAAALFRG DVFGGGANGN GSGNGTAGNG NGSGSGSGSG NTGDGGTGTG GSGSGPTFKG KVKVHTYDYK TKTGETKPQL KLGVSGALLE DKGDLIKVDG PIPYTLKGKY QVGYGEAGIG LGVNGDKKFT IGPYVEGTVA KGELTNVYGD KNFGYTETLE GKALSVEGFA GLKDGSVGAT IGGTLVSVQA TKGLNVAGVN VGVTAEVGLK AELGFSIGKE TKIKLPFVSF GFSFGGAK
|
| |