Gene Information |
Locus tag | Haur_0856 |
Symbol | |
ID | 5732757 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 969042 |
End bp | 970256 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277988 |
Product | hypothetical protein |
Protein accession | YP_001543632 |
Protein GI | 159897385 |
COG category | [S] Function unknown |
COG ID | [COG4402] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.028401 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCGTT TCGCCACAGT CTTTGCGCTG TGTAGCATTT TAAGCTTTAC TTTGCCCTCA ATTGCTGCCG CTTGTGGAGC GTTGATTCCC GCCGACGATC AGATTCGCCA AGCTGGCCTC AACGTGATTT TTGCTGTCGA TGGCCAAGCC AACCAAACCA CCGCCTATAT TCAAATTAAC TATGTTGGCG ATCCCGCTGA GTTTGCTTGG ATCTTGCCAG TGCCCAGTAA TCCCAAGGTC GATGTGATTG AAGCTAGCAC GTTTGCTGAA TTACATACTC TGACCGATCC ACGGGTGACC TTCCCTTCGC CACCTGAATG TTTTCCGGCG ATCGTTGGAG CAGCACCCGA TGGCGCTGGT CAAGCACCAA ATGTGTTGCA ACAAGGTCAA GTCGGCCCCT ACGACTACAG CGTAATTGAG GATCGTGATC CAGCGGCGCT TGAAACATGG CTCAAAACCA ACGGCTATCA AACTCCAGCT GGTTTAGAAG CTGCCCTAAA ACCCTATACC GAAGCTGGCA TGCCGTTGAT CGCCATGAAA CTTAAGCCTG GCGCTGATAC CAACGATATT CAACCGGTCG CGATCAGCTT CACGGGCACA ACCCCAATGC TGCCGTTGCG CTTAGCAGCC TTGAGCAGCG AACCCAAAAC TCCAATTACG GTCTGGATTT TTGGTGAAGC TCAGGCAATT CCAACCAATA CTGAACGCTT TACCATGCGC GAAAACGATT TGGCGCTGAC GGCCTACGAT GGCTCGAATA ATTATAAAGA ATTGCGCAGC GGCGTGTTGG CAAGCGTAGC GGGCAAAGGC TTTTTGACCG AGTACGCCCA ACAAAGCAAA TTTCTCAACC CTCAAGATAG CCTGCTGAAG GAATTGACCA GCAAATATGC CTTCTTGACG CGGTTGTATG CTGAAATTTC ACCTGAAGAG ATGCTGTTTG ATCCAACCTT TGGTTATAGC CCTGATCTGC CACCACTTTC CAACAACATC GATTTGACCT ATCGCGCCGA GCCATACGAT TGCGAAACTG CGACCTTGCA CACGGTGCGT CGCCAGCAAT ACGATGCGGC CAAAGGCAAT CGCTCATCCG AAGAAGAAGC GACTGCCAAT TTGCGTCGTG GCCCCTTGCG CATCGGCACA CTGTTGGTGA TTGGCGTAAC CATGGGTTCG ATCTGGCTCT TGGCTCGTCG GCCCAAACGT CGCCCCAGCG CCTAG
|
Protein sequence | MRRFATVFAL CSILSFTLPS IAAACGALIP ADDQIRQAGL NVIFAVDGQA NQTTAYIQIN YVGDPAEFAW ILPVPSNPKV DVIEASTFAE LHTLTDPRVT FPSPPECFPA IVGAAPDGAG QAPNVLQQGQ VGPYDYSVIE DRDPAALETW LKTNGYQTPA GLEAALKPYT EAGMPLIAMK LKPGADTNDI QPVAISFTGT TPMLPLRLAA LSSEPKTPIT VWIFGEAQAI PTNTERFTMR ENDLALTAYD GSNNYKELRS GVLASVAGKG FLTEYAQQSK FLNPQDSLLK ELTSKYAFLT RLYAEISPEE MLFDPTFGYS PDLPPLSNNI DLTYRAEPYD CETATLHTVR RQQYDAAKGN RSSEEEATAN LRRGPLRIGT LLVIGVTMGS IWLLARRPKR RPSA
|
| |
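The coordinate and length fields in the Gene Information table can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table above.
start_bp = 969042
end_bp = 970256
reported_gene_length = 1215      # bp
reported_protein_length = 404    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

# Compare the computed values with the values reported in the record.
print(computed_gene_length == reported_gene_length)
print(computed_protein_length == reported_protein_length)
```

Under these assumptions both comparisons agree with the record: 970256 - 969042 + 1 gives 1215 bp, and 1215 / 3 - 1 gives 404 residues.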