Gene Information |
Locus tag | Haur_3566 |
Symbol | |
ID | 5735425 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4483060 |
End bp | 4484316 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280713 |
Product | hypothetical protein |
Protein accession | YP_001546330 |
Protein GI | 159900083 |
COG category | |
COG ID | |
TIGRFAM ID | |
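The coordinate and length fields above are related by simple arithmetic, which the sketch below checks. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes one terminal stop codon; both assumptions are consistent with the numbers in this record but are not stated by it. The function names `span_length` and `protein_length` are illustrative, not part of any database API.

```python
# Values copied from the record above.
START_BP = 4483060
END_BP = 4484316
GENE_LENGTH_BP = 1257
PROTEIN_LENGTH_AA = 418


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of codons in the gene, minus one assumed terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # 1257, matches Gene Length
    print(protein_length(GENE_LENGTH_BP))  # 418, matches Protein Length
```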
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.153773 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAAG CCATGAGCGA GTCTGTAAGC ATCCAAGAGG AATACCTGTG GAATAGCGCC CGTGAGGATC TGCCGCCCAA AAAACCATTG GTTGAAATTG GTTGGGTGTT AATTGGCACC TTTGAAAAAA TCGACGAAGG CGCAATTCGC ACAGCTCGCG AACGGACTCG TGAATATTTA CGCCAGCACT TCCCTGAATT TTCGTGGCGC GTGCCAATTG TGGTGCGCCC CGACCCTGAC CCTGGCGATG TGCTAGCCGA GCCAATTCGC TTAATTGATA TGGCGGTGAC CGAGCGTTTG ACTAAGGGTT GGGATTTTGC GATTGCTTTT ACCGCGACCG ATCTACATTC CCTGAGCAAA CAATATACGC TTGGCACGCC CGCCAGTGGC CTTGATGCAG CGGCGATCTC GACTGCGCGG CTCGACCCAA CTGCGCAGCG CTCGATGGTA CGCAAAGCTG AGCGCGAATT GGTGTTGAGC AATCGAATTT GGGCTTTATT TTTACATTTG CTGGGCGATT TGTTGGCCTT GCCGCATAAC GACGATCACA GTAGCCCAAT GAGCGTGCCG CGTGTGCCCG AAGATCTCGA TGCAGTCCAC GATTTTAGTA GCGAGGAGCA TGAGTGGATG CTCGAATCGC TGACCGAAGT TGCCGACTTA CGGGTTGAAG AAACTGGTCA ATCATCGCGC AATCCATTCT TTTTTACCTT GCGGGTGCTG GTGCGCTCAG GCCGCGAAAT TATCGCCGCA GTTGCCCGAA ACCGCCCATG GATCTTGGTT TGGCATCTGA CTAAATTTAC GGGCGCAGCC TTATCAACCT TGCTAATTCT GCTCTTTACC GCCGAAGTCT GGGATGTTGG CGTGCGCTTG CCATTGGCTG AGGTAGCTGG CGCGTTAGGA ATTACGATTG CCGGAACAAC CACCTTTGTC GTGACGCGCC AACGATTGCT CACCCGCCGC ACCCGCTTTC GGCTGACTGA GCAAGTTGTC GTAACCGATT TCGCGATTAT TCTGACGATT TTAGGCTCAA TGCTTTCGAT GGTGTTGCTG CTAATCAGCG CGGCAATGTT GGTGATTTGG GTGGTGCCAA ATGAGCTAGC CGAACGCTGG ACTGATTTTG CGATTCTGTC GCTTGAGCAT AATCTGGCCT TGGCTGGCAC GGTTACGATC ATTGGCCTGA TTGTAGGCGC ACTTGGGGCC AGCATCGAAG ACCAAACCAC CTTCCGTCAT GTGGCCTTCA TCGATGAAGA AACTTAA
|
Protein sequence | MSEAMSESVS IQEEYLWNSA REDLPPKKPL VEIGWVLIGT FEKIDEGAIR TARERTREYL RQHFPEFSWR VPIVVRPDPD PGDVLAEPIR LIDMAVTERL TKGWDFAIAF TATDLHSLSK QYTLGTPASG LDAAAISTAR LDPTAQRSMV RKAERELVLS NRIWALFLHL LGDLLALPHN DDHSSPMSVP RVPEDLDAVH DFSSEEHEWM LESLTEVADL RVEETGQSSR NPFFFTLRVL VRSGREIIAA VARNRPWILV WHLTKFTGAA LSTLLILLFT AEVWDVGVRL PLAEVAGALG ITIAGTTTFV VTRQRLLTRR TRFRLTEQVV VTDFAIILTI LGSMLSMVLL LISAAMLVIW VVPNELAERW TDFAILSLEH NLALAGTVTI IGLIVGALGA SIEDQTTFRH VAFIDEET
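The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a record could be cleaned, translated, and checked for GC content. It uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in the start codons it permits, and this gene starts with ATG). The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and the two test fragments are the first three and last three blocks of the gene sequence in this record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard genetic code, listed in TCAG codon order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, dropping one trailing stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


if __name__ == "__main__":
    # First three blocks of the gene sequence.
    print(translate("ATGAGCGAAG CCATGAGCGA GTCTGTAAGC"))  # MSEAMSESVS
    # Last three blocks of the gene sequence, which are in frame.
    print(translate("GTGGCCTTCA TCGATGAAGA AACTTAA"))     # VAFIDEET
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence would check the whole record; `gc_percent` on the full gene sequence can be compared with the GC content field in the same way.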