Gene Information |
Locus tag | Haur_2792 |
Symbol | |
ID | 5734673 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3549470 |
End bp | 3550660 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279935 |
Product | hypothetical protein |
Protein accession | YP_001545558 |
Protein GI | 159899311 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
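The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the CDS is unspliced and ends in a stop codon (the record lists the gene as not spliced). The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon,
    which is counted in the gene length but not in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
print(cds_lengths(3549470, 3550660))  # (1191, 396)
```

Under these assumptions the result agrees with the Gene Length (1191 bp) and Protein Length (396 aa) fields.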
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCGAAAGA AGCTTAACTA TTTATTCGGT TTAGGATTTG GCCTGGTACT TTTAATTGGC TGTGGCTCGG ATGCTGCGCT TGGTGAAGTG ACGCTCAGCG AGCCAATTAT CAACTTGAAT ACTGGCCCGC GCACCACCAC CATCAGCTAT CTGATTGGGC AGCCAACCAA GGTTTCGATT TGGCTTGAAA CCAGTAGTGG TGAGCGTTAT GCCTTGCGTC AAGCAGTTAC CCGTGAGCCA TCCAAGGATG CGTATCAAGT GCTGTTTGAT GGCAGTGTGC CAGTTGATGC TGATACGAAT CGTTTATTGC CGAGTGGTAG CTACACAGTG GTGATTGAGG CGGAAAATAC TGCGGCCCAA CGGCTGAATT TGCAAATCGA TCAAGCGCCA AGTGATAGCT TTGACGTGTA TGATCTGCGG GTTACGCCCA ATCCATTTTC GCCAAATGAT GATGCAGTTG AAGATTTTAC AACCTTCTCG TATCGTTTGC CGATTACCGC GACCGTCTCG TTGGATGTGA TTGATTCAGC CAATGCGACG CGCTATCCAA TTCTCAATCG TGAGTTACAA GGCTTGGGCG AACATAGTGA GGCTTGGTCG GGGCGGCCTG TGGTTGGCGG TATTTTGGCC GCAGGAACCT ATCAATATGA ATTGCGAGCT GATGATGGCC GTGGCAACCG CGTGACCAAG CGCGGCGATG TGACCCTCAG CAGTGCTGGG ATCGGGGCGC TCGATGTGCT CAGCGTTGAA ATTGGCCCTG AACAGATCTT ATTAAACGAT GTGATCACGG TCACGTTCAA GGTCAAAAAT AATAGCGATG TGGCCTTGCG CACCTTTGGG CCAGCTTCGG GCTACACCTA TAGCACCAAC CAAAGCTACT CGTCGATTGA AAACGAGCAG TATGCTAACC TTGGCGGCGG CTTATGGCGG GTGGCAATCG ATTGGGATGG CAATGGTGGC TCGGGTTTTC GCTACCCATT CCGTTGGGCG ATTTCGCCAC GCAGCCCTGA CCAATGGTAC GACCCCAACA AATTTGATTA CCTGTATCCA GGCGAAGAAG CCACGATTAT TGGGCGCGTG CAAATCAAAC AACGCGAAGA TCGCATGACC TTTTATGCTG GCGTGGCCCA TGAAGGTGTT GATTACCCCA CCAATCGACT CAAACCAACC CTAATTCAAG TTTCATTCTA A
|
Protein sequence | MRKKLNYLFG LGFGLVLLIG CGSDAALGEV TLSEPIINLN TGPRTTTISY LIGQPTKVSI WLETSSGERY ALRQAVTREP SKDAYQVLFD GSVPVDADTN RLLPSGSYTV VIEAENTAAQ RLNLQIDQAP SDSFDVYDLR VTPNPFSPND DAVEDFTTFS YRLPITATVS LDVIDSANAT RYPILNRELQ GLGEHSEAWS GRPVVGGILA AGTYQYELRA DDGRGNRVTK RGDVTLSSAG IGALDVLSVE IGPEQILLND VITVTFKVKN NSDVALRTFG PASGYTYSTN QSYSSIENEQ YANLGGGLWR VAIDWDGNGG SGFRYPFRWA ISPRSPDQWY DPNKFDYLYP GEEATIIGRV QIKQREDRMT FYAGVAHEGV DYPTNRLKPT LIQVSF
|
| |