Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1275 |
Symbol | |
ID | 5733168 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1484680 |
End bp | 1485990 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641278415 |
Product | hypothetical protein |
Protein accession | YP_001544051 |
Protein GI | 159897804 |
COG category | [S] Function unknown |
COG ID | [COG4842] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0739014 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGCAC CAATCGTTCA AGCTGATTTT GAAGTAATGG ATCAAGTTGC CCAGCGCCTT AGCAAAAATG CTGAGAGTGT TACGGCAATG CAAAATACGC TCAAACAAAC TATTGAAGAC TTACGCTCAA CATGGTTGGG TGATGCTGCG GTTGCATTTC AAAAAGAAAT GCAGGCTGAT ATTTTGCCTG CAGTGCAACG GCTGATCAAC GCTTTCCAAA CCGCCCAAAG TACAACCTTA GAAATTAAGA AAGTTTTACA AGAAGCCGAG CAAGAGGCCG CCAATCTATT TAAAGGCGAT CCAACTGGTG GCTCAGCCAG CACCCAAAGC GCTAGTTCAA GCGCTGGTGG CGGTGGAGCC TCAAGTGCTG GTGGCGATAC TGCGGCAGCC AGTGCTAGCC CAAGTAATGT TGGCGTAATG GCTGGTGGCA CCAGCAGCGC TTCTGCCAGT GGCAGTGGCG GCGGCGGTGG TGGCGGTGGC GGTGCAGCCT CGGCTCAAGC AAGTGGCGGC GGCGGTGGTG GTGGTGGCGG AGCAGCCTCA GCTCAACCAA CTGGCCAACA ACCCAAGGCT ACCAGTGGTG GCGGCGGTGG CGGTGGTGGT GGTGGCGGAA CAGCCTCAGC CCAACCAACT GGTGGTAATG CAGCCGCTGC AGGTAATGCA AGCCTTGGTA AACTCTCTGA AAAATACGAA ACTGGTGGCC GTGGCCCAGG CACGGTTTCA TCAGGCAAAG GCGACCTTGG CGGCGCTTCA TATGGCTCAT ACCAAATGAC CAGCCAAACT GCCATCAAAA AAGATGGCAA AATTGTCTTT GTTAATGGCG GACGGGTCGC TGAATTTTTA CGCAACCCTG CTGGTGCACA ATATGCTGAA GAATTTAAGG GCTTGAAACC AGGGAGCGCT GAATTTACCG CCAAGTGGAA GCAAATTGCT GCTCGCGATC CACAAGGCTT TGCTGCTGCC CAACATCAGT ATATTGAAAA CACCCACTAT CAGCCTCAAG TCAACAAGCT CAAGGCAGCT GGCTTTGATG TAAACAACTA TTCGCCAGCA ATGCGCGATG TTGTTTGGTC AACCTCAGTT CAACATGGCC CAGGCGCAAG CGTGATCACC AATGCGCTCC GTGGCAAAGA TCTTAGCCAA ATGAGCGAAT CGCAAATTAT CAATGCGATT TACACCGAAC GTAGCAAAAC CCTCGATAAT GGGCGCTTGG CCTATTTCAA AAATACCAGC GATGCTGGGG TTATTCAAGG CTTGAAAAAC CGCTTCGTCA ACGAACGCAA AGATGCCTTG AACATGTCGG CAAATCACTA G
|
Protein sequence | MAAPIVQADF EVMDQVAQRL SKNAESVTAM QNTLKQTIED LRSTWLGDAA VAFQKEMQAD ILPAVQRLIN AFQTAQSTTL EIKKVLQEAE QEAANLFKGD PTGGSASTQS ASSSAGGGGA SSAGGDTAAA SASPSNVGVM AGGTSSASAS GSGGGGGGGG GAASAQASGG GGGGGGGAAS AQPTGQQPKA TSGGGGGGGG GGGTASAQPT GGNAAAAGNA SLGKLSEKYE TGGRGPGTVS SGKGDLGGAS YGSYQMTSQT AIKKDGKIVF VNGGRVAEFL RNPAGAQYAE EFKGLKPGSA EFTAKWKQIA ARDPQGFAAA QHQYIENTHY QPQVNKLKAA GFDVNNYSPA MRDVVWSTSV QHGPGASVIT NALRGKDLSQ MSESQIINAI YTERSKTLDN GRLAYFKNTS DAGVIQGLKN RFVNERKDAL NMSANH
|
| |