Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4234 |
Symbol | |
ID | 5736088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5396099 |
End bp | 5398249 |
Gene Length | 2151 bp |
Protein Length | 716 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641281389 |
Product | hypothetical protein |
Protein accession | YP_001546994 |
Protein GI | 159900747 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCGGA AACGGATCGT CGGAAACAGA ATCGGATTTT ATTTAATCTT AATTGGTATC CTGCTCAATT TGGGCACAGT CAAAGCCCAA CAGCCCCAAT CTCAGCCAAA GCGGCTGCTC AGCTCAGCGA CAGACAGGGC TTTTGAACCA ACAACCATCC AATCCTCGCC CTTAACCTTA ACCAACAGCA GCGTGATTAG TGGCACCTGG GATACGAGCT TTGGCAATAG TTTAGCGACC GCTGGCACGA TCACCAAGTT GCATCTTGAT GCCAATAATC GTTTGTATGT CGCTGGTAAA TTCAATTACC TCAATGGGCA ATTGTTTAAT GGGCTAGCCG CATGGAATGG TACAACTTGG CAAGGCTATG GCTTGCAAGC CAATGATGTG GATACGATCA ATGCCCTCGA TAGCTACCAA AACCAATTGG TGGTGGCTGG GCGTTTTCGC CAATTGGCGA ACCAAACCAC GAATGGCCTT GCCCGTTGGG ATGGTAGCCA ATTTCAAGCC TTTGGCACTG GCGTTAGTGG CATCAACGAT TCATATCGCA ACAATATTAT TTCGCAGGTC TATGATATTG AAGTCCTGAG TGATACGCTG TATCTCGGTG GCAACTTTGA TACATTTAAT GGCCTCGATG CCCACGGCAC AGCCTACTGG CGACCGACCG GCTTTGGCGA GTTTGGCGAT TTTAATGGCG TAATCAGCGA TCTCGATGCA ATTCCCAATA CGCTTTTTGC TGGTGGCAGT GTAAGCCGCG TCAATCAAAC CTCAATTCAA GGTTTAGGCT TATGGAATGG CACAACCTGG CAAAACTCCA GCTTGCCAGG CTCAAATGCC GCCCGTGATC AAATAGGGCG AATTGGGACG ACGCTCTATA CCTGGGCCAA TAACTATGAT CCGCGACAAA CCATGATGTA TCGCTGGAAT GCTAATCAGT GGCAAGCGTT TGGCAGCTGG ATCGATGAAT TTGTGCTTGG TGTGGCGCAA AGCAACGATT TATATGCCTA TACTATCAGC GATCTATGGC GCTTAGTTGG CAACCAATGG CAATTGGTCA ATCTGCCAAT GACGATCAAC TCTATTAATA CAGTTGTTGG CAATGACAAT GGGTTGTATC TCGCTGGTAA TTTTGAAGTT GATGGCAACG CCGTGCAATT GGTGCATTGG AATGGTACAA CGCTCACGCC ACTTGCCAGC ACCAGCGTTT ATCCACGGCT TGATGAAATA ACCGGCGTTC AAGGTCTGCC AATTATCAAA CCAAATTGGC CGAATGTGCT CAAGCAGTGG GATGGTCAAA CCTGGCAAAC AAGCTACGCA TTACAGGGGA TGGCTAGCCT TTTCGCCACC AGTGATGATC GTGCCTATTT GGCTTACCCT TATCCATATC AAACCGCTGG TTCATTGAGC AGTTCGCTGT GGCAATTTGA TTCGACTGGC GCGTTGACTG CTCCCTTGCA ATTGATTGGT CAGGCAACCA CCTGGACGCT TTCAGGAACC GAGCTTTTGG CTAATGGCAT AACGATGATT AACAGCCAAC CAATCAGTGG GGCCGTGGCG CTGAATGGCA CTGAATGGCG TGAATTAGGG CTAAACACCA ATTTTCGCAA GATCTATTAT GCCAATAATC GTATCTATGC GCTGATGATC ACCCAAAACC ATGATTGGTG GGAGATTGAT ATTGACACTT GGAATGGCAC TGCTTGGGTT GATTTTACTG GGTTGTTTGT ACAAAGCAAT GGCTCGATTC CACAATTTGT AACCTGGCGC AATCAGCTGT ATTTTACCAA CGGCAATCAG TTGCATCGGC TCGAAAATAC TGGCAATAGC ATGGTATTTG AGTTTGATGG TGCGCCATTG CAACTTGCCA ACATCAACGA TCGCTATTTA TATGTTGGCG GGGCATTTAG CAAAGTCGGT GAGCAACAGC TTGGTAGCAT TGTGCGCTGG AATGGCAATA GCTGGCAAGC TCCAAGCCAA ACGCCCAACG GCACAGTTGA ACACATGGCA ATTACCGAAA ACTATCTCTA TTTGAGTGGT AATTTTACCC ATATTGGCGC AAACCCAAGT TTAGGTGTGG CCAGCTTTCA GCTTGCCGCT GAACCTGAAC CACTACCCTA TCAATTGTTC ATTCCCCAAG TTGTCAAATA A
|
Protein sequence | MARKRIVGNR IGFYLILIGI LLNLGTVKAQ QPQSQPKRLL SSATDRAFEP TTIQSSPLTL TNSSVISGTW DTSFGNSLAT AGTITKLHLD ANNRLYVAGK FNYLNGQLFN GLAAWNGTTW QGYGLQANDV DTINALDSYQ NQLVVAGRFR QLANQTTNGL ARWDGSQFQA FGTGVSGIND SYRNNIISQV YDIEVLSDTL YLGGNFDTFN GLDAHGTAYW RPTGFGEFGD FNGVISDLDA IPNTLFAGGS VSRVNQTSIQ GLGLWNGTTW QNSSLPGSNA ARDQIGRIGT TLYTWANNYD PRQTMMYRWN ANQWQAFGSW IDEFVLGVAQ SNDLYAYTIS DLWRLVGNQW QLVNLPMTIN SINTVVGNDN GLYLAGNFEV DGNAVQLVHW NGTTLTPLAS TSVYPRLDEI TGVQGLPIIK PNWPNVLKQW DGQTWQTSYA LQGMASLFAT SDDRAYLAYP YPYQTAGSLS SSLWQFDSTG ALTAPLQLIG QATTWTLSGT ELLANGITMI NSQPISGAVA LNGTEWRELG LNTNFRKIYY ANNRIYALMI TQNHDWWEID IDTWNGTAWV DFTGLFVQSN GSIPQFVTWR NQLYFTNGNQ LHRLENTGNS MVFEFDGAPL QLANINDRYL YVGGAFSKVG EQQLGSIVRW NGNSWQAPSQ TPNGTVEHMA ITENYLYLSG NFTHIGANPS LGVASFQLAA EPEPLPYQLF IPQVVK
|
| |