Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2258 |
Symbol | |
ID | 5734145 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2888574 |
End bp | 2890166 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279399 |
Product | hypothetical protein |
Protein accession | YP_001545026 |
Protein GI | 159898779 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTACGGC GTAAAAAAAT TCTGTTAAGT GGATTGATCA TGCTTGGTTT AGGGCTAGTT TGGGATAGCC GCCCAGTTGC TGCCGATAGC GTGGTGGTAG GCACTGGCAC ACCTGCTAGT TGTAACGAGG CGGCCTTTGA TGCTGGCTTG GCTCAGCTTT TTCCAGGCGA ACAAGCCCCT GGCGGCACGC TGACCTTTAA TTGTGGGCCG AATCCGCATA CGATTGTTTT AACCAGCCAA AAATTTTTGC ACGATGGCTC GGTGATTGAT GGTGGCGGCA AAATTACGCT CTCTGGCGGC AATACCACGC GAATTTTTTG GGTCAGCCAA CAAGCGCGGG TCGAAATTCA GCGCATCATC CTGACGAATG GCAATGCTCA GCATAGCGGG GCGATTTTTG CCGAGCCAAA TTGGAGCGGC GAGTTTACCA ATTTGGCCCT CAACCAAGTA ACAATTAAGC ATAGCCAAGC GACAACTTTT GGTGGTGGGA TTGGGGCGCA ACATACCAAC CTGAGCCTGA TTGATAGCCT GATTGAAGCC AATCGATCGA GTGGCAGCGG CGGTGGCGTA AGTTTTAATA CTGGCAATCT AACAATTCGT AATAGCAAAT TTAGCACTAA CAAAGCCGAG ACCGAAGGCG CTGGGCTTGA GGCATGGACG GCAAATTTAG ATATTAGCCA AACGAATTTT GAGCTAAACG AATTACAAGG CCGTGAACAT ACCGATTTTG GTGGTGGCCT CGTGATTCAG CAAAGCTACG GGGTGTTTCA AGGCGGACGT ATCTGGAGTA ATATTGCTGG TCAAGGCGGT GGCATTTATC TGCGCGGAGG CAGCACAATT GAATTTAACG CCAGCAAAAT TGCCGATAAT GTGGCTTTTA ACGAAGGCGC TGGGGGCTAT ATTACCGCTA ATTCAAGCTT GACCTTCAAA AATGGCATTA TTGATCAGAA TTTGTCGGCA GTAGCTGGTG GCGGCATTGC CAACCAGGGT GGCCTGTTGA TCGAACGCTC GACCCTAACT AATAATGAAG CTTTACAGAG CGATGGCGGA GCACTTGATA ATACAGGCGT GGCCGTCTTG AGGTATAGCA CTCTCGCCAA AAACAAGGCT CAGCGTGGGG CTGGGCTGAA TAATCGCCCG AATAGCACAC TGGTAATCGA TCGTGTGACC ATGACCGCCA ATAATGCAGA GATCGCTGGC GGTGGAATCT ATCATGCTGG CACTCTATTT ACGGTCGATA ACAGCATTCT TACCTATAAT AATGCCCCGG CAGGAGCGCA ATGTGGCTAT GCCAGCCAAG TGCCGAGCAT GAGCTTTAGT ATGTGGAGTG ATGGAAGTTG TGGCACGCAA ACCATCGATG GTAATAAACC ATTTACTGGG CCAAGCTTGC GACCATTGGG CTGGTATGGT GGCCCAACCC CAACCTACTT GCCACTCAGC CATAGCGCAT CGACCGATGC TGGCTCATGC TCAAGCTCTG CTGTGACCGA TCAACGTGGT TTGGCAGGCT TTGTGGGTGC GGCCTGCGAT ATGGGCGCGG TCGAAAGTGG CTCGTTATGG TATCAAGTGG CGTTGCCAAT GACGATTAAG TAA
|
Protein sequence | MLRRKKILLS GLIMLGLGLV WDSRPVAADS VVVGTGTPAS CNEAAFDAGL AQLFPGEQAP GGTLTFNCGP NPHTIVLTSQ KFLHDGSVID GGGKITLSGG NTTRIFWVSQ QARVEIQRII LTNGNAQHSG AIFAEPNWSG EFTNLALNQV TIKHSQATTF GGGIGAQHTN LSLIDSLIEA NRSSGSGGGV SFNTGNLTIR NSKFSTNKAE TEGAGLEAWT ANLDISQTNF ELNELQGREH TDFGGGLVIQ QSYGVFQGGR IWSNIAGQGG GIYLRGGSTI EFNASKIADN VAFNEGAGGY ITANSSLTFK NGIIDQNLSA VAGGGIANQG GLLIERSTLT NNEALQSDGG ALDNTGVAVL RYSTLAKNKA QRGAGLNNRP NSTLVIDRVT MTANNAEIAG GGIYHAGTLF TVDNSILTYN NAPAGAQCGY ASQVPSMSFS MWSDGSCGTQ TIDGNKPFTG PSLRPLGWYG GPTPTYLPLS HSASTDAGSC SSSAVTDQRG LAGFVGAACD MGAVESGSLW YQVALPMTIK
|
| |