Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2145 |
Symbol | |
ID | 5734047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2702012 |
End bp | 2703577 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279286 |
Product | hypothetical protein |
Protein accession | YP_001544913 |
Protein GI | 159898666 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTTAA TTGTTGATAT TTTAATTGAT GATCTGCGGG CCTTGATTCG CGACCTTGGC CAAAATGGTG GCCTGATGAG TCCATCAGTC TATGACACAT CCCAAGCGTT GCGGCTCTAT CCAACGCCCA GCGAAGAGCA TGTTTGGCCA GCAGTCAACT GGCTGATTAG CCAACAACAG TCGGATGGTG GCTGGGGTAA TCCATCGATG CCGCTCAGTC GAGCAGTGCC AACCCTTGCG GCAATTTTAG CCCTACGCCG CCACTGTCAG CGTCGTTCAA CCTTCGATGG ATTGCTTGAG GCCAAACGTT TTCTGCGCCG CCAACTTGAA TATTGGGAGA AACCGCTGCC CGATAACCTG CCAGTTGGAA TGGAACTCCT GCTCCCTTAC ATGCTTGAAG AGGCCTATCG CGAAGAGCAT CAAGATGATA TCGACGATGT GCCAATTAAG CTCCGCCTTA ATATCCCCCT TGCACCCTAT CGCGAGTTGA TCGCACTCGG CGAACATAAA CGCTCATTGA TTCAACAAAA AAAGCCCCGT GCAGGCACAG CCCCAGTTTA TTCATGGGAA GCATGGGCTA GTCATGCTGA TCCAGAATTG ATCGATGGCT CAGGTGGCAT TGGTCATAGC CCCGCTGCCA CCGCTGCATG GTTATTTGCT GCCAATCATA ATCCAAATCT ACGCAACGAA ATCGCTGGCG CAGAAAACTA CCTGCGCCAA GCGTCGCTGG CCACCTCGGA AAGTGCTCCA TGCATTATGC CAACCGCATG GCCAATCCCA CGCTTCGAAC AATCGTTCAG CCTATATGCT TTGGTCACTG GCGGAATTCT CGATTTCCCC AGTATTCAGG ATGTGCTCAA ACCACAAATT GCCGATTTAC ATCAAGCACT CAAGCCGCGC GGGATTGGCT TTAGCGACGA TTTTATGCCC GATGGCGATG ATACCGCCGC CGCCGTGGCA GTATTAATCG CAGCAGGCTA TCCAGTCGAT CTCGCGATAT TAAATCAATT TGAGCGTGAA CCCTACTTCG TAGCCTATCA TGGTGAGTTA CAGCCTTCAA TTTCGCTGAC AGCTCGCGCC GTGCACGCAC TCGATTTAGC CGGAGTTGAT ATTTCACGCT GGTGGAAGAT TTTTATTGAT GCTCAAAAAC TTGATGGCAG TTGGAGCGGC GATAAATGGA ATACTTCGTG GCTCTACACG ACCTGCCATG TACTGATTGC GCTCAAAAAC TCGCCCTACA AAACCGCCAT GAAAGAAGCC GTCGCTGCAT TACAAGTCCA TCAACATCCT GATGGTGGCT GGGGCATCAT CAATCGATCA ACCACGGTTG AAACGGCCTA TGCGGTGCTG GCATTGCAAA ACTTACGTGA AGCTGGCCTC TTAGATGACG ACGACATCCA CATGCTCCAA CGTGGTTATA ATTGGCTCTG TATTCATTAT CGTCCATTTC GGATGAAAGA GTATCAATGT TGGCTCAATA AAGAAATTTA TTGTCCCCAA CGGATTGATC GCGCTTATGA GTTAAGTGCC ATGTTAGCAG TCACTCTAGG AGAATTAAAA TTATGA
|
Protein sequence | MSLIVDILID DLRALIRDLG QNGGLMSPSV YDTSQALRLY PTPSEEHVWP AVNWLISQQQ SDGGWGNPSM PLSRAVPTLA AILALRRHCQ RRSTFDGLLE AKRFLRRQLE YWEKPLPDNL PVGMELLLPY MLEEAYREEH QDDIDDVPIK LRLNIPLAPY RELIALGEHK RSLIQQKKPR AGTAPVYSWE AWASHADPEL IDGSGGIGHS PAATAAWLFA ANHNPNLRNE IAGAENYLRQ ASLATSESAP CIMPTAWPIP RFEQSFSLYA LVTGGILDFP SIQDVLKPQI ADLHQALKPR GIGFSDDFMP DGDDTAAAVA VLIAAGYPVD LAILNQFERE PYFVAYHGEL QPSISLTARA VHALDLAGVD ISRWWKIFID AQKLDGSWSG DKWNTSWLYT TCHVLIALKN SPYKTAMKEA VAALQVHQHP DGGWGIINRS TTVETAYAVL ALQNLREAGL LDDDDIHMLQ RGYNWLCIHY RPFRMKEYQC WLNKEIYCPQ RIDRAYELSA MLAVTLGELK L
|
| |