Gene Information |
Locus tag | Haur_3241 |
Symbol | |
ID | 5735109 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4101921 |
End bp | 4104290 |
Gene Length | 2370 bp |
Protein Length | 789 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280387 |
Product | hypothetical protein |
Protein accession | YP_001546006 |
Protein GI | 159899759 |
COG category | [S] Function unknown |
COG ID | [COG1432] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00288] conserved hypothetical protein TIGR00288 |
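The length fields in this record follow from its coordinates. With inclusive, 1-based start and end positions, the gene spans End - Start + 1 bases, and an unspliced CDS encodes one residue per codon except the stop codon. A minimal Python sketch of that arithmetic, assuming inclusive coordinates and a single terminal stop codon (the function names are illustrative and not part of the database):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(4101921, 4104290)
print(length, protein_length(length))  # prints: 2370 789
```

Both results agree with the Gene Length and Protein Length fields listed in the record.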
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000142022 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGACC GTACTCGATC ACTCGCCAGA ATTAAATGGC ATGCCACAAC CAAAATGACT GCCCAGGAGC ACATTGGAAT CCCTATGAAC AAACCCAAGC AGGATGTTGC GGTTTTTATC GACTTCGAGA ACATTTATGT TTCAGTTCGA GAGAAGTTCG ACGCTACCCC TAATTTCGAA GCTTTGATGG AACGTTGCGA GGACTATGGC CGTGTTGTAG TTGCTCGCGC TTACGCCGAT TGGTATCGTT ACCCACGCAT TACCAGCGCT TTGTTTGCCA ATAATATCGA GCCAATGTAT GTGCCAACCT ACTATTATGA TAAGGACGAA GGTCGCATGG GCCGTCCGAT CAAAAATAGT GTGGATATGC ACATGTGTAT CGATGCAATG CGCACGCTCT ACACACGCAC CAACATTGGT TCGTATATCT TTATCACTGG CGACCGCGAT TTTATCGCCT TGGTCAACTG TGTGCGCCAA GAAGGTAAAG ATGTGATTAT CGTTGGCATT GGCGGGGCTG CCTCCAGCCA TCTCGCTCAA AGTGCCGATG AATTTTTGTT CTACGAGCAA ATTGTCGATA TTCGCCCGAT GGGTGGCCGT CGCAACGAGC GCCACGAAAA AACCTTCGAG CGGGTCGAGC GCCCTGAACG GGTTGAACGT AACGATCGTA GCGAACGCCC AGAACGCCCT GAACGCAGCG AACGCCCAGA ACGGTCAGAA CGTAGCGAAC GCAATGGCCG CGATGATCGC CGCCGCGAAC GCGAACGCCC TGAGAAGAAT GCCGAGCCAA CTTTCGAGCG GCTTGATTCA CGTTCGGAGA AAACAGCCGA ACCACGGGTT GAAAAGCCCG CTGAAAAAGC TCCCGAAATT GTGGTACGGC CAGCACCAAG CACCGCGCCA GTCGTGGCAA CCACCCCCAG CAATCCCGAT GCGGCAATTT ATGATAAGTT GGTTGAGGCC TTGCACTTAG CGCGGAAACG TGGCTATGTG ACTTCATTTG GCTCGCTCAA GGTGCTGATG AAAGAACTGC TGCCCAACTT CAAAGAAAGT CGCTATCGCG ATGCTCAAGG CAAGCCGTTC ACCAAATTTA CCGATTTTGT GCGCGAAGCC GAACGACTTG GCAAGATTCA AATCTTCACC AGCGGCACAG TTAACGAAGT CTTCTTGCCC GATGAAGATC CTTACAAGCT TTCGCAATTT GCTGAAGATT TGCCCTCAGT TGAGGCCGAA CGCGAACTCG AACAAGAGCT TGAGTTGCAA GCCGAATCGA TCGCCCCAAT CACGATTGAA ACAATTGGCA TCGCCGATGA GCCGATGGAA TTGGCCGATG GCGAAGAAGT TGCCCCAAGC AGCGAACGTG GCGACCGCAA TGGCCGTCGT CGCCGCCGCC GTGGCAGCAG CCGCAGCGAA CAACCAGCCG TCGATGTGCT GCCCGAAGAG TTGAGCGAAA CAATCCTTGT ACCAGCACCA ACCGAAGGTT TCAGCTTTGG CGAACGTGAA TGGCAGTTGT TCTATCAGGT GATGGGCGTG TATAGCGAGC CAGTGCCATT CGCCGATATC TTTAACGATC TTCGCGAAAT GCGCAACACG GCTGAGCTTG ATCTGACCAA TAATGCCTTG AAGGAATTGA TCAAACAAGC GATCAACGAA GGCAAAGTCC AGCGCTCAAA TCGTGGAGCC AAAGCCCACT ATCGCTTGGT GCTCAACCCT GAGCAGTTTG ATGGTGCAAT AACAACCTTC GAGCCAAGCG ATCTTGAAGC CTTTGATGAT GGCCCAAGCT TGCCTGACCT GGTTGATGAA 
GAACCACGTT TGTTGCCAGC CATGATCGGT GGCCAAACCA TCGCGATTGA CGATGATTAC ACTGCGCCAA TCGATGAAGT GGCCTTGCCC GAAAGCTTTG AAGAACAGCC GTTGCCTGAA AGTGATGCCG ATTATTCGTT TGCCGAGCCA ATCGAGTATG AAGCACCTGT GGCAACCTCA ACCGCAGCGA TCAAGCCGCG TAAGCCCAGC CGCCGCCGCA AATCGATTGT GCCGCTGAGC GTGGCCGAAG CGATTGCTGC CAGCGTCGCC AATGATCCAG CGCCTGAGCC AGCGGTTGAA GTGGAAGCTG TAGCTGAGCC TGTCGCCGAA ATCGTCAGCG AACCAATCGC AGAACCTGTG GTTGAAGCAG TGACTGAGGA AAAGCCCAAA AAGCGGGGTG GCCGCCGGAA AGCTGCCGAG CCAGTTGAAG TCGTCGAAGA AAAACCTAAG AAAACCACTC GCCGCAAAAA GGCCGAGCCA GTCGCTGAAG CTCCAGCTCC AGTCGAGCCA GCCAAAAAAC AACCACGTCG CCGTAAAAAG GCCGCCAGCG AAACTGGAGA AGAAGCATGA
|
Protein sequence | MGDRTRSLAR IKWHATTKMT AQEHIGIPMN KPKQDVAVFI DFENIYVSVR EKFDATPNFE ALMERCEDYG RVVVARAYAD WYRYPRITSA LFANNIEPMY VPTYYYDKDE GRMGRPIKNS VDMHMCIDAM RTLYTRTNIG SYIFITGDRD FIALVNCVRQ EGKDVIIVGI GGAASSHLAQ SADEFLFYEQ IVDIRPMGGR RNERHEKTFE RVERPERVER NDRSERPERP ERSERPERSE RSERNGRDDR RRERERPEKN AEPTFERLDS RSEKTAEPRV EKPAEKAPEI VVRPAPSTAP VVATTPSNPD AAIYDKLVEA LHLARKRGYV TSFGSLKVLM KELLPNFKES RYRDAQGKPF TKFTDFVREA ERLGKIQIFT SGTVNEVFLP DEDPYKLSQF AEDLPSVEAE RELEQELELQ AESIAPITIE TIGIADEPME LADGEEVAPS SERGDRNGRR RRRRGSSRSE QPAVDVLPEE LSETILVPAP TEGFSFGERE WQLFYQVMGV YSEPVPFADI FNDLREMRNT AELDLTNNAL KELIKQAINE GKVQRSNRGA KAHYRLVLNP EQFDGAITTF EPSDLEAFDD GPSLPDLVDE EPRLLPAMIG GQTIAIDDDY TAPIDEVALP ESFEEQPLPE SDADYSFAEP IEYEAPVATS TAAIKPRKPS RRRKSIVPLS VAEAIAASVA NDPAPEPAVE VEAVAEPVAE IVSEPIAEPV VEAVTEEKPK KRGGRRKAAE PVEVVEEKPK KTTRRKKAEP VAEAPAPVEP AKKQPRRRKK AASETGEEA
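The sequences above are displayed in space-separated blocks of ten characters, so they need to be stripped of whitespace before any computation. The sketch below shows one way to do that in Python, then to compute a GC fraction and translate the gene sequence. It assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TGA) and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and alternative start codons permitted by table 11 are not handled.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order (same as table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
head = "ATGGGTGACC GTACTCGATC ACTCGCCAGA"
print(translate(head))  # prints: MGDRTRSLAR
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be checked the same way.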
|
| |