Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2616 |
Symbol | |
ID | 5734494 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3354901 |
End bp | 3359121 |
Gene Length | 4221 bp |
Protein Length | 1406 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279756 |
Product | hypothetical protein |
Protein accession | YP_001545382 |
Protein GI | 159899135 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0177484 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAACATC GTTACACTCG TTTAGCAACG GTCTTAGCGT TGTTAACCAT GTTAGCGGGC AATTTGAGCG CTAACAATGT CACACGGGCG GCAGAACCCC TCAAGCCCGT TGTGGAAACG GCTACTGATT CGGTTCAAGT GAACCAAGCC AGTGCTAATC GTTCAAGCAA GCCCACAATT GCCCCATTGT ATGGTGATTT CCCCAAGTCT GGCAAAGTTA AAGTTATTGT TCAATTCCAC GAAGCTCCGT TGGCAACCTA TGCTGGCGAA GTTAAGGGTT TGAAGGCTAC CGCCAATGCG CAAACTGGCC GCGCCAAGCT TGATGTCAAG GCGGCTGAAT CACAAGCCTA CTTGAAACAT CTTGAAGCTC AGCACGCCAG CTTCCTCAAG GATGTGCAGG CCAAATCAAA CCAAATCAAA CAAATTGCTG ATTATCAAGT TGCCTTGAAC GGGATGTTCT TGGATGTTGA TGTTTCAGCG ATTACCGTCT TGCGCAACCA CCCTGATGTT GCATCAGTTG AAGTTGCCAA AGTTGAAAAA CTCGATACAG ATAGCAGCAA CCAATTCATT GGTACGCCAA GCTTTTGGCA AAACCTTGGT GGTGGCAGCA CCGGCACGAA CGCTGGTGAA GGCATTGTGA TTGGGGTGAT CGATAGCGGG ATCTACAACC CTTCGAGCAC CTTGACCACG ACTGGTATCC ACCCATCATT GCGCGATCCA TCACCAGTTG GTGGCGATTA CAGCGCGTGG CCAGGTGGCT ACAAAGGCGT TTGTGCTCCA TCCAACCCAC AAGCTCAAGA TGGTTCGTTT GGCGCTTGTA ACGACAAACT AATCGGGGCT TGGTGGTATA ACGGTGGCGG CATCGCATTC CCTGGTGAAG TTGATTCACC AATGGATGAA GATGGCCACG GCAGCCACAC CGCTACCACT GCTGGTGGTA ATGGTGGCAC TGCTTCACCA TTCGGCACCG TTTCAGGGGT TGCTCCACGC GCCCGAATCA TTGCCTACAA AGTTTGTTGG GAAGAAGATC CAGCAATCAG CGATGACGGC GGTTGTAACG GTGGCGACTC GGTTGATGCA ATCGACCAAG CCTTGATCGA CGGCGTTGAT GTGTTGAACT TCTCGATCAG TGGTGGTAAC TCACCATGGA CCGATGCGGT TGAAATTGCA TTCCGTAACG CTAACGCTGC TGGTGTGATT GTTAGCGCCT CGGCTGGTAA CGCTGGTACG GTTGGGAGCG TTGCTCACAA CTCACCATGG TTGATCACCG TTGCTGCAAG CACCCAAGCC CGCGAATTCA AAGGCTATGT AAGCAACTTG AGCGGTGGTA CGGGCACAGC TCCGTCACCA TTGGTTGGTG CTTCACTTTA CCCAGGCTCA GTCACTGGCC AAATCAAGTT GGCTCCAGTG CAAGCAGGCG AAAGTGTTGC CCCAAGCTTG TGTCAACAAC CCTATGCACC AGGAACCTTC AACGCAACCG ATATCGTGAT TTGTCGCCGT GGTGTCAACG CCCGGGTGCT CAAATACGCT AACGTGTTTG CTGGTGGTGC TGGCGGTGGG ATCATCGTCA ACGCTGCTGA TAACCAAGGT ATTGTTGCCG ACTACTGCGC CAAAGTTTGT ATTCACCTCG AAAAGAATGC AACCTTGGAA AGCGTTGCTG GCGAAGCCTT GGTAACCTAT GTCCAATCAG GCACTGTTAA TGGTAAAGCT GATGGCGGTG TCAAGGTTAT GGGCGCTGGC GATAAGATGG CCGGCTTTAG CTCAATGGGT CCTTCGCCCA TCAAGAACGT GCTCAAACCA GACGTAACTT TGATCGGTGT TGATGTCAAC GCTGGTCATA CTGCATTCGT TTGGGATGAT GGCTTCGCTG ATGGCGAATT GTTCCAAGTT ATCGGTGGTA CCTCAATGTC AAGCCCGCAC AACGCTGGCG TGACCGCTTT GATGCGCCAA AAATACCCAA CTTGGACTCC ATTCGAAATC AAATCGGCTT TGATGACGAC TGCTAAGACC AGCGTTGTGA AGCCCGATGG TGTAACCGCT GCCGATCCAT TCAACATGGG TGCTGGCCGC GTCGATTTGA CCAAAGTATT CAGCGCTGGC TTGGTGCTTG ATGAAACCAT TCCAAACTTC ATCAATGCTA ACCCATCAGC TGGTGGCAAC CCTGCAACCT TGAACATTGC AAGTGCTGCT AACGAATCGT GCCCAGTCGA ATGTACGTGG ACACGGACAG TCAAGAACAC CTTGAACGTT CCTGCAACTT GGAATGTTAG TGGCTCAACC TCGCCTGTAA CGGTAACTGC AACTCCAGCC AGCTTCACCA TTCCTGCTGG TGGAACCCAA GTTATCACGA TCAAGGCCAA TGTTGCTGGC CAACCAATCA ACGGCGTTTG GAAGTTTGGC GAATTCCGCT TGACCGAAGC CGCAAACCGT GCTCCAGCCG TCCACTTCCC AATTGCCGTC AAGGCGAGCG CTGGTGGCGT GCCCGAGAGT GTTTCTGCAA CCACCCGCCG CGATACGGGT ATGGTGACTG CTGGTGGCTT TAGCGCCCTC AACGTAACCA ACCTGACCAT CCGCGAATTT GGCTTGACCA AGGGTGTTCG CGCAACCGAA GTTATCACTG GCGATACCAC CAATGGTAAC CCATACGATA GCTTGACCAA CGGTGTCTTC TATCGCGCAT TGCAAGTGCC TACTGGCGCA AGCAAGTTTG TCGCTAAGAT CACCGATACC GCATCAGAAG ACATCGACTT GTTTGTTGTC CGCGATGCTA ACGGCGACGG GATTCCTCAA GCAGGTGAAC AAGTTGCTGA ATCAGCAACC TCATCGGCTT ACGAAACCGT CACGATCAAC AACCCAACGG CTGGTAACTA CATTGTGGTT GTCCAAAACT GGCAGGCTTC ACCTGCTGCT CCCGACTCAA CCACGGTTGA TATCTACTAC GTACCTGGCA GCAACCTTGG CAACTGGGAT GTTGTTGGTC CAGCCAATGT TTCAGGCAGC GAACCATTCA GTGTTGATAT TCACTGGGAT GAGCCAGCTC TCGAAGCTGG TGATGTCTGG TTCGGCGAAT TCGACCTTGG CACCAGCCCA GCTAGCCCAG GTAGCATTGG TCGTACCGCC TTCACCCTCG AAGTGTTGGA AGCCGAAGTC AGTGTTGATG TTAGCGATAC CATGGTTGAT GTTGGCGAAT ATGTCACCTA CACCGTGAAT CTCTCTAACT ACAGCGCAAT GAACGATACC TTCTATGTAA CCAGCACCTT GCCAGTTGGC TTGGAAGTTG ATGTGAGCAG CTTGCCTGCT GGCGCGAGCT ACAACGCATC AACCCGCACG GTTACTTGGA GTGGTTTGGT TAATGGTGCT GAAACTGGTT ATACCTTCAG CGATAGCCGC GATGGTGGCG TAGCATTCGA TAACTTCGAT GTTTCAGCTG ATGCCAACGC TATTGCGATT TGTGCTGCTG CCAGTACTAC CTGTGACGAA ACCGCACCAA ACTTCACCTT AGGCACGTAC CTTGTTCGCG CCTACGATGT GAATTATGGC GCTGTTCGCC CATGGAGCAA CGGTTTGATT CAATTCGACC CAACGCTTCC ATCAGGCAAT ACCAACTATG TTGCCCAAGA TATGCCAAAC TCAGCAACTC CTAACGGAGT GTACGCTGGT CTTTGGACGG ACTTGGACTT GGATGGTTCT TCAGCTACCG ATACTGGTGG TGGTGAGTAT TACATCTTGA TTGTTGATGG GGTCAATCCA AACGATCCAA CTGTACCGTA TGTTGCAGTT GAATACGAAG ATGCCCAACA ATATGACGAT CCAACTTCAA ACCTCAACTT TACCAACTAT GCCCGCGCTG ATGGCATTGA GTCAGAGTTC TGTACCGTTT ACGGTTCGCT CACTGGCAAT TTGACCACCT TCGCCGATGG TGCAGCCGTT GGGATCGAAA ACTTGACTGG TACGGTTGGC AACACCTACT ACTACAGTGG TGATAGCTCG ACCGCCGCCA ACTTGCCAAC CGCAGGTGCA ACCATCTGTA CCTCATTGGG TGCTGGCGCA CCAGGCTTGC GCTCGTTCAG CTACCGCGCT CGCGCAACCC AAGCTGGTGT CTTGACCAAT AACCTTGAAT ATAGCTCAGC TGCTGATAGC AATACCAGCA CTGAAGCTGT GGCTATCGAA GCAGTTCAAG CAGTGTTCAA GATCTTCATG CCATTGGTTA GCAAGTCGTA A
|
Protein sequence | MKHRYTRLAT VLALLTMLAG NLSANNVTRA AEPLKPVVET ATDSVQVNQA SANRSSKPTI APLYGDFPKS GKVKVIVQFH EAPLATYAGE VKGLKATANA QTGRAKLDVK AAESQAYLKH LEAQHASFLK DVQAKSNQIK QIADYQVALN GMFLDVDVSA ITVLRNHPDV ASVEVAKVEK LDTDSSNQFI GTPSFWQNLG GGSTGTNAGE GIVIGVIDSG IYNPSSTLTT TGIHPSLRDP SPVGGDYSAW PGGYKGVCAP SNPQAQDGSF GACNDKLIGA WWYNGGGIAF PGEVDSPMDE DGHGSHTATT AGGNGGTASP FGTVSGVAPR ARIIAYKVCW EEDPAISDDG GCNGGDSVDA IDQALIDGVD VLNFSISGGN SPWTDAVEIA FRNANAAGVI VSASAGNAGT VGSVAHNSPW LITVAASTQA REFKGYVSNL SGGTGTAPSP LVGASLYPGS VTGQIKLAPV QAGESVAPSL CQQPYAPGTF NATDIVICRR GVNARVLKYA NVFAGGAGGG IIVNAADNQG IVADYCAKVC IHLEKNATLE SVAGEALVTY VQSGTVNGKA DGGVKVMGAG DKMAGFSSMG PSPIKNVLKP DVTLIGVDVN AGHTAFVWDD GFADGELFQV IGGTSMSSPH NAGVTALMRQ KYPTWTPFEI KSALMTTAKT SVVKPDGVTA ADPFNMGAGR VDLTKVFSAG LVLDETIPNF INANPSAGGN PATLNIASAA NESCPVECTW TRTVKNTLNV PATWNVSGST SPVTVTATPA SFTIPAGGTQ VITIKANVAG QPINGVWKFG EFRLTEAANR APAVHFPIAV KASAGGVPES VSATTRRDTG MVTAGGFSAL NVTNLTIREF GLTKGVRATE VITGDTTNGN PYDSLTNGVF YRALQVPTGA SKFVAKITDT ASEDIDLFVV RDANGDGIPQ AGEQVAESAT SSAYETVTIN NPTAGNYIVV VQNWQASPAA PDSTTVDIYY VPGSNLGNWD VVGPANVSGS EPFSVDIHWD EPALEAGDVW FGEFDLGTSP ASPGSIGRTA FTLEVLEAEV SVDVSDTMVD VGEYVTYTVN LSNYSAMNDT FYVTSTLPVG LEVDVSSLPA GASYNASTRT VTWSGLVNGA ETGYTFSDSR DGGVAFDNFD VSADANAIAI CAAASTTCDE TAPNFTLGTY LVRAYDVNYG AVRPWSNGLI QFDPTLPSGN TNYVAQDMPN SATPNGVYAG LWTDLDLDGS SATDTGGGEY YILIVDGVNP NDPTVPYVAV EYEDAQQYDD PTSNLNFTNY ARADGIESEF CTVYGSLTGN LTTFADGAAV GIENLTGTVG NTYYYSGDSS TAANLPTAGA TICTSLGAGA PGLRSFSYRA RATQAGVLTN NLEYSSAADS NTSTEAVAIE AVQAVFKIFM PLVSKS
|
| |