Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4056 |
Symbol | |
ID | 5735914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5176037 |
End bp | 5179441 |
Gene Length | 3405 bp |
Protein Length | 1134 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281207 |
Product | hypothetical protein |
Protein accession | YP_001546816 |
Protein GI | 159900569 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.55653 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCACGGT TGGTGCTTCA ACTCGCTTTG CTCGCCGCAC TTATCGTTCG TTTTAATCTC CCCGCCGCGC AAGCGCAACC TCAAGCTTTT GTCGCTCCTG AAGAATTACA GGCTGTCGCC GATGATCAAT TTTGGAATAA TACAGGCTTG ATTGCTGGGG CAAATAACAC GATTCGCGCC ATCAGTTCGC AATCAGGCGA TTTATTTGTT GGTGGTTTGT TCGATCGCAT CGCTGGCATC AGTGCCAATC GGGTGGCTTT TTGGGATGGC GATCATTGGA ATACGATGGG CAGTGGGGTT AACGGCCCAG TCGATGATCT CGATGCTTCA ACTGGCGGTT CAGTTTATGT GGTTGGCTCC TTCAGCAGTG CTGGTGGAAT TGCCGCCGAT GGCATTGCTC GTTGGAATTC TGGCACAGGC CAATGGTCGG CCTTGGCGAC CAACGTCAAT GGAGCTGTTA CCGCCGTTTT GGTGCAACAG GTCGCTGGCA GTGATGTGGT GTATGTTGGT GGAACGTTTA GCTCAATTGA TGGGGTGAGT GCCAACCGTA TCGCTAAATT CAGCAATGGC TCATGGTCGG CATTAAGCAG TGGGATCGGC GGTGGCACTG CTCCCCAAGT GCTCGATTTA GCGATCAATC CGGCCAATGT CAACCAACTT GTGGCTGGGG GCACGTTTAG CTCGGCTGGT GGTAGCACTG CCAACAATGT AGCGATCTGG ACAGGCTCAG CTTGGCAGAG CCTTGGTACG GGTAGCAGCA ATGGGGTCAA CGGTGCTGTG CGTTTTGTTG ATTTCCGTGG CACAAATATG GTGGTCGTTG GTGGGAGTTT TAGCAATGCT GGCACTGTTA CCAATGTTGG TGGTGCAGCG GTTTGGACTG GTGGCAATAC CTGGGCAGCC ATGGCCGGAC GTGGGGTGAC TGGCGATGTG CGTGGGATTG TCGAGAACGC CAATTTTACC TATGTGATGG GCAATTTTGG CAGCGGTATC AACCCCAATG GCAATAGTGT TTTCTCGCCC AATATCGCCC GTTGGGATGG TAATATCTGG TCGCCTGTAC CGAATGCCAC CAATGCCTTT GGCACAAATG GGGCAATTTT ACGGGCTGAA CGCTTGGGCA GCGGCAGCGA TACCTTCTTT ATTGCTGGGG CGTTTGGCAC AGCCCATGGC ATGGAATTGA ATTTTGTGGG CATGGTTGTG CCACAACAGG GCTTCTTGCC TGGCGCAACT GATCGCTTTT TCCCCTTGGC TGGTGGCCTC GAAGGCTCCA ACGCCAAAGT TTTTGCCATT CAGCCACGTT CTGGCGAAAT TATTGCTGCT GGGCGCTTCG ATCTGGGCAG TAATCGCTTG CTCAATAATA TCGCTCGCTT TGATCCTGTT GATCGAGTCT GGTCGCCGTT GACTGGCTCC AGTGATAGTG GAGTCAACGA TGATGTGCGT GATCTTGCGT TGCGCAACAC CGATTTGATT GTTGTGGGTG AGTTTAGCAA GGCTGGTGGC ATTGATGCCG CAGGCGTAGC TAGTTGGAAT GGCACAACCT GGACTGCCCT CGCAACCAGC ATCAATGGTC GGGTCAACGC GGTTGCGATC AGTGGCAGCG ATATTTATAT TGGCGGCGAA TTTACGCTGA TTGATGGTGT GCCTGCCAAT CGGATTGCCC GTTTGAGTGG TGGTTCATGG CAAGCTGTCG GCGCTGGCAC TGATGGCCCT GTCAATAGTT TGTTGTTCAA ATCGAACCAA CTCTATGCTG GCGGCTTGTT TGCCAACGCT GGTGGTGCGC CTGCCAGCAA TCTTGCCCGC TGGAACGGCA CAACATGGCA AGCAATCGGG GTGGGCACTG ATGCCGAAGT GTTAGCTTTG GCCGATGTTA ATAGCACGAC TGTGGCAGTT GGTGGGCGGT TTACCAGTGC TGGTGGGGTT GCCAACACCC GCGCAGTTGC TCTGCTTAAC CACAGTAGTT TGGCATGGAC GGCGCTTGGC ACTGGCACTG ATGGCTATGT AACCAGCCTC GTCGTGCGCG GTGACGATTT GTATGCTGGT GGTTTGTTCA GCCGCATGGA TGGCTTGACG GTCAATCATG TGGCGCGGTG GAATGGCACA ACTTGGAATG CCTTGGGTAG TGGGGTTGCG GGTGGCAACC TGCAAAATAG CGAAGTTGGA GCCTTAGCGG TCAATGGCGA TAATCTGTAT GTTGGTGGAC GTTTTGATCG GGCTGGCGAT AAAGTTTCGC ATCGCTTTGC TGAATGGCGA CAACCCGAAG TTGATCTCAG CCTCAAACTG AGCGAATCGC CTGATCCGGT GACGATTGGC AATCCTATGA GCTATAAAGC CACCGTCAGC AATTTGGGCA CGATCAGCGC CAGCAGCGTC GTGTACGAAC AAACCTTTGC CAATACATTA GTTTTTGGTC AAGTTACAAC CTCGCAAGGC TCATGTAGTT TCCCCAATTC TACGACCTTA CGTTGTAATT TAGGTACGCT AGCGGCCAAT GCTAGTGCCA ATATCACAAT TAATGCTACG CCCAGCCAAG TTGGCACAAT TAGTAGCACG GGCACGGCTT CATCGCCTGC GAATGAGGCT TTTCCAAGCA ACAATAGTCG TAGCGTTGCA ACTCAAGTGA TTGTGCCTGG TAATCCAGTG CCAAGCATCA GCAACATCAC GCCTGATCGC TTTATTCGCC AGCCGATTGG CTTTCCACCG CCGCCAGCAG TGCGGATTAC AGTCAATGGC ACGGGCTTTG TGGCCAATTC CAAGGTTGTG GTAGCTGGGG TTGAGCGCAC GACAACCTTT ATCAATAGCA ATCGGCTGGA ATTTAGTATG GCCGCAACCA CCAGCCAAGG CACATACTCG GTTTTGGTGC GCAACCCCAC GCCAGGCGGC GGCGATTCCA ACAGCGTAAC CTTGGGGGTT TCGGTTGGCA TCGTTGGCTT GAGCAGCATC ACGCCCAACC TTGGTGGCAC TGATGTCGAT CTACAAACAA CCTTCAATGT AAGCTGGACG CATACCACCG ATCCATGGCG AATTATCGAG CACCTCGATT TGCGCTTGGT CGATAGCGAT GGTGTGGCTT TATGGGCACG CTTTACCGAA GGCGTTTCGG GCACATTTAG CCTGCTCGAT GCCAATGGCG ATGTGCTTGG TTATGCCACG GCTGGCAGCA GCGATCCATT GGAGAGCGAT AGCGCCATCC TCGATTTGGC GGATAGCAAC TTTGCTGGCA GCGGGCCGAC AGGCTTTAGC ATGCAAGTCA ATTTCAGCAT TCGCTTCAAA CCAAGTGCCG CTGGTCGTCG CTATAACATC GAGTTGTATG CCACCGATGA TCATGGCGGG GTACAAGGGC CAGATGTGAT GGGCACATTC ACCGTTGGCA TTCACGATGT ATATCTGCCA ATGACCATTA AATAG
|
Protein sequence | MPRLVLQLAL LAALIVRFNL PAAQAQPQAF VAPEELQAVA DDQFWNNTGL IAGANNTIRA ISSQSGDLFV GGLFDRIAGI SANRVAFWDG DHWNTMGSGV NGPVDDLDAS TGGSVYVVGS FSSAGGIAAD GIARWNSGTG QWSALATNVN GAVTAVLVQQ VAGSDVVYVG GTFSSIDGVS ANRIAKFSNG SWSALSSGIG GGTAPQVLDL AINPANVNQL VAGGTFSSAG GSTANNVAIW TGSAWQSLGT GSSNGVNGAV RFVDFRGTNM VVVGGSFSNA GTVTNVGGAA VWTGGNTWAA MAGRGVTGDV RGIVENANFT YVMGNFGSGI NPNGNSVFSP NIARWDGNIW SPVPNATNAF GTNGAILRAE RLGSGSDTFF IAGAFGTAHG MELNFVGMVV PQQGFLPGAT DRFFPLAGGL EGSNAKVFAI QPRSGEIIAA GRFDLGSNRL LNNIARFDPV DRVWSPLTGS SDSGVNDDVR DLALRNTDLI VVGEFSKAGG IDAAGVASWN GTTWTALATS INGRVNAVAI SGSDIYIGGE FTLIDGVPAN RIARLSGGSW QAVGAGTDGP VNSLLFKSNQ LYAGGLFANA GGAPASNLAR WNGTTWQAIG VGTDAEVLAL ADVNSTTVAV GGRFTSAGGV ANTRAVALLN HSSLAWTALG TGTDGYVTSL VVRGDDLYAG GLFSRMDGLT VNHVARWNGT TWNALGSGVA GGNLQNSEVG ALAVNGDNLY VGGRFDRAGD KVSHRFAEWR QPEVDLSLKL SESPDPVTIG NPMSYKATVS NLGTISASSV VYEQTFANTL VFGQVTTSQG SCSFPNSTTL RCNLGTLAAN ASANITINAT PSQVGTISST GTASSPANEA FPSNNSRSVA TQVIVPGNPV PSISNITPDR FIRQPIGFPP PPAVRITVNG TGFVANSKVV VAGVERTTTF INSNRLEFSM AATTSQGTYS VLVRNPTPGG GDSNSVTLGV SVGIVGLSSI TPNLGGTDVD LQTTFNVSWT HTTDPWRIIE HLDLRLVDSD GVALWARFTE GVSGTFSLLD ANGDVLGYAT AGSSDPLESD SAILDLADSN FAGSGPTGFS MQVNFSIRFK PSAAGRRYNI ELYATDDHGG VQGPDVMGTF TVGIHDVYLP MTIK
|
| |