Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3841 |
Symbol | |
ID | 5735706 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4821508 |
End bp | 4824930 |
Gene Length | 3423 bp |
Protein Length | 1140 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280994 |
Product | hypothetical protein |
Protein accession | YP_001546605 |
Protein GI | 159900358 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0865863 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACGGA TGCTACAGTT TTTGGCAATC GTGATGCTGG TGATCGGTGG CACTTGGCCG CAATTTCAGG CCAACGCAGC TCAATCAGAG CCAAATTTGC AGGCTCCCAC GGCACTCAAT GCGGTGGCCG ACGACCAATT TTGGAATAAC GATAATTTGG TGGCCGATGC CAATAATAAG ATTAATGCGA TTACAACATT GAATAACGAT CTGTTTGTTG GTGGGTTGTT TAGCCGAGTC GGCGGCATCG ATGCTGCACG AGTAGCCCGT TGGAATGGCG AACGTTGGTT TGCGCTAGGC AGTGGTATCA GCGGCGCAAA TGCCAAAATC GAGGCGATCG ATGCCTCAAC CAGCGGCAAT GTCTACGCTG TGGGCGAGTT CGCTAGTGCT GGCGGGGTCG CCGCTGATAG TATCGCCCGC TGGAATATTG CCACCCAACA ATGGTCGGCG CTAGCAACCA ATGTTAATGG CTCGGTCGAT GCTGTGGTGG TTGTGCCCGA TGCGAATGGC GATATTGTGT ATGTTGGCGG CAGCTTTACG AGCATCGATG GGGTCAGCGC CAATCGAATT GCCCGCTGGA CGAATGGCAC ATGGAGCGCC TTGGCTGATG GCCTAGCAGG CACAAATGCC CAAGTTTTTG CTGTAGCCTT CAATCCCAGC AACACGGCCC AAATTTTGGC AGGCGGGGTA TTTACCAGCG CCAATGGTTT GGTTGCCAAC AATATCGCCT TATGGGATGG CAATGCTTGG CAAAGCCTTG GTACTGGCAC GAACAATGGG GTAAATGGTC GAGTGCGCTT TGTCGATTTT CGCAGCAGCA ATTTTGTGGT GGTTGGTGGT AGTTTCACCA GTGCAGGCGT TGTCTTGCCA GTGGGCGGCT CGGCAGCTTG GCGCGGCGGC AATACGTGGG AAGCCTTCAC TGGCCGCGGC TTTAATCCAA TTGCGTCAAA TTCTGAAATT CGCGCAATTG CCGAATTACC TGGGCGAATC TTTGTTGGGG GCAATTATTT TGGTCTGGTG AACTCCAGCG GTCAGGTAAC CACGGGGAAT TTAATGGGGC TATGGAACGG CACAACCTGG CAGTTTATAC CAAATTCACC CTTCAAAATT AGCCAAATCG AACGGATTGG CACAACTGAA CAATTTTTCG TTACTGGTGA AAGTACCAAT ATACTTAATC CGAGTGGTTT TGTTAGTTTG TTTATTCCCA CGCCAACTGG CAGCAATCAG CCGCATCAAT TCTTGCCCTT GGGTGGCACA ATCAGCGGGC CGGATAATGC AGTATTTGCG GTGCATGCCA GCGCTTCAGG CGTGTTTGTC GGTGGCGAAT TCAACCGCGC CAGCGATCAG GCGATCAACA ATATTGCGCT GTTCAACCCA ACGACCCGCA CATGGTCGCC ATTATTTGGC AAAACTGAAA ATGGCACCAA TGATACCGTG CGGGCAATTG CGCCATTTGG CAGTGATCAA TTGATCGTTG CTGGCGATTT TAGCGAGGCT GGCGGAATCA ACACCGCAGG CGTGGCCAAG TGGGATATTG GTGATCAGCA ATGGTTTGAG ATTTCCAATA GTATCAATGG TTCGGTGCGG GCAATTGCCA TCAACGGTAG TGAAATTGTG ATTGGCGGCG ATTTTACCCA AATCGATGGT ACGACTGTCA ATCATATTGC CCGCAGTACC AACGGCGGAA ATTCATGGTC GGCGCTTGGC TCAGGCATTG GCGGGCCAGT TCATGCCTTG GTGTTCCGCA GCGGAACGTT ATTTGCAGGC GGTGATTTTA GCACTGCTAG CGGCGTGAGC GCCAATAATA TTGCCCGTTG GAATGGTACA GCCTGGCAAG CATTGCGCGG TGGCACTGAC GATATTGTAT TTTCGTTAGC GGCATTTGGC ACCCAGATTG CGGTTGGTGG CTCGTTCAAT TTAGCTGATG GCGTGGCTGG AACCAGCGCA ATCGCCCTTG TTCACCCGAC AACTGGGGTT TGGTCGCCGT TGGCTCAAGG CTTTGGCATT GGCAACACTG TTCAAACGCT GGCAGTGCGT GGCACTGATT TGTATGCAGC TGGTACGTTT TTCTACTCGC AGCCTAACCC CACTCGCCCA ACCAATATCG CCCGTTGGGA TGGTACTGCT TGGCAAGCAT TGGGTAGCGG AATTAGTGGC GGCACTGGCA ATGCCGATTC GGTCAAAGGA TTCGCAATGA GCGTGCGCGG CGATGATCTA TTTGTTGGTG GAATTTTTGA TGCTGCTGGC AGCAAATCAT CGAAGCGCTT TGCCCAATGG ACTCAGCCCG AAGTTGATTT GGCGGTCAGT CTGCGCGATT CGGCTGATCC GGTACAGGTC AACACCCCAT TCCAATACAA TGTGAGCTTA ATCAACCAAG GCATCATTAC CGCAACCAGC GTGATTTACG AGCAAACATT TGATAGCTCG GTTAGTTTTG GCAATATTAG TGCTAGCCAA GGCACATGTA GTTTCAGCAA TGCCACGACC TTGCGTTGTA CGCTTGGCAC GCTCACGCCC AATGCCCGCG CTACTGTGGT AGTCAATGCT ACGCCAACTC AAGTTAGAAC GATCACAAGC CGTGGCACGC TTAGTTCGCC GGCTGATGAG CCTATTACAG CCAACAATCA ACAACAAATT AGCACCCAAA TTATTGCTCC AGGCAACCCA GTGCCAACTA TCAGCAGCAT TACGCCCAAT AGTTTTGTGC AACAAGCAAT TGGCTTGCCA GTGCAAATCA GCGTCAGCGG CACGGGTTTT GTTGCTAGCT CCAAGGTGTT TGTTGATGGA ATTGAACGCC CAACTACCTT CTTAAATAAT GTTCAAATTA ATTTCTCAAT GCCAGTCAAT ACAGCACTGG GTAATCATAG CGTGATTGTG CGCAATCCCA CACCTGGCGG CGGCGATTCG AATAGCGTTA ATTTGGCAGT TTTGCGCAAT AACGTCGGAT TTAGCAGCAT CACTCCCGAT ACAGGCGGTG TAGATGTAGG TTTGCAAACC ACCTTCACGA TTAGCTGGAC TCACACCACC GATCCATGGC GGATTATCGA TAGCCTTGAT TTACGCTTGG TCAATGCCGA TGGAATTGGG TTGTGGGCAC GCTTTACTGA GGGCGTTTCG GGCACATTTA GTTTGCTCAA CAGCGAGGGT GATATCATCG GCACAGCGAT TGCCGAAACT CCAAACATCC TCAGCAGCGA CACTGCCGAG CTTGATGTTG AAGCCAGCAG TTTTGCCGGC AGCGGCCCAA CTGGCTTTAG CATGAATGTA ACCTTCAACG TGACCTTCAA CGAGGCGGCG CGTGGTCGCT ATAATATCGA ACTCTACGCC AGCGATGATC ATGGCGAGTT GCAAGGGCCA GATGTGCTGG GCACCTTCGA TGTTGGCATT CACCAAATCT TCCTGCCAAT GACAATTAAA TAA
|
Protein sequence | MRRMLQFLAI VMLVIGGTWP QFQANAAQSE PNLQAPTALN AVADDQFWNN DNLVADANNK INAITTLNND LFVGGLFSRV GGIDAARVAR WNGERWFALG SGISGANAKI EAIDASTSGN VYAVGEFASA GGVAADSIAR WNIATQQWSA LATNVNGSVD AVVVVPDANG DIVYVGGSFT SIDGVSANRI ARWTNGTWSA LADGLAGTNA QVFAVAFNPS NTAQILAGGV FTSANGLVAN NIALWDGNAW QSLGTGTNNG VNGRVRFVDF RSSNFVVVGG SFTSAGVVLP VGGSAAWRGG NTWEAFTGRG FNPIASNSEI RAIAELPGRI FVGGNYFGLV NSSGQVTTGN LMGLWNGTTW QFIPNSPFKI SQIERIGTTE QFFVTGESTN ILNPSGFVSL FIPTPTGSNQ PHQFLPLGGT ISGPDNAVFA VHASASGVFV GGEFNRASDQ AINNIALFNP TTRTWSPLFG KTENGTNDTV RAIAPFGSDQ LIVAGDFSEA GGINTAGVAK WDIGDQQWFE ISNSINGSVR AIAINGSEIV IGGDFTQIDG TTVNHIARST NGGNSWSALG SGIGGPVHAL VFRSGTLFAG GDFSTASGVS ANNIARWNGT AWQALRGGTD DIVFSLAAFG TQIAVGGSFN LADGVAGTSA IALVHPTTGV WSPLAQGFGI GNTVQTLAVR GTDLYAAGTF FYSQPNPTRP TNIARWDGTA WQALGSGISG GTGNADSVKG FAMSVRGDDL FVGGIFDAAG SKSSKRFAQW TQPEVDLAVS LRDSADPVQV NTPFQYNVSL INQGIITATS VIYEQTFDSS VSFGNISASQ GTCSFSNATT LRCTLGTLTP NARATVVVNA TPTQVRTITS RGTLSSPADE PITANNQQQI STQIIAPGNP VPTISSITPN SFVQQAIGLP VQISVSGTGF VASSKVFVDG IERPTTFLNN VQINFSMPVN TALGNHSVIV RNPTPGGGDS NSVNLAVLRN NVGFSSITPD TGGVDVGLQT TFTISWTHTT DPWRIIDSLD LRLVNADGIG LWARFTEGVS GTFSLLNSEG DIIGTAIAET PNILSSDTAE LDVEASSFAG SGPTGFSMNV TFNVTFNEAA RGRYNIELYA SDDHGELQGP DVLGTFDVGI HQIFLPMTIK
|
| |