Gene Information |
Locus tag | Haur_4484 |
Symbol | |
ID | 5736335 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5738668 |
End bp | 5742786 |
Gene Length | 4119 bp |
Protein Length | 1372 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281647 |
Product | hypothetical protein |
Protein accession | YP_001547244 |
Protein GI | 159900997 |
COG category | |
COG ID | |
TIGRFAM ID | |
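The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the protein length excludes the terminal stop codon. The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information table above.
start_bp = 5738668
end_bp = 5742786
gene_length_bp = 4119
protein_length_aa = 1372


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length_aa(gene_length: int) -> int:
    """Amino acid count, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


# Compare the derived values with the values reported in the record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_length_aa(gene_length_bp) == protein_length_aa)
```

Under these assumptions both comparisons hold for this record: the span from 5738668 to 5742786 covers 4119 bp, which corresponds to 1373 codons, or 1372 amino acids plus one stop codon.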
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.517652 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGATTAATC GTAGTTCATG GCAATTTCGC AGTCAGTTGT TGGTGAGTAG CGGCCTTGTA GGGCTTATGG TGCTACTCAC AACGTTCTTA ACCACATCAT CATCGGCCCA AACCCGCGAG TGTCGGATGG CTGATGCTTT GAAGATTTGT GCTAATACGT TTGTTGCAAC CGATCCCACA CACTTTATTG CTCGCGATGA TGTAACGCTG GCGATTGGTG ATGCGCCACC ATTGATCAGC GCTGGCGCGG TTGGCGCGAA CCTTGGCGAG TTTGTGTTCA CTGCCGATCA GTCGGTGCTG TCGGGCGCAG TTAAATTTAT TGGCGATAAT GCTAGCTTGC CGCTAGTTGC CTCAACCTAT AATGCCAACA ACACGCCGAA AGAGGTGTTT GAGGTTGATA CGACTGGCCT AACGATTACC AATGATCAAA GCAGTGCTGA TCCAATCGGC GTAATCGCCA ATAGCACGAT CAGCCTGCAC TTTCTTGATC GCTCGGGTGT GCGCAGTTTT TATAAGACGA CCGACCCCAG CGAGACCGAA GATCTGAGCT TTGTGTTCGA TCTGGCCGCA GCAGAATTTC GGGCTGAGCT GCCGATTAAC CTCAACATCG CAGCATTAAA CCCAAGCACG TCTGAGAATC CCAATCTCAA TATTGTGGTG AATCTCAAAT ATTCGCAACA AGGCGTGCTC TCAGGCAATG TCGATAATTT CACCATGTCA TTGGCTGGCA TGAGTGTTGC AGTCAAAGAG ATTGCGCTAT CAACTGGCTC TTTTGAAGCT GGTTTGGTCG AAGTTTCACG AGCGGCTAAT CCCGATTTAC CAAACCTTGA TCCGGCTAAG CCAAATTTGG TCTTTTCGCT GCAAAATCTC AAGTATGCGA ACCGCAGCTT TAGCATTGGC GGCGGCTCGG TGCCAATTCC CGATTGGAAA TTTGGCGGCA ACTTTAGCAT GACCGACCAA ACCTTGAGCA TTGGCCATGA TAATGCGACT GGTACGTCAA CCGTTAGCGT CAACTCAACC TTGATTTTTG GCAATGTGCT GAACCAACCA GAATCAGCCA CGCCGCGCGA AGTAACCCTG ACATTTAGTG CCGTCAAAGT CAACGGTGTT TTCAAGCCTG TTTTCAGTGC TACTGTTGCT GAAACCACCG TTGCAATGGG GCCGTTGAAT TTTCGCCTGC GCGGGTTGAG CTTAATCGGC GATACAGTAG AAAATTTCTA TGGCTTGAAA GCAAGTGCGG TTGATCTGCT GTGGTCAAGC GATATGGGCG GTCAATCGGC AGCAGGTATT ACTGGCTTCA AATTTGGCAT CAATAAAGAT GGTAAGTTGC AATTTGCCCT CGGCGGTGCG ACGATCAGCA CTCCTAATAT TTCGAGCGGC GTGTTGGAAG GCAGCAGCAT CGTTGGTCAG TTTGGCGTGG CCCAACAAAC CCTCAACCTA ACCGTGACTG GTAACCTGAG CGTAAAAATC CCAGGTAACA GCGGTGTTGG TGCAGGCTTG GTGATGGTCG TGCGTGGCGG CCCGAATGTT GCGCCGATTG GCAGCCAAAA CTGTGCCCAA TCACCATGTA TCAAACGTTT CGAAGCCTCG CTAACCAACT TTAGCGTCAA AATTGCTGGT TTCAGCATGG CCTTGCAGAA TCCCCGCTTC CTTGATGATG GTGGTTTTGC CGCCGATAGT GCTCGACTCT CGATGTCAGA CCTGATGGGC AACCTTACCG CCGATGTGTC TGGCCTCAGC ATCAGTGGGC GCGGCGAAGT TTCGGTAACT GGCGGCGGGA TTGAGTTGCC ACCGCTCAAG 
ATTGCAGGAA CTAATTTTGT TGGTTTCCGT GGCTTCTTCA GCAAAGATGG CGCTGGCTAC ATGTTTGCTG GTGGGGCAAC TCTGAGCATG CCTGGCTTTG ATCCTAGCGG CGGCTCAACG ATTTCGGTTG ATGTTTCGGT CAAAACCTTG CCAACCGGGG TGTTTAATGA GTTGGATGTC GTGGTGGCCT TCGAGTCATC GCCAGGCATT CCGTTGGCCA ACAGTGGCGC TGCCCTGACC AAGATGAGTG GTTCGTTCTC GCTCAAGTCA GGCTCGGTCA CGATTGGGGT TGGCATTGAA GTAAGTTCAG TTGCTCAGCT CGCAGGAATT CCGCTAGTTT CGGCAGAGGG AACTGCAACC TTGGTGGTTG ATCCATTCAA GTTCTCGCTG ACCGCCAGCA TGAAGGTATT GATTTTTGAA GTTGCTAGCG CTAGTGTCGA AATTGGCCAC GAAGCTGGGT TTAGCGGCGG CAAGGGTCTG CATGCTAAGT TCCAATTTGA AGCAGTGATT GTACGTGGCG GCTTGGAACT GCGGGTTGGC ACGGTCACGG TTCGTAGCTG TACGCCTGCT GGCTCAACCA ATTGTGTTGA TAAACAAAAA CTACGCTTTG CTGGCTCAGC ACGAATGTCG GTTGGCTTGC GCAAAGGCCA GTTCGGCAAA GCCTTACCAC CAAAGAATAT TACCTTTGGC TCAGTTTCAT TCCAAATGGG CGAGTTCGAA AAATCGGGTG GTGGTACGAC CGTAGGGATG TTGGGCCGCG TGAGCTGCTG TTTCGGCATC TTCAAAGTGA GTGTATTTGT TGATTTGAGC AAGCCGGTTG GATTGAACAC TGGTTTTGTC AAGCTCGTCA ACCCCAAAAA TTATCGCCTG ATTAACTCGC TCCAAATTGC GCAGAGTATC GAGCAAGGCG AACCAGGCTA CAGCCAACGA ATCATCTCAC GGCCTGTCAA CCCAGGGAAA TCGGGTGGTG CCTTGTTTGC GGCAATCCCT GAAGTTACAG TACCAGTGGT GATTACCTCG ACCGCCAGTG GCTACTTTGG GATTCACTTT AGTGGAACTC CTTCAGTTGA ACCAGTTATC CGCTTGATTC TGCCTGATGG AACGGAACTC AATGAAGGCA ATGTCAATGA CACAACTCAG ACCTTGATTC GCGATTACAC CACCGTGATT ACGGAAGGCA ATGACTTGGC CTTTATGCTC GAAGCGGCCA CACCTGGTAC GTATAGTTTG ATTATTGAAG GGCCACCAAG CCAATACGAA GTGGTTGCTT ACCAACTGAA TAACCCACCA ATTTTCGATA GCGCAACCTT GGCTTGTGGC GGTGCGGCAA CCCCAGGCGT AACCGTAACA TGTAACTCGG CTCCAACTGG CAGCAAAGTT ACGGTTAATT GGGCGACCCG CGATACCGAT GACCCTAACG CCAAAGTTTC GCTGGTCTAT GCTAGCGTGA TCACCCCAAC CGATCCCATT GATGTAGGCT TGAGCACGGT CATTAGCGAT AACATCAAAC TTGGTACAGG CCAGCATGTT TGGGATCTAA GCGAGATTCC AAGTGGTCAA TATAAGCTGG CGCTCTTCGC CGACGATGGC CATAATCAAC CAACCGTTCA ACAATTGGAT ACCTTGATTG TGGTCAATGA TCAGCGTGCG CCCAAAATTC CAACCAATCT TCAAGCGACT CCATTGCCCG GTCAACTGTT GGTCAAGTGG ACACCGAACA GCGAAATGGA CCTTGGTGGC TATGAGATTG GCTTTGGTGA AGTCAATGAT CCAAATGAGT TCCTCTACTC CCGCAATATG GGTGGCAAAG 
AGATGATCTT TACTGCGACG AACCAACTTG ATGCCAAACT GTGGGGCTTG AAAGACAATC AATCGATCTT CTATGGGATT CGAGCCTATG ATCTTAGTGG CAACTTCAGC GCTTGGTCAC CGTTAGTTGT GGGCACGCCG TGGTCGCTAA GCCCACATGC TTGGAATCCA GTGCCTGGAG GACGCGGTGT GACAACCACC AAGATTGAGG CCGCCTTTGA AACTCCACTG AGCGAGGCAT CGCTGACGAA TGCCTTCCAA GTTCGCAATG CCAGCAATCA GTTGGTAGCC GGAACACCAA TCTATCTCTA CAATCTTGAT AAAACTGAGA TTATCGGATT TAGCTTCAAG CCTAGCGCTA CGCTGGTCGA TGGTGAAACC TACACCGTGA CCATTCGTGG CGGAGCCAAC GGAATTAGAT CGAAAGATGG CCGTCAAATG CCGGCTGACT TCAGTTGGAA ATTCGAGGTC GAATCGTATC AAATCTATCT GCCAGCCGTG AAGCGCTAA
Protein sequence | MINRSSWQFR SQLLVSSGLV GLMVLLTTFL TTSSSAQTRE CRMADALKIC ANTFVATDPT HFIARDDVTL AIGDAPPLIS AGAVGANLGE FVFTADQSVL SGAVKFIGDN ASLPLVASTY NANNTPKEVF EVDTTGLTIT NDQSSADPIG VIANSTISLH FLDRSGVRSF YKTTDPSETE DLSFVFDLAA AEFRAELPIN LNIAALNPST SENPNLNIVV NLKYSQQGVL SGNVDNFTMS LAGMSVAVKE IALSTGSFEA GLVEVSRAAN PDLPNLDPAK PNLVFSLQNL KYANRSFSIG GGSVPIPDWK FGGNFSMTDQ TLSIGHDNAT GTSTVSVNST LIFGNVLNQP ESATPREVTL TFSAVKVNGV FKPVFSATVA ETTVAMGPLN FRLRGLSLIG DTVENFYGLK ASAVDLLWSS DMGGQSAAGI TGFKFGINKD GKLQFALGGA TISTPNISSG VLEGSSIVGQ FGVAQQTLNL TVTGNLSVKI PGNSGVGAGL VMVVRGGPNV APIGSQNCAQ SPCIKRFEAS LTNFSVKIAG FSMALQNPRF LDDGGFAADS ARLSMSDLMG NLTADVSGLS ISGRGEVSVT GGGIELPPLK IAGTNFVGFR GFFSKDGAGY MFAGGATLSM PGFDPSGGST ISVDVSVKTL PTGVFNELDV VVAFESSPGI PLANSGAALT KMSGSFSLKS GSVTIGVGIE VSSVAQLAGI PLVSAEGTAT LVVDPFKFSL TASMKVLIFE VASASVEIGH EAGFSGGKGL HAKFQFEAVI VRGGLELRVG TVTVRSCTPA GSTNCVDKQK LRFAGSARMS VGLRKGQFGK ALPPKNITFG SVSFQMGEFE KSGGGTTVGM LGRVSCCFGI FKVSVFVDLS KPVGLNTGFV KLVNPKNYRL INSLQIAQSI EQGEPGYSQR IISRPVNPGK SGGALFAAIP EVTVPVVITS TASGYFGIHF SGTPSVEPVI RLILPDGTEL NEGNVNDTTQ TLIRDYTTVI TEGNDLAFML EAATPGTYSL IIEGPPSQYE VVAYQLNNPP IFDSATLACG GAATPGVTVT CNSAPTGSKV TVNWATRDTD DPNAKVSLVY ASVITPTDPI DVGLSTVISD NIKLGTGQHV WDLSEIPSGQ YKLALFADDG HNQPTVQQLD TLIVVNDQRA PKIPTNLQAT PLPGQLLVKW TPNSEMDLGG YEIGFGEVND PNEFLYSRNM GGKEMIFTAT NQLDAKLWGL KDNQSIFYGI RAYDLSGNFS AWSPLVVGTP WSLSPHAWNP VPGGRGVTTT KIEAAFETPL SEASLTNAFQ VRNASNQLVA GTPIYLYNLD KTEIIGFSFK PSATLVDGET YTVTIRGGAN GIRSKDGRQM PADFSWKFEV ESYQIYLPAV KR
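The sequence fields above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a field could be normalised before further analysis, together with a GC-fraction calculation. The helper names are hypothetical, and the example uses only the first three blocks of the gene sequence, so its GC value is not expected to match the whole-gene figure reported in the record.

```python
def normalise_sequence(field: str) -> str:
    """Remove the whitespace used to group residues into blocks of ten."""
    return "".join(field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence field, copied from the record.
excerpt = "ATGATTAATC GTAGTTCATG GCAATTTCGC"
clean = normalise_sequence(excerpt)
print(len(clean), round(gc_fraction(clean), 3))
```

Applying `normalise_sequence` to the full gene sequence field (all three wrapped lines joined) should yield a string whose length can be compared with the reported gene length.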