Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5227 |
Symbol | |
ID | 5737185 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | + |
Start bp | 330457 |
End bp | 334641 |
Gene Length | 4185 bp |
Protein Length | 1394 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641282391 |
Product | hypothetical protein |
Protein accession | YP_001547982 |
Protein GI | 159901736 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCCACA ACCGGACGTT TAAAACGGTG ATCCGCACCC TCGCTTGGCT CATTCCAGCC GTTGGTATTT TACTCACCCT TGCCTCCGTA TCCGCGAGAC AGCAGCCATC CGACCTTTCC GCAACCAATC GCCAAACGCA GCCGCTCCCA CCCAATCAAT TTCTCTATCA AGAATTTACT GAAACAGATC CCCATACTCT TGCCCTTGCT GGCAGTGCAG CGCTGATTGC CGACGAGGGG GTCATCCGAC TTACCGATGA GACCACCACT GCCCAAGCCG GAGCAATGTG GTATCGCACC AAGCAACAGG TCAGTGATGG ATTTGATACC ACTTTTGATT TTCGGATCAC GGGATCACCC ACTGGTGGGG CCGATGGATT AGCCTTCGTT GTCCATAATG ATCCTCATGG GATCAACACC CTAGGGATAA GCGGTTGTCA ACTAGGCTAT GGTTCGATTA AACAAGGCCT TGCAGTCGAA TTCGACACCT ATCAGAATAA TCGGTATTGT GCTACAATCG GTGATCCCAA TGGGAATCAC GTTGGTATTC TGACGGGGGG GACGGGTTCA TTGAGTCCGA TTCACACGTC TCCTGCAAAT CTAGGAATCA CATCCCTAGA CCCTAACAAA AATAATTTGA AAGCCGGATT CCATACCGCA CGGATTAGTT ATATCGTCGA TACCCACCGC CCAAAGGGGG TTTTACAGGT GTACCTCGAC GATATGACGA CCCCCATCTT AAGTACCACC CTCAATCTCT CCCAAACCCT TGGTCTCAGC GATGGACGAG CGTGGGTTGG CCTTGTTGCC GGGACAGGAA CAATCCTGTC AACCCATGAT ATTGGCAATT GGTTTTTTGT CGATCAGTGG GATCTCTCGT GTGCCACCTA TAATTTCACG CGTGTCCGTG AAGCAGCTCA AGGGTTTATC CCTGATCCAT ATCCGAGCGG TGCTTCATGG CCTTTAGAAG GCCAAAGTTG GATCAGTGTC GCTGGGCGGC TTGGGATATC ACGGGGGATT ATCAACGATG GTATGCCGAT CCTTGACCCA ACCGACCTGA TCGATCAGCG CTTACGTCGC CTGATTCAGC AACACCGCCC GATTCGTGCC ATTCCACTCG AATTGACCTG GACGGATCAC GCGGGGAACG GACCAAACGC GACCGATCAC TTACAGGTTA CCTTTGACAT GAAGGTCGGC AATCGGCTTA TCTCCGGCAG TCATGTGCTC AGTCCGCTCG ATACGCCCTC TCAGCCTGGA GGACGATTTA CGCTCCGCAT CCCCCTCAAT GAGATGGATA TTCCCACCGC CATCAAACCA ACAGGATTGC GCGTCGTCCT CCAAAGTAGC AATCCAAACC ATGCCCTGCG GCTCCATTCG ACCGGGGTTT GTCTCGATGC AGGTGTCCCC GATCGGCCCG GGGTTGTTCC ACCAGGAGGC CGCGAAGCCT GTGCCACCTT ACAAGAATAT CTCAAAACCC GTAGCCAACC CTTTGGCCCA AATGGAGTGG CATCGCAACC GGAACTCTTT GCATCGACCA CGGATAAATC TCTGCGATTA ACGCCCTTTC CCGACTATAC TATTGGCGTG AAAGATTACC CCACCAGTAT GACCTTTCAT ACGGTTCCGC CAGGTCATGC GGTTGTTCCC TGTGCCTTTA CAAGGCAGGG TGTCATCTAT AATCCAACCG CGCCGACGAC CCCCATTGTG CGTTTTGCCG CAGTCGGCAA TGATCGCCAT TGGCGTGAAA GCCTCATCTT ACTCAATTAC GCCGCCTATT TTTCCCAAGG GCTTACAAAG TGGTGTGCCT TAGGCGATAC CGTCTATCCC AGTATGTATG ATCGTGATCA AGCAAGCAAC TTTACACCAG GCTATAATGT TCCTGCCCCT CGGTATAAAT GTGCACCCAT GAGTATTGGG ACGGGGATTG AAAATGGCCC GTGCGGTACC TTCGCGGGAT CACTCTTTCG CGCCATGGGG TATTTTAATG TCCCGTCTTT CGCGAGCCAT CTGCATTCGC CGGGTGTCCT CTCAGCGCTC GGCAACCTCT ACTATGGCCC AATCGGCTAT AACCGCTATG CACTCCCCAA TGAGAATTAT TATGCACCCG TTCCCAATCC CTTAATCCGC GTTGACACGA CGGATGCCGA ACGCTATCGT GCGCAGATTG TCCAGTTTGA GGGCATTCGC GTCTATCCCC ATCTGAATGG TCAGCGAGTC TATGATAACC CTAACGATGC GATCAGTCCC TATCAGATTA CATGGGGTTG GTGGAATGGA CCGACAGCGC ATGAAGTACC CGTTGCTGCG ATTGTGCCCG CACCCCAAGA CCAAACGCCA GTCAGTGATC CCTTAATACG ATGGCGAACG ATCAAACCGG GCGATATCGC CATTACCGTG GTAGAACGCA CTGAGGATGT TGATCTCGAT CTCCATATCG CCATCGTTGT TGGCTGGGGA CCCCAACAAT TTACGCCGAA TGCTTGGCGC GGCGAGCATC TGCATCCAAC CTATCAGCCG TGGATGCAAC AGGGCGGCAA CTATGCCTAT GTCCCCTATG TGATGGATCG CTTCTCACTC CCTGGAACTG GAGCCATTGC TGGCCCACGC CCGTTCAATT ATCGTATCTC GCACACCGCC ACCGATTTTT GGATCTCCAA TTATTCCATG ACCATGACAT CGCCTGCCCC CCAAGCCACC ACCACTTCCA TGAATAGTAC TGCCTTCCCC GCCAACGAAC CGCTGAAGGT TGATCAGCCA TCACCAACGG TGAATGCGCA GCCCATCGTC GATCAGGTTA ACATCGCTCC CTCGCTTGCA TGGAACACCC AAACCCAGCA AATCGCCATC GGCGGAGCCT TTGGTGTGCG AATCACCGAC CCCACCCTCA CCCAAGTCGT CACATGGCCA ACAATCAGCG GGGTGACGGA TCTGGCGTGG AATCATACCG GAACCGCCAT TGCCATCGCC GAAGCCGCTC AACAGGTGAC GATTCGTGAT CAGAGTGGTG CGCTGATCAG TGAACTCATC GGGATGCGCA GTATGCCAAC CATGGTGGCA TGGAATGCCA GTGATACGCA ACTGGCTACC GCGAGTCATG ACCCACAACT GGTGATTTGG AACACCGAGT CATGGCGTGT CGAATCCACC ATCACCTACA CCGGAACCGA TCGGATTCGC GCCATCGCCT GGAATCCTGA CGGCAGTCTG CTTGCCGGCA GCGATCGCCA TCAGATCTTC ATCTGGACTA CCGACGGACA GCTACAGCAG CATATCCCCC TTGGCTGGTC GCCGCAAGGA ACACTTGCTT GGAGCAACAA CGGCCAGTCC CTCATCACTG CGGGAGGACA GCATTGGAGC ATCCAACATG CAGGACAGTT GCACGGGAGT GTGCTGCCGT GTAGTGCCGG AACCGACATC CGCAGCATCC TCAACAACAA TACAACCTCT ATGGTCAGCA TTGGCATTGC CGCCGATGGA GTTCATGCCT GTATTCAATC GATCAGTCCC ACTGGCCAAG TTCTGCCAAT GACTGACCTC TCCCTACCAC GGGGCATGCC AGCGATCACC GATGGAGCAT GGAATGCTGA CCAGACCCAG ATCGCCCTGA TAACCACGAA TGGCATCATC CAGCTCTGGG AGCGCAGCAC GGGAACCGTA CTCAAAGCGC AGCCACTTGG AGGCATGTCG GTCGAAACCT TAAGCAGCGC CGTCCGAACC TGTATCCCGC ATGATCAAAC CCAGATTGCC GTGCTGCATG CGATCACCCA GGGGGATTAC GCATTGTTTG TCGCCACCAT CCAAGCTCAT CACCAGCGTA TTGATAGCGC CTGTGCAGCC TATTTGTCCG CACTCGGAGA GGCATTTCAA GTGAATCCGC CAACACCACC CACCAGTGAG CCATCCGCGC CACAACCAAT GACTGCGGTG GCCTGGTGTA CGTCACCCAC CCAGCGCGGC TGGCTGCTTC GCAATCCCAA TCCGTTTGAT GTGCGGATCG GGTGGGGTTG GGCGACCCAC GCACCACACC TCGTTGAAAC GTCCATCACG CTTTCGGCAG CCCGCGATGG GGTCGATGGA ACCTATGTCC TCCGCACCCC CATGAATGGT GCGATTCAGG TGCAATCAGG AAGCACCAGC ATCACCGTCC CGTGGAAAAC ACGTATCACT CGCCAATGCC GCTAG
|
Protein sequence | MIHNRTFKTV IRTLAWLIPA VGILLTLASV SARQQPSDLS ATNRQTQPLP PNQFLYQEFT ETDPHTLALA GSAALIADEG VIRLTDETTT AQAGAMWYRT KQQVSDGFDT TFDFRITGSP TGGADGLAFV VHNDPHGINT LGISGCQLGY GSIKQGLAVE FDTYQNNRYC ATIGDPNGNH VGILTGGTGS LSPIHTSPAN LGITSLDPNK NNLKAGFHTA RISYIVDTHR PKGVLQVYLD DMTTPILSTT LNLSQTLGLS DGRAWVGLVA GTGTILSTHD IGNWFFVDQW DLSCATYNFT RVREAAQGFI PDPYPSGASW PLEGQSWISV AGRLGISRGI INDGMPILDP TDLIDQRLRR LIQQHRPIRA IPLELTWTDH AGNGPNATDH LQVTFDMKVG NRLISGSHVL SPLDTPSQPG GRFTLRIPLN EMDIPTAIKP TGLRVVLQSS NPNHALRLHS TGVCLDAGVP DRPGVVPPGG REACATLQEY LKTRSQPFGP NGVASQPELF ASTTDKSLRL TPFPDYTIGV KDYPTSMTFH TVPPGHAVVP CAFTRQGVIY NPTAPTTPIV RFAAVGNDRH WRESLILLNY AAYFSQGLTK WCALGDTVYP SMYDRDQASN FTPGYNVPAP RYKCAPMSIG TGIENGPCGT FAGSLFRAMG YFNVPSFASH LHSPGVLSAL GNLYYGPIGY NRYALPNENY YAPVPNPLIR VDTTDAERYR AQIVQFEGIR VYPHLNGQRV YDNPNDAISP YQITWGWWNG PTAHEVPVAA IVPAPQDQTP VSDPLIRWRT IKPGDIAITV VERTEDVDLD LHIAIVVGWG PQQFTPNAWR GEHLHPTYQP WMQQGGNYAY VPYVMDRFSL PGTGAIAGPR PFNYRISHTA TDFWISNYSM TMTSPAPQAT TTSMNSTAFP ANEPLKVDQP SPTVNAQPIV DQVNIAPSLA WNTQTQQIAI GGAFGVRITD PTLTQVVTWP TISGVTDLAW NHTGTAIAIA EAAQQVTIRD QSGALISELI GMRSMPTMVA WNASDTQLAT ASHDPQLVIW NTESWRVEST ITYTGTDRIR AIAWNPDGSL LAGSDRHQIF IWTTDGQLQQ HIPLGWSPQG TLAWSNNGQS LITAGGQHWS IQHAGQLHGS VLPCSAGTDI RSILNNNTTS MVSIGIAADG VHACIQSISP TGQVLPMTDL SLPRGMPAIT DGAWNADQTQ IALITTNGII QLWERSTGTV LKAQPLGGMS VETLSSAVRT CIPHDQTQIA VLHAITQGDY ALFVATIQAH HQRIDSACAA YLSALGEAFQ VNPPTPPTSE PSAPQPMTAV AWCTSPTQRG WLLRNPNPFD VRIGWGWATH APHLVETSIT LSAARDGVDG TYVLRTPMNG AIQVQSGSTS ITVPWKTRIT RQCR
|
| |