Gene Haur_5227 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5227 
Symbol 
ID5737185 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp330457 
End bp334641 
Gene Length4185 bp 
Protein Length1394 aa 
Translation table11 
GC content55% 
IMG OID641282391 
Producthypothetical protein 
Protein accessionYP_001547982 
Protein GI159901736 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCCACA ACCGGACGTT TAAAACGGTG ATCCGCACCC TCGCTTGGCT CATTCCAGCC 
GTTGGTATTT TACTCACCCT TGCCTCCGTA TCCGCGAGAC AGCAGCCATC CGACCTTTCC
GCAACCAATC GCCAAACGCA GCCGCTCCCA CCCAATCAAT TTCTCTATCA AGAATTTACT
GAAACAGATC CCCATACTCT TGCCCTTGCT GGCAGTGCAG CGCTGATTGC CGACGAGGGG
GTCATCCGAC TTACCGATGA GACCACCACT GCCCAAGCCG GAGCAATGTG GTATCGCACC
AAGCAACAGG TCAGTGATGG ATTTGATACC ACTTTTGATT TTCGGATCAC GGGATCACCC
ACTGGTGGGG CCGATGGATT AGCCTTCGTT GTCCATAATG ATCCTCATGG GATCAACACC
CTAGGGATAA GCGGTTGTCA ACTAGGCTAT GGTTCGATTA AACAAGGCCT TGCAGTCGAA
TTCGACACCT ATCAGAATAA TCGGTATTGT GCTACAATCG GTGATCCCAA TGGGAATCAC
GTTGGTATTC TGACGGGGGG GACGGGTTCA TTGAGTCCGA TTCACACGTC TCCTGCAAAT
CTAGGAATCA CATCCCTAGA CCCTAACAAA AATAATTTGA AAGCCGGATT CCATACCGCA
CGGATTAGTT ATATCGTCGA TACCCACCGC CCAAAGGGGG TTTTACAGGT GTACCTCGAC
GATATGACGA CCCCCATCTT AAGTACCACC CTCAATCTCT CCCAAACCCT TGGTCTCAGC
GATGGACGAG CGTGGGTTGG CCTTGTTGCC GGGACAGGAA CAATCCTGTC AACCCATGAT
ATTGGCAATT GGTTTTTTGT CGATCAGTGG GATCTCTCGT GTGCCACCTA TAATTTCACG
CGTGTCCGTG AAGCAGCTCA AGGGTTTATC CCTGATCCAT ATCCGAGCGG TGCTTCATGG
CCTTTAGAAG GCCAAAGTTG GATCAGTGTC GCTGGGCGGC TTGGGATATC ACGGGGGATT
ATCAACGATG GTATGCCGAT CCTTGACCCA ACCGACCTGA TCGATCAGCG CTTACGTCGC
CTGATTCAGC AACACCGCCC GATTCGTGCC ATTCCACTCG AATTGACCTG GACGGATCAC
GCGGGGAACG GACCAAACGC GACCGATCAC TTACAGGTTA CCTTTGACAT GAAGGTCGGC
AATCGGCTTA TCTCCGGCAG TCATGTGCTC AGTCCGCTCG ATACGCCCTC TCAGCCTGGA
GGACGATTTA CGCTCCGCAT CCCCCTCAAT GAGATGGATA TTCCCACCGC CATCAAACCA
ACAGGATTGC GCGTCGTCCT CCAAAGTAGC AATCCAAACC ATGCCCTGCG GCTCCATTCG
ACCGGGGTTT GTCTCGATGC AGGTGTCCCC GATCGGCCCG GGGTTGTTCC ACCAGGAGGC
CGCGAAGCCT GTGCCACCTT ACAAGAATAT CTCAAAACCC GTAGCCAACC CTTTGGCCCA
AATGGAGTGG CATCGCAACC GGAACTCTTT GCATCGACCA CGGATAAATC TCTGCGATTA
ACGCCCTTTC CCGACTATAC TATTGGCGTG AAAGATTACC CCACCAGTAT GACCTTTCAT
ACGGTTCCGC CAGGTCATGC GGTTGTTCCC TGTGCCTTTA CAAGGCAGGG TGTCATCTAT
AATCCAACCG CGCCGACGAC CCCCATTGTG CGTTTTGCCG CAGTCGGCAA TGATCGCCAT
TGGCGTGAAA GCCTCATCTT ACTCAATTAC GCCGCCTATT TTTCCCAAGG GCTTACAAAG
TGGTGTGCCT TAGGCGATAC CGTCTATCCC AGTATGTATG ATCGTGATCA AGCAAGCAAC
TTTACACCAG GCTATAATGT TCCTGCCCCT CGGTATAAAT GTGCACCCAT GAGTATTGGG
ACGGGGATTG AAAATGGCCC GTGCGGTACC TTCGCGGGAT CACTCTTTCG CGCCATGGGG
TATTTTAATG TCCCGTCTTT CGCGAGCCAT CTGCATTCGC CGGGTGTCCT CTCAGCGCTC
GGCAACCTCT ACTATGGCCC AATCGGCTAT AACCGCTATG CACTCCCCAA TGAGAATTAT
TATGCACCCG TTCCCAATCC CTTAATCCGC GTTGACACGA CGGATGCCGA ACGCTATCGT
GCGCAGATTG TCCAGTTTGA GGGCATTCGC GTCTATCCCC ATCTGAATGG TCAGCGAGTC
TATGATAACC CTAACGATGC GATCAGTCCC TATCAGATTA CATGGGGTTG GTGGAATGGA
CCGACAGCGC ATGAAGTACC CGTTGCTGCG ATTGTGCCCG CACCCCAAGA CCAAACGCCA
GTCAGTGATC CCTTAATACG ATGGCGAACG ATCAAACCGG GCGATATCGC CATTACCGTG
GTAGAACGCA CTGAGGATGT TGATCTCGAT CTCCATATCG CCATCGTTGT TGGCTGGGGA
CCCCAACAAT TTACGCCGAA TGCTTGGCGC GGCGAGCATC TGCATCCAAC CTATCAGCCG
TGGATGCAAC AGGGCGGCAA CTATGCCTAT GTCCCCTATG TGATGGATCG CTTCTCACTC
CCTGGAACTG GAGCCATTGC TGGCCCACGC CCGTTCAATT ATCGTATCTC GCACACCGCC
ACCGATTTTT GGATCTCCAA TTATTCCATG ACCATGACAT CGCCTGCCCC CCAAGCCACC
ACCACTTCCA TGAATAGTAC TGCCTTCCCC GCCAACGAAC CGCTGAAGGT TGATCAGCCA
TCACCAACGG TGAATGCGCA GCCCATCGTC GATCAGGTTA ACATCGCTCC CTCGCTTGCA
TGGAACACCC AAACCCAGCA AATCGCCATC GGCGGAGCCT TTGGTGTGCG AATCACCGAC
CCCACCCTCA CCCAAGTCGT CACATGGCCA ACAATCAGCG GGGTGACGGA TCTGGCGTGG
AATCATACCG GAACCGCCAT TGCCATCGCC GAAGCCGCTC AACAGGTGAC GATTCGTGAT
CAGAGTGGTG CGCTGATCAG TGAACTCATC GGGATGCGCA GTATGCCAAC CATGGTGGCA
TGGAATGCCA GTGATACGCA ACTGGCTACC GCGAGTCATG ACCCACAACT GGTGATTTGG
AACACCGAGT CATGGCGTGT CGAATCCACC ATCACCTACA CCGGAACCGA TCGGATTCGC
GCCATCGCCT GGAATCCTGA CGGCAGTCTG CTTGCCGGCA GCGATCGCCA TCAGATCTTC
ATCTGGACTA CCGACGGACA GCTACAGCAG CATATCCCCC TTGGCTGGTC GCCGCAAGGA
ACACTTGCTT GGAGCAACAA CGGCCAGTCC CTCATCACTG CGGGAGGACA GCATTGGAGC
ATCCAACATG CAGGACAGTT GCACGGGAGT GTGCTGCCGT GTAGTGCCGG AACCGACATC
CGCAGCATCC TCAACAACAA TACAACCTCT ATGGTCAGCA TTGGCATTGC CGCCGATGGA
GTTCATGCCT GTATTCAATC GATCAGTCCC ACTGGCCAAG TTCTGCCAAT GACTGACCTC
TCCCTACCAC GGGGCATGCC AGCGATCACC GATGGAGCAT GGAATGCTGA CCAGACCCAG
ATCGCCCTGA TAACCACGAA TGGCATCATC CAGCTCTGGG AGCGCAGCAC GGGAACCGTA
CTCAAAGCGC AGCCACTTGG AGGCATGTCG GTCGAAACCT TAAGCAGCGC CGTCCGAACC
TGTATCCCGC ATGATCAAAC CCAGATTGCC GTGCTGCATG CGATCACCCA GGGGGATTAC
GCATTGTTTG TCGCCACCAT CCAAGCTCAT CACCAGCGTA TTGATAGCGC CTGTGCAGCC
TATTTGTCCG CACTCGGAGA GGCATTTCAA GTGAATCCGC CAACACCACC CACCAGTGAG
CCATCCGCGC CACAACCAAT GACTGCGGTG GCCTGGTGTA CGTCACCCAC CCAGCGCGGC
TGGCTGCTTC GCAATCCCAA TCCGTTTGAT GTGCGGATCG GGTGGGGTTG GGCGACCCAC
GCACCACACC TCGTTGAAAC GTCCATCACG CTTTCGGCAG CCCGCGATGG GGTCGATGGA
ACCTATGTCC TCCGCACCCC CATGAATGGT GCGATTCAGG TGCAATCAGG AAGCACCAGC
ATCACCGTCC CGTGGAAAAC ACGTATCACT CGCCAATGCC GCTAG
 
Protein sequence
MIHNRTFKTV IRTLAWLIPA VGILLTLASV SARQQPSDLS ATNRQTQPLP PNQFLYQEFT 
ETDPHTLALA GSAALIADEG VIRLTDETTT AQAGAMWYRT KQQVSDGFDT TFDFRITGSP
TGGADGLAFV VHNDPHGINT LGISGCQLGY GSIKQGLAVE FDTYQNNRYC ATIGDPNGNH
VGILTGGTGS LSPIHTSPAN LGITSLDPNK NNLKAGFHTA RISYIVDTHR PKGVLQVYLD
DMTTPILSTT LNLSQTLGLS DGRAWVGLVA GTGTILSTHD IGNWFFVDQW DLSCATYNFT
RVREAAQGFI PDPYPSGASW PLEGQSWISV AGRLGISRGI INDGMPILDP TDLIDQRLRR
LIQQHRPIRA IPLELTWTDH AGNGPNATDH LQVTFDMKVG NRLISGSHVL SPLDTPSQPG
GRFTLRIPLN EMDIPTAIKP TGLRVVLQSS NPNHALRLHS TGVCLDAGVP DRPGVVPPGG
REACATLQEY LKTRSQPFGP NGVASQPELF ASTTDKSLRL TPFPDYTIGV KDYPTSMTFH
TVPPGHAVVP CAFTRQGVIY NPTAPTTPIV RFAAVGNDRH WRESLILLNY AAYFSQGLTK
WCALGDTVYP SMYDRDQASN FTPGYNVPAP RYKCAPMSIG TGIENGPCGT FAGSLFRAMG
YFNVPSFASH LHSPGVLSAL GNLYYGPIGY NRYALPNENY YAPVPNPLIR VDTTDAERYR
AQIVQFEGIR VYPHLNGQRV YDNPNDAISP YQITWGWWNG PTAHEVPVAA IVPAPQDQTP
VSDPLIRWRT IKPGDIAITV VERTEDVDLD LHIAIVVGWG PQQFTPNAWR GEHLHPTYQP
WMQQGGNYAY VPYVMDRFSL PGTGAIAGPR PFNYRISHTA TDFWISNYSM TMTSPAPQAT
TTSMNSTAFP ANEPLKVDQP SPTVNAQPIV DQVNIAPSLA WNTQTQQIAI GGAFGVRITD
PTLTQVVTWP TISGVTDLAW NHTGTAIAIA EAAQQVTIRD QSGALISELI GMRSMPTMVA
WNASDTQLAT ASHDPQLVIW NTESWRVEST ITYTGTDRIR AIAWNPDGSL LAGSDRHQIF
IWTTDGQLQQ HIPLGWSPQG TLAWSNNGQS LITAGGQHWS IQHAGQLHGS VLPCSAGTDI
RSILNNNTTS MVSIGIAADG VHACIQSISP TGQVLPMTDL SLPRGMPAIT DGAWNADQTQ
IALITTNGII QLWERSTGTV LKAQPLGGMS VETLSSAVRT CIPHDQTQIA VLHAITQGDY
ALFVATIQAH HQRIDSACAA YLSALGEAFQ VNPPTPPTSE PSAPQPMTAV AWCTSPTQRG
WLLRNPNPFD VRIGWGWATH APHLVETSIT LSAARDGVDG TYVLRTPMNG AIQVQSGSTS
ITVPWKTRIT RQCR