Gene Haur_2540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2540 
Symbol 
ID5734418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3257625 
End bp3262463 
Gene Length4839 bp 
Protein Length1612 aa 
Translation table11 
GC content45% 
IMG OID641279680 
Producthypothetical protein 
Protein accessionYP_001545306 
Protein GI159899059 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.316751 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCTTG ATTTCGATTC CAATGATATT CAATTGAACA AAGGCGATAT TCGTCAATTG 
CAAAATGCCG ATGCGTTGGC GGGCTTTTTG GCGCGTTTGC AGTATGCGAT CGATGTGCGC
ACAACGATCG ATTCGTATGC TCAGGTTGGC CTCGATAGCG CCTATTTGCG CCAAGAAATT
AAGCATCTGG AGTTGTTGGC AACCGATCCC CACGATCAAG AAATAAAAAT TTATTTGTTA
GAAGTGCGCT CACTCACCGC TGCTTTACGC AATGCTATCG CCCGCGCGTT TCGCGATCGG
CCTGAGTTGG CGTTGGTTAT TTTGACCAAA GATTATGAGA GCTTTGAGTT TGTGATCTTG
CTGCGCGAGT TGGCGCAAAA TACCCAGCGT GGCGCGGCGA TTCGCCAAAT GTTGCGGCCT
GTGCCACTCA CGATCAATCG TTTGCATGTT TCGGATGTAG CGTTGCGCGT GCTCAAACGC
TTTACCATGA CCGAACCCGA TGCGCTGTAT CAGTGGGATA AGCTGCGCTC GGCCTTTGTG
TTGGCCGAGT GGAGCGAGCA ATATTTTAAT AATCGGGCGC TGTTCTCGGA TTATTATTTG
AAACATCGTT TGACTGATGC CAAATTAAAT GCTGAATGGA ATGAAGATGT GTTGCCAATT
GGCCGCAAAA TCAACCCGCT GCTCAAAACT GCCCGTCAAC AATTAACTGA TCAGCCTCTT
GCGACGATGC AAAAGCAATT TTATGCGCCA ATCTTGGCGG ATTTGGGCTT TGTGCTGCAA
GCGGTCGCGG GTGAGCCAGC CTATCGCTAC GATTTGGCCT TGCCCAATAA TCCCCAACCA
GTCGCTGCTT TTTTTGGCTA TGTTTGGAAT CGTAATTTAG ACGATCAAGA TAGCCAGCGC
GATCCCGCTA CGCCGCTGAT CATTCCTGGG GCCGAGGTGG TTTCGACGCT TGAGGCTGGC
ACGGTGCCAT GGGTGATTGT GAGCAATGGC AAATATTGGC GCTTATATTC AACCACTGCC
AGCAATAAGG CTACCAACTA CTACGAGGTT GATTTAGAAG AGGCTTTCGG GGCGCAGGAT
GCGCTGGTTG CACTCAAATA TTGGTGGTTG TTTTTTCGTG CTCAGGCGTT CGGCGGATTT
TTGGAGCGCT TGCTCAAACA ATCGGCGGAG TATGCCAAGG GCTTGGGCGA ACGCTTGAAA
GATCGGGTGT TTACTGAGAT TTTTCCCCAA TTTGCCCAAG GTTTTATTGC CGACATGCGC
CGCCAAGGTG CTAACGACGT TTCTGAGCAG CAACTCAACC CGGTTTTTGA AGGTACGTTG
ACCTTTTTGT ATCGATTGAT GTTTGTGTTG TATGCCGAAA GCCTGGATTT GCTGCCGCTG
AATGAACATA ACGGCTATCG CGAACGCAGC TTGTACACGC TCAAACGTGA GATCGCTGAG
ATTGCTGGTA GCTTGCTCGA TCAGCGGGCG GCCAATTTGC AAGCCCACTA CAGCGCCACT
TCAACCGCGC TTTATCAGCG CATCCTCGAT TTGTGTGCGG TGATCGATCG TGGCTCGCCC
GATTTGAATA TGCCAACCTA CAACGGTGGT TTGTTTAGCG CAACTAGCAC GAGCGGCCAG
TTTTTGCAAC GCTACGCCGT ACCCGATCGC TACTTGGCCT TGGGCTTGGA TCGGCTTGCC
CGCGATCTTG ATGATCGGAG CCAAGCATTA GTATTGATTG ATTTTAAGTC GTTGGGCGTG
CGCCAGCTTG GGAGCATTTA CGAAGGTTTG CTGGAATTTA AGTTGCACAT CGCCAGCGAA
TGCTTGGCCG TGACTAAGGA GAAAGGCAAA GAGGTTTATC AGCCAGCCGC TAAAGTTGCC
AAACCACTGG CAATTATCGA GCGGGGCATG GCCTATTTGG TCAACGATAA AAAGGAGCGC
AAGGCCACGG GCAGCTATTA CACGCCCGAT TATATTGTGA AATATATTGT GCAGCAGACG
GTTGGCGCGG TGCTTGATCA GAAATTTAAG GCCTTGGCTC CGCGTTTGCA CGAGGCCCAG
AAACAATATC GCAATTATGC TAATTTGGTC GCTGCTCGGG CCAAATCGAG CAAGCGCCCC
GAAAATCCGG CGGTGTTTTG GACTGACCCG AATGGTGCCA TGGGCCAGTT GCTCGACGAT
TGTTTGAATC TGCGGGTGCT TGACCCAGCT ATGGGCAGCG GCCATTTTTT GGTTGAAGTC
GTCGATTTTA TTAGCAATCG CTTGATTCAC TTTTTGAATG CCTGGAGCGA AAACCCCGTT
TGGGCGATGA TCGACCGCAC CCGCAGCGAA ATTGTGGCCG ATATGGAGCG CCAAGGGGTG
ACGATTGACC CCGAACGGCT TACGCGGGTG GCGCTGCTCA AACGGGCGGT GCTCAAACGC
TGCATCTACG GGGTTGACCT GAATGCGATG GCGGTGGAGT TGGCTAAGGT CAGTTTGTGG
CTCGATGCCT TTACCCTTGG CGCTCCCTTG AGTTTTCTCG ACCATCACTT GAAGCATGGC
AATAGCCTGA TTGGGGCGCG GGTTGCCGAG GTGCAAACCT ACCTTGATGT TGGTGGCCGC
CAAAGCCATA TGTTGGCGGG CAACGAGTTT GCAGGCCTGA GCCTGGCCAC CGATTTGATG
CGCCAAGTCA GTTTTTTGAG CGATAACACG GTTGAACAAG CGCAACAAAG CGCGGTGGCT
TTTCGCGATG CCGACCAGCA TCTCGCGCCC TTCAAGCGTA TGCTCGATGT TTATACCTCG
CGTTGGTTTG GCAATCAGCC CGCCAAAAAA AGCAAAAGCG ATTGGGTCAA CATCTTTTTG
CGCATGCCCA GCATCAAGCC TTGGTTGCAC GATTCCACAG TTAAATTAGA TGATAGCTTG
ATTGAGGCTA CCAAGATTGG CCAGATTGCG CTTGAAGCCG CTAGCAGCAA GCACTTTTTC
CATTGGGAGC TTGAGTTTCC CGAAATCTTT TTTGCGCCCA GCACGCCTGG CGGTCAAGAT
GTGCAGCTCA ATCCCAACGG TGGCTTCGAT GCGGTGGTGG GTAATCCGCC GTATATTAGA
ATTCAATTTC TAGATAAAAG TGATGTTGAT TATTTTAATA ATATATATCT CTCTCCTAAT
GGTTCATATG ATATATATAT ATTATTCATT GAAAAAAGTA TTGAATTATT AAATATTAAC
GGAATTAGTG GATATATATG TCCTAATAAG TTTATGACAA ATGCATATGG GGATAAAATT
AGGAATATAA TTGGAGAAAG TAGAAATCTT TTTCGTTTAG TTGATTTTGG CGATTATCAA
TTATTTGAAG GAGCTACTAC GTATTGCTGT TTAGTATTTT TATGTAAAAA TAATAATCGT
ACATTAGATA TTCCTGTTAT TTCTGTAAAA GATTACATTA ATATAGAAAA TAATAAAGTT
ATTAAATTTA ATATTGATAA TTTGATTAAT AATAAATGGT ATTTTGGTGT AAATAATGAA
TTAAATGAAA AAATTAAAAA AGTGGCAGGT AGAGATTTGG GAGAGATTGC TCATCCTCAT
TATTGTCTTT TTACGGGATT AAATGATGCT TTTGTTGTGA GTGAAAGTGA TATATTTAAT
TATGAACTTG AGAGAGATAT ATTAAAACCT TTATTAAGAG GACAAGATGT GAAAAGATGG
GAGGTTTTGC ATGAAAAACT ATATATTATC TATCCATACG AATTATATAA TAATAAAATG
TCTGTAATTG ATATAAATAA ATACCCAAAT GTTAAAAATT ATTTATTTAA ATTTAAAGAT
CTATTAATAA ATAGAGTGAA ATTTATTCAA AAAGTAAAAA ATATACACGA ACGAGAATCT
AGATGGTATG AATATATTGA TCCAAGATCA ACTTATCAAT TTGATCCATT GAAGATAATT
ACTCCTGAGA TTGCCGGATA TGGATCATTT GTTGTTGATA CCGAAAGTTA TTACTGTCTT
AATAAAGTAT ATGTTATAAA CCTTGAAAAT GTTGTCGAAA ACCCCTATTA TATTACATCT
ATCTTAAATT CATCTATTTC TTTCTATATG TTCAAAGATA TATCTACAAA GCTTGCAAAT
GGTTATTATG AATATATAAC TCAATATTTG AAACAAATAC CCATACCACG CATCGACTTC
ACGACCGAGC CAAGCGAACG GGAGCAATTA ACGCAGATCG CTATTGATTC GTATACGCAA
AAAAATGATA ATAATGTATT GACGCTCGTC AATACATTAT TCACCGCTGA GAATCCAATC
CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC
CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC
CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC
CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC CAATCCAATC
CAATCCAATC CGCTATGGCG ATGTTGTCCA CGATCTATTA GCCTACCTCG CCCAACAAAT
GATCGAGCTG AACAAAGCCA AGCAACAAGC CAGCAAGCGC TTCTTGAATT GGCTCGAAAG
CCAACTACGG ATTCAGCCCA AAAAAGGCGA AACTGGCCTC GACAGCCTGA CGGGCAAAAC
GATCATCCAA GGCTACCTTG GCGACTACCA AAAAGGCCAA CGCGCCGTCA GTTGGAGCGA
CTTCTGGTAT CGGCTGCAAC AAAACCGCAA TCGTTTTGCC GCCAATCTCA GCGAAATCGA
AGCCAGCATT GCCCAAGCCT ACCGCCAATC GCTCGATGA
 
Protein sequence
MTLDFDSNDI QLNKGDIRQL QNADALAGFL ARLQYAIDVR TTIDSYAQVG LDSAYLRQEI 
KHLELLATDP HDQEIKIYLL EVRSLTAALR NAIARAFRDR PELALVILTK DYESFEFVIL
LRELAQNTQR GAAIRQMLRP VPLTINRLHV SDVALRVLKR FTMTEPDALY QWDKLRSAFV
LAEWSEQYFN NRALFSDYYL KHRLTDAKLN AEWNEDVLPI GRKINPLLKT ARQQLTDQPL
ATMQKQFYAP ILADLGFVLQ AVAGEPAYRY DLALPNNPQP VAAFFGYVWN RNLDDQDSQR
DPATPLIIPG AEVVSTLEAG TVPWVIVSNG KYWRLYSTTA SNKATNYYEV DLEEAFGAQD
ALVALKYWWL FFRAQAFGGF LERLLKQSAE YAKGLGERLK DRVFTEIFPQ FAQGFIADMR
RQGANDVSEQ QLNPVFEGTL TFLYRLMFVL YAESLDLLPL NEHNGYRERS LYTLKREIAE
IAGSLLDQRA ANLQAHYSAT STALYQRILD LCAVIDRGSP DLNMPTYNGG LFSATSTSGQ
FLQRYAVPDR YLALGLDRLA RDLDDRSQAL VLIDFKSLGV RQLGSIYEGL LEFKLHIASE
CLAVTKEKGK EVYQPAAKVA KPLAIIERGM AYLVNDKKER KATGSYYTPD YIVKYIVQQT
VGAVLDQKFK ALAPRLHEAQ KQYRNYANLV AARAKSSKRP ENPAVFWTDP NGAMGQLLDD
CLNLRVLDPA MGSGHFLVEV VDFISNRLIH FLNAWSENPV WAMIDRTRSE IVADMERQGV
TIDPERLTRV ALLKRAVLKR CIYGVDLNAM AVELAKVSLW LDAFTLGAPL SFLDHHLKHG
NSLIGARVAE VQTYLDVGGR QSHMLAGNEF AGLSLATDLM RQVSFLSDNT VEQAQQSAVA
FRDADQHLAP FKRMLDVYTS RWFGNQPAKK SKSDWVNIFL RMPSIKPWLH DSTVKLDDSL
IEATKIGQIA LEAASSKHFF HWELEFPEIF FAPSTPGGQD VQLNPNGGFD AVVGNPPYIR
IQFLDKSDVD YFNNIYLSPN GSYDIYILFI EKSIELLNIN GISGYICPNK FMTNAYGDKI
RNIIGESRNL FRLVDFGDYQ LFEGATTYCC LVFLCKNNNR TLDIPVISVK DYINIENNKV
IKFNIDNLIN NKWYFGVNNE LNEKIKKVAG RDLGEIAHPH YCLFTGLNDA FVVSESDIFN
YELERDILKP LLRGQDVKRW EVLHEKLYII YPYELYNNKM SVIDINKYPN VKNYLFKFKD
LLINRVKFIQ KVKNIHERES RWYEYIDPRS TYQFDPLKII TPEIAGYGSF VVDTESYYCL
NKVYVINLEN VVENPYYITS ILNSSISFYM FKDISTKLAN GYYEYITQYL KQIPIPRIDF
TTEPSEREQL TQIAIDSYTQ KNDNNVLTLV NTLFTAENPI QSNPIQSNPI QSNPIQSNPI
QSNPIQSNPI QSNPIQSNPI QSNPIQSNPI QSNPIQSNPI QSNPIQSNPI QSNPIQSNPI
QSNPLWRCCP RSISLPRPTN DRAEQSQATS QQALLELARK PTTDSAQKRR NWPRQPDGQN
DHPRLPWRLP KRPTRRQLER LLVSAATKPQ SFCRQSQRNR SQHCPSLPPI AR