Gene Haur_4643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4643 
Symbol 
ID5736490 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5932500 
End bp5935553 
Gene Length3054 bp 
Protein Length1017 aa 
Translation table11 
GC content50% 
IMG OID641281807 
Producthypothetical protein 
Protein accessionYP_001547402 
Protein GI159901155 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.162315 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTACAAC GTTGGTTAAT TGGCTGTTTG GTTGTGTCGC TGATGTTGAT TCGACAGCCC 
GTGATTGCCC AAATTACCAC GGCGACACTC ACGCTACAAC GCCACGATCA GCAACTGCAT
ATCAACGTGC AATTACCCAA ACCAACCCTG CAACCCAATA GCATCAACAT TGCAGGCTGG
CAAAATGATG CCACACCTGA TCAGCCGGCC TTGCCGCGTT CAAGCCACTG GTTAGTTGTG
CCAGCAGGCT ACCAACTCAG ACTAAAATCG GTTAATCCTC AACAACTACA ACACTATCAG
CAACAACTTA GCCTCACTCC TAGCAGTGGT TGGCAGGTTG ATCCACTCGC GCCAAGCAAG
GCCATAGCGC TAAGCGCACC AAGTGTAGCC GTCAAACAAG CCCAATATCC AACCACATGG
GCCAACCTTG GTCAGAGCGT CCAAGTGCGG GAGCAACAAC TTGTGCCATT AACTATTTTC
GGGGCGCAAT GGCAACCAAG CAAACAGCAA ATAGTTGTAC CAAGCTCAAT CGACATAGCG
CTAGAATTTG TGGCAAGTAC CGAGCAACCC AGCTTGCGAG CTGATCCATT TTGGAACGAA
CTGCTACGTC AGCAAGTGCT CAATCCCAGC GATCTACAAA ATCCAGCATT ACGCCCAGCC
TTTGCCACAA CCACGCCAGT AACCAATGGA GTGCGAGTGA GTTTTGCCAA CCCAGGCATC
AGCGAAATTC GTTGGAGCGA TTTGCAGGCG GCGGGCGTGC CAAGCCAATG GCTCAATCAA
TCGGCTAATT TACAACTATG GCAAGGGCGC AATCAACTGC CACGGTTGCT GACTGCCACG
GGCATGATTT TTTATCTACC GCCCTACAAT CGTGATCAAA GCCTGCAAGG GAGCGTGATT
GTGCGCTGGA ATGGACAGCA ACCAGGCAAT GTGTTGGTTA GCGAATCAGT CAATTCGGCC
AACCCAAGCC TGAGCTACTA TAGCGAGACC TTGCGCTTGG AAGAACAAAA ACTCTATCTG
AGCGCCTTTC CGGCCAGCGG CACAAATCGT TGGTGGTGGC AATATTGGTA TAGCCCAGGC
TCCGGCCAAA GCGCTCAGCC CTTGCAAATT AATTGGAATT TGGATAATGC AACTCGCTTC
GATCAGCCAG CCCGCTTGCG GTTGCGCTTG CATGGCGGCA AGCTTGGCAA TCGGCATCAA
GCCGAAATTC GCCTGAACAA TCGTTTGCTC ACAACCGTCA CAATAACCGG CTTTCAACTG
CTTGAGTCAA CGATTAACCT GCCAAGTGGC TGGCTTAGCG CAACCAACCA ACTCACAATT
ACCCCAATGA GTACCGAGCG CGAAACCAGT TTTCTCGATT GGGTTGAGCT AGATTACCAG
CGTCAAGCGC AGGCAGTTGC GGGCCAATTA CAATGGTCAA GCAGCCAAGC CAACCAAAGC
ATTAGCAATA TCATCAGCGA AAATCCGTTG CTATTCGATG TGCAAACGCC CTTGGCTCCG
CGCCGCTTGA TTGGCTGGAA TTTGCAGCAA GGCCAATTAA GCTGGCAAAC CAGTGGCAAT
CGCCGCTACC TTGTGCAGAG CCAACGCCAA ACACCGTTGA GCAGCGTTTG GTTTAGTCAG
CCCGATTTGA GCAGCACCAG CCAGCAAGCC GATTATTTGC TGATTAGCTA TAACCCAGCC
AACTCCTCGA GCTGGAGCGA TGCACTGCAA CCATTGATTA CCCAACGCGC CAGCCAAGGC
CTCAAGCCAT TATTAATTGA TGTGCAGCAG ATTTACGATC AATTTGGCGA TGGGCGGGTT
GATCAACAGG CGATCGCTGA TTTTATCAAG TATGCCTATC ATAATTGGCA AGCACCAGCG
CCTAGTTTTG TGGTGTTAGT TGGCGATGGC ACGGCAGATC CGCACGATTA TGCTGATATT
ATTGGACAAC CCGTGACCAA TTTTATTCCA CCCTATTTGG CCGATGTTGA CCCGTGGTTG
CGCGAAACAG CCGCCGACAA TCGCTATGTA ACGGTTGCTG GCAACGATAC TTTGCCCGAT
TTGTTCTTGG GGCGCATTCC AGCGCGTTCG CTGAGCGATG TTGAACATGT AGTTGCGAAA
ATATTAAGTT ACGAAGCCAC GCCCAGCAAC GCCGATTGGC TGAACAAGCT GTTGTTTATC
GCCGATGATC CAGATGTATC GGGCGATTTT CCGTGGCTTT CAAATGAGGT GGTGGAAATT
TTACCGCCAA CGGTTGATGA TCAGCAGTTG TACTACACAG CGAATACCAA TCTGACGAAT
TTTCGGGCAG AAATCGTTAA TCAGATCAAT AATGGTCAAT TTTTGGTCAA TTATGTTGGC
CATGCGGGCA TCGATGTTTG GGCTGATCCG ACGATTTTCA ACCAGCAATC GGTGGCAAGT
TTAAGCAATA GCGCCTTGCC GTTGATGCTC TCGTTGAGTT GCTATGCTGG CCATTATCAA
CAAAATGACC TTGAATCGTT GGCCGAAATG CTGGTGTTGC AGCCTGAGCA TGGAGCAGTT
GGTATGTGGG CGGCTAGTGG TTTGGGCATC GCCCATGGCC ACGATTACCT GAATCGTGGG
TTTGTAAACT CGATTATCAA CGATGGTTGG CGCTTGGTTG GGCCAGCAAC AATTCAAGGC
AAGCTTGATT TAGCAGCGGC CAATATCTCG CCCGATTTGC TCGACACCTT CAGCTTTTTT
GGTGATCCGG CCTTGCGTTT GCCCTTACCA ACCAACAATG CTTGGCAACC ACAGGCCGAT
TATTACGAAG TTTTGCAATA TTCACAGGCT AATCGGCTAA CTCCATTGGC GAATGATCAA
GCCGATTTTA GCCAAATTAT CAGCCTTGAA CAACCACAAC ATGGCCAAGT ATGGCTCGAT
GCAGATCAAC GCAGCGTTCG CTACACGCCT GATCCCGTTT ATAATGGGCT TGATTCATTT
AACTATCAGG TGCGCAATTT GAGCTTGAAT CAAACCCTAA GTGCGACAGT GACGATTAGC
GTAACCGCGA TTGCGCCGCA GCTTTACCTA CCATTGACCA TCGCAGATTA TTAA
 
Protein sequence
MLQRWLIGCL VVSLMLIRQP VIAQITTATL TLQRHDQQLH INVQLPKPTL QPNSINIAGW 
QNDATPDQPA LPRSSHWLVV PAGYQLRLKS VNPQQLQHYQ QQLSLTPSSG WQVDPLAPSK
AIALSAPSVA VKQAQYPTTW ANLGQSVQVR EQQLVPLTIF GAQWQPSKQQ IVVPSSIDIA
LEFVASTEQP SLRADPFWNE LLRQQVLNPS DLQNPALRPA FATTTPVTNG VRVSFANPGI
SEIRWSDLQA AGVPSQWLNQ SANLQLWQGR NQLPRLLTAT GMIFYLPPYN RDQSLQGSVI
VRWNGQQPGN VLVSESVNSA NPSLSYYSET LRLEEQKLYL SAFPASGTNR WWWQYWYSPG
SGQSAQPLQI NWNLDNATRF DQPARLRLRL HGGKLGNRHQ AEIRLNNRLL TTVTITGFQL
LESTINLPSG WLSATNQLTI TPMSTERETS FLDWVELDYQ RQAQAVAGQL QWSSSQANQS
ISNIISENPL LFDVQTPLAP RRLIGWNLQQ GQLSWQTSGN RRYLVQSQRQ TPLSSVWFSQ
PDLSSTSQQA DYLLISYNPA NSSSWSDALQ PLITQRASQG LKPLLIDVQQ IYDQFGDGRV
DQQAIADFIK YAYHNWQAPA PSFVVLVGDG TADPHDYADI IGQPVTNFIP PYLADVDPWL
RETAADNRYV TVAGNDTLPD LFLGRIPARS LSDVEHVVAK ILSYEATPSN ADWLNKLLFI
ADDPDVSGDF PWLSNEVVEI LPPTVDDQQL YYTANTNLTN FRAEIVNQIN NGQFLVNYVG
HAGIDVWADP TIFNQQSVAS LSNSALPLML SLSCYAGHYQ QNDLESLAEM LVLQPEHGAV
GMWAASGLGI AHGHDYLNRG FVNSIINDGW RLVGPATIQG KLDLAAANIS PDLLDTFSFF
GDPALRLPLP TNNAWQPQAD YYEVLQYSQA NRLTPLANDQ ADFSQIISLE QPQHGQVWLD
ADQRSVRYTP DPVYNGLDSF NYQVRNLSLN QTLSATVTIS VTAIAPQLYL PLTIADY