Gene Haur_2616 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2616 
Symbol 
ID5734494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3354901 
End bp3359121 
Gene Length4221 bp 
Protein Length1406 aa 
Translation table11 
GC content52% 
IMG OID641279756 
Producthypothetical protein 
Protein accessionYP_001545382 
Protein GI159899135 
COG category 
COG ID 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0177484 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAACATC GTTACACTCG TTTAGCAACG GTCTTAGCGT TGTTAACCAT GTTAGCGGGC 
AATTTGAGCG CTAACAATGT CACACGGGCG GCAGAACCCC TCAAGCCCGT TGTGGAAACG
GCTACTGATT CGGTTCAAGT GAACCAAGCC AGTGCTAATC GTTCAAGCAA GCCCACAATT
GCCCCATTGT ATGGTGATTT CCCCAAGTCT GGCAAAGTTA AAGTTATTGT TCAATTCCAC
GAAGCTCCGT TGGCAACCTA TGCTGGCGAA GTTAAGGGTT TGAAGGCTAC CGCCAATGCG
CAAACTGGCC GCGCCAAGCT TGATGTCAAG GCGGCTGAAT CACAAGCCTA CTTGAAACAT
CTTGAAGCTC AGCACGCCAG CTTCCTCAAG GATGTGCAGG CCAAATCAAA CCAAATCAAA
CAAATTGCTG ATTATCAAGT TGCCTTGAAC GGGATGTTCT TGGATGTTGA TGTTTCAGCG
ATTACCGTCT TGCGCAACCA CCCTGATGTT GCATCAGTTG AAGTTGCCAA AGTTGAAAAA
CTCGATACAG ATAGCAGCAA CCAATTCATT GGTACGCCAA GCTTTTGGCA AAACCTTGGT
GGTGGCAGCA CCGGCACGAA CGCTGGTGAA GGCATTGTGA TTGGGGTGAT CGATAGCGGG
ATCTACAACC CTTCGAGCAC CTTGACCACG ACTGGTATCC ACCCATCATT GCGCGATCCA
TCACCAGTTG GTGGCGATTA CAGCGCGTGG CCAGGTGGCT ACAAAGGCGT TTGTGCTCCA
TCCAACCCAC AAGCTCAAGA TGGTTCGTTT GGCGCTTGTA ACGACAAACT AATCGGGGCT
TGGTGGTATA ACGGTGGCGG CATCGCATTC CCTGGTGAAG TTGATTCACC AATGGATGAA
GATGGCCACG GCAGCCACAC CGCTACCACT GCTGGTGGTA ATGGTGGCAC TGCTTCACCA
TTCGGCACCG TTTCAGGGGT TGCTCCACGC GCCCGAATCA TTGCCTACAA AGTTTGTTGG
GAAGAAGATC CAGCAATCAG CGATGACGGC GGTTGTAACG GTGGCGACTC GGTTGATGCA
ATCGACCAAG CCTTGATCGA CGGCGTTGAT GTGTTGAACT TCTCGATCAG TGGTGGTAAC
TCACCATGGA CCGATGCGGT TGAAATTGCA TTCCGTAACG CTAACGCTGC TGGTGTGATT
GTTAGCGCCT CGGCTGGTAA CGCTGGTACG GTTGGGAGCG TTGCTCACAA CTCACCATGG
TTGATCACCG TTGCTGCAAG CACCCAAGCC CGCGAATTCA AAGGCTATGT AAGCAACTTG
AGCGGTGGTA CGGGCACAGC TCCGTCACCA TTGGTTGGTG CTTCACTTTA CCCAGGCTCA
GTCACTGGCC AAATCAAGTT GGCTCCAGTG CAAGCAGGCG AAAGTGTTGC CCCAAGCTTG
TGTCAACAAC CCTATGCACC AGGAACCTTC AACGCAACCG ATATCGTGAT TTGTCGCCGT
GGTGTCAACG CCCGGGTGCT CAAATACGCT AACGTGTTTG CTGGTGGTGC TGGCGGTGGG
ATCATCGTCA ACGCTGCTGA TAACCAAGGT ATTGTTGCCG ACTACTGCGC CAAAGTTTGT
ATTCACCTCG AAAAGAATGC AACCTTGGAA AGCGTTGCTG GCGAAGCCTT GGTAACCTAT
GTCCAATCAG GCACTGTTAA TGGTAAAGCT GATGGCGGTG TCAAGGTTAT GGGCGCTGGC
GATAAGATGG CCGGCTTTAG CTCAATGGGT CCTTCGCCCA TCAAGAACGT GCTCAAACCA
GACGTAACTT TGATCGGTGT TGATGTCAAC GCTGGTCATA CTGCATTCGT TTGGGATGAT
GGCTTCGCTG ATGGCGAATT GTTCCAAGTT ATCGGTGGTA CCTCAATGTC AAGCCCGCAC
AACGCTGGCG TGACCGCTTT GATGCGCCAA AAATACCCAA CTTGGACTCC ATTCGAAATC
AAATCGGCTT TGATGACGAC TGCTAAGACC AGCGTTGTGA AGCCCGATGG TGTAACCGCT
GCCGATCCAT TCAACATGGG TGCTGGCCGC GTCGATTTGA CCAAAGTATT CAGCGCTGGC
TTGGTGCTTG ATGAAACCAT TCCAAACTTC ATCAATGCTA ACCCATCAGC TGGTGGCAAC
CCTGCAACCT TGAACATTGC AAGTGCTGCT AACGAATCGT GCCCAGTCGA ATGTACGTGG
ACACGGACAG TCAAGAACAC CTTGAACGTT CCTGCAACTT GGAATGTTAG TGGCTCAACC
TCGCCTGTAA CGGTAACTGC AACTCCAGCC AGCTTCACCA TTCCTGCTGG TGGAACCCAA
GTTATCACGA TCAAGGCCAA TGTTGCTGGC CAACCAATCA ACGGCGTTTG GAAGTTTGGC
GAATTCCGCT TGACCGAAGC CGCAAACCGT GCTCCAGCCG TCCACTTCCC AATTGCCGTC
AAGGCGAGCG CTGGTGGCGT GCCCGAGAGT GTTTCTGCAA CCACCCGCCG CGATACGGGT
ATGGTGACTG CTGGTGGCTT TAGCGCCCTC AACGTAACCA ACCTGACCAT CCGCGAATTT
GGCTTGACCA AGGGTGTTCG CGCAACCGAA GTTATCACTG GCGATACCAC CAATGGTAAC
CCATACGATA GCTTGACCAA CGGTGTCTTC TATCGCGCAT TGCAAGTGCC TACTGGCGCA
AGCAAGTTTG TCGCTAAGAT CACCGATACC GCATCAGAAG ACATCGACTT GTTTGTTGTC
CGCGATGCTA ACGGCGACGG GATTCCTCAA GCAGGTGAAC AAGTTGCTGA ATCAGCAACC
TCATCGGCTT ACGAAACCGT CACGATCAAC AACCCAACGG CTGGTAACTA CATTGTGGTT
GTCCAAAACT GGCAGGCTTC ACCTGCTGCT CCCGACTCAA CCACGGTTGA TATCTACTAC
GTACCTGGCA GCAACCTTGG CAACTGGGAT GTTGTTGGTC CAGCCAATGT TTCAGGCAGC
GAACCATTCA GTGTTGATAT TCACTGGGAT GAGCCAGCTC TCGAAGCTGG TGATGTCTGG
TTCGGCGAAT TCGACCTTGG CACCAGCCCA GCTAGCCCAG GTAGCATTGG TCGTACCGCC
TTCACCCTCG AAGTGTTGGA AGCCGAAGTC AGTGTTGATG TTAGCGATAC CATGGTTGAT
GTTGGCGAAT ATGTCACCTA CACCGTGAAT CTCTCTAACT ACAGCGCAAT GAACGATACC
TTCTATGTAA CCAGCACCTT GCCAGTTGGC TTGGAAGTTG ATGTGAGCAG CTTGCCTGCT
GGCGCGAGCT ACAACGCATC AACCCGCACG GTTACTTGGA GTGGTTTGGT TAATGGTGCT
GAAACTGGTT ATACCTTCAG CGATAGCCGC GATGGTGGCG TAGCATTCGA TAACTTCGAT
GTTTCAGCTG ATGCCAACGC TATTGCGATT TGTGCTGCTG CCAGTACTAC CTGTGACGAA
ACCGCACCAA ACTTCACCTT AGGCACGTAC CTTGTTCGCG CCTACGATGT GAATTATGGC
GCTGTTCGCC CATGGAGCAA CGGTTTGATT CAATTCGACC CAACGCTTCC ATCAGGCAAT
ACCAACTATG TTGCCCAAGA TATGCCAAAC TCAGCAACTC CTAACGGAGT GTACGCTGGT
CTTTGGACGG ACTTGGACTT GGATGGTTCT TCAGCTACCG ATACTGGTGG TGGTGAGTAT
TACATCTTGA TTGTTGATGG GGTCAATCCA AACGATCCAA CTGTACCGTA TGTTGCAGTT
GAATACGAAG ATGCCCAACA ATATGACGAT CCAACTTCAA ACCTCAACTT TACCAACTAT
GCCCGCGCTG ATGGCATTGA GTCAGAGTTC TGTACCGTTT ACGGTTCGCT CACTGGCAAT
TTGACCACCT TCGCCGATGG TGCAGCCGTT GGGATCGAAA ACTTGACTGG TACGGTTGGC
AACACCTACT ACTACAGTGG TGATAGCTCG ACCGCCGCCA ACTTGCCAAC CGCAGGTGCA
ACCATCTGTA CCTCATTGGG TGCTGGCGCA CCAGGCTTGC GCTCGTTCAG CTACCGCGCT
CGCGCAACCC AAGCTGGTGT CTTGACCAAT AACCTTGAAT ATAGCTCAGC TGCTGATAGC
AATACCAGCA CTGAAGCTGT GGCTATCGAA GCAGTTCAAG CAGTGTTCAA GATCTTCATG
CCATTGGTTA GCAAGTCGTA A
 
Protein sequence
MKHRYTRLAT VLALLTMLAG NLSANNVTRA AEPLKPVVET ATDSVQVNQA SANRSSKPTI 
APLYGDFPKS GKVKVIVQFH EAPLATYAGE VKGLKATANA QTGRAKLDVK AAESQAYLKH
LEAQHASFLK DVQAKSNQIK QIADYQVALN GMFLDVDVSA ITVLRNHPDV ASVEVAKVEK
LDTDSSNQFI GTPSFWQNLG GGSTGTNAGE GIVIGVIDSG IYNPSSTLTT TGIHPSLRDP
SPVGGDYSAW PGGYKGVCAP SNPQAQDGSF GACNDKLIGA WWYNGGGIAF PGEVDSPMDE
DGHGSHTATT AGGNGGTASP FGTVSGVAPR ARIIAYKVCW EEDPAISDDG GCNGGDSVDA
IDQALIDGVD VLNFSISGGN SPWTDAVEIA FRNANAAGVI VSASAGNAGT VGSVAHNSPW
LITVAASTQA REFKGYVSNL SGGTGTAPSP LVGASLYPGS VTGQIKLAPV QAGESVAPSL
CQQPYAPGTF NATDIVICRR GVNARVLKYA NVFAGGAGGG IIVNAADNQG IVADYCAKVC
IHLEKNATLE SVAGEALVTY VQSGTVNGKA DGGVKVMGAG DKMAGFSSMG PSPIKNVLKP
DVTLIGVDVN AGHTAFVWDD GFADGELFQV IGGTSMSSPH NAGVTALMRQ KYPTWTPFEI
KSALMTTAKT SVVKPDGVTA ADPFNMGAGR VDLTKVFSAG LVLDETIPNF INANPSAGGN
PATLNIASAA NESCPVECTW TRTVKNTLNV PATWNVSGST SPVTVTATPA SFTIPAGGTQ
VITIKANVAG QPINGVWKFG EFRLTEAANR APAVHFPIAV KASAGGVPES VSATTRRDTG
MVTAGGFSAL NVTNLTIREF GLTKGVRATE VITGDTTNGN PYDSLTNGVF YRALQVPTGA
SKFVAKITDT ASEDIDLFVV RDANGDGIPQ AGEQVAESAT SSAYETVTIN NPTAGNYIVV
VQNWQASPAA PDSTTVDIYY VPGSNLGNWD VVGPANVSGS EPFSVDIHWD EPALEAGDVW
FGEFDLGTSP ASPGSIGRTA FTLEVLEAEV SVDVSDTMVD VGEYVTYTVN LSNYSAMNDT
FYVTSTLPVG LEVDVSSLPA GASYNASTRT VTWSGLVNGA ETGYTFSDSR DGGVAFDNFD
VSADANAIAI CAAASTTCDE TAPNFTLGTY LVRAYDVNYG AVRPWSNGLI QFDPTLPSGN
TNYVAQDMPN SATPNGVYAG LWTDLDLDGS SATDTGGGEY YILIVDGVNP NDPTVPYVAV
EYEDAQQYDD PTSNLNFTNY ARADGIESEF CTVYGSLTGN LTTFADGAAV GIENLTGTVG
NTYYYSGDSS TAANLPTAGA TICTSLGAGA PGLRSFSYRA RATQAGVLTN NLEYSSAADS
NTSTEAVAIE AVQAVFKIFM PLVSKS