Gene Haur_1986 details


Gene Information

Locus tag: Haur_1986
Symbol:
ID: 5733875
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2443419
End bp: 2445467
Gene Length: 2049 bp
Protein Length: 682 aa
Translation table: 11
GC content: 50%
IMG OID: 641279130
Product: hypothetical protein
Protein accession: YP_001544757
Protein GI: 159898510
COG category:
COG ID:
TIGRFAM ID:
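
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration and assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2443419
end_bp = 2445467

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 2049
print(protein_length_aa)  # 682
```

Both results match the Gene Length (2049 bp) and Protein Length (682 aa) fields.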


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAATACG TGGATCTGGT GATCAAGGGC TATCTTAATG CTTCAAATTA TTTTCAGATG 
AATGTTCAAA TGCGCGGTTT GGTTTCCGAT GGTGGTGCGG TTGATAACAT TCCAATTAGT
TTAGAAACGA TTACCAAATT GCGCGAACGC TATCAACGCT TGATCAAAGA CTTTCATCAC
AGCATCTACA ACCATGCGAA GCCTGAAGAT CCGAGTATGT CCAGGTTGCT CACACAGGCT
GCGGCTGTGG CTGTCTACCA CTCTGCCGAT TCGATCGAGT CCTTGCGGGC CTTGCGATCG
GCGTGCGGAG AGTTTATGAC GGTATGGGAT ACGTCAATTA AGCCACCCGC CTTGTTTCAG
GAACACTATC ACCAGCTTTG GCAACTTGGT AATCAAATGC TGCAAATGCT TCCTGGTACA
GCAAAAACTA TTCTGCGTGA CAGTATTTGG CAAACCCAAT CACGCCAAGC AAAACAGGGT
TTACGGGTTA TTCTCGATGT TGCTGAGAAT GCCCGTAGCC TGCTTGACTT ACCATGGGAG
TTGTTGGTGA TTCCCACAGC GCATCAAACA CTGGCGGCCC AGGGTATCGA GCCGCTAAAT
GAAGCTGAAT TTTCAACCCG TTTTTTGTTT TTGCACCATC ATATGGTGTT AATTCGCCAA
GTTTCGAGCA TGCTGCCCTA TCAGCCAATT ACAATTGACC GTAATTTAGC GTTACAAGTT
ATTGCGGCAC CGTTAGTCCG CTCGCCAATC GATACCAGAT CGTTTATTGC TGAATTAGCC
CCCTTGTTTG CTGGCGAAAC ACTTGAGCGC TGGTGGAGCA GTGATCCAAA TACGATTGAA
ACATTGCATA AGCGCTTGCT TGAACATCAA CCTCAAATTG TCCAATTGTT GTGTCATGGG
CATTCGCCCA AGCCCGAAGC CAAACTACAG CGTCATGATA TGCTTGTAAC CTATATTCTG
AATAATCAAC AGGTGGTTTA TCGAGTTAGC TCGCATGATC TTTGGCCGAT TCTGAGTGCT
TCGGCACGGC TTCAGTTGGT TGTCTTAACG GTGTGTCATA GCAGTGGTGC GCAACAAACC
GAAGAAAATA CGGCAACTGT TAGCAATATT GCCTACGATT TGGTGCGGGC TGGCGTGCCA
ATGGTTATTG GAATGCAAGG AGCGATTGCC CAACATGCCG CCGCACGCTT CTGTGGAGTT
TTATATACGG CCTTGCGTGA AGGCCATACG ATCGAATGGG CGATCACGGC GGCACGGGCG
GCGCTGAGTG GCAATCGCTG GTTTATCGAT TGGACAATTC CCGTGGTGTA TCGCCAAGCT
GATCAACGTG AACGCCCAGC ATGGCATACC CGTTTGGCCG ATTTTCTTGA TGCGCGGTTA
CTTTCCCCTT CCTATCGCCG TGGTTTTCGA GCTGCGGTGA TTGTGCTCGC TTTGGGCTTG
ATCATTGGTG GCTTGAGTCG CGCAATCTTT TGGCCTAGTC AATTGAGGGT CAATCTTGAA
CTATTGCGCA CTGGAGCGTT TCTGTGGGCC ATAATTGGGG TGACTTGTAC CCTACTGGTT
GATCACTTTA TGACCAGTTG GCGGCCCCCT CATTTAGCGC CGCATGAAAT TGTTGCTCGT
CGTTATGCTA GCCGTGGTGG AATGTTGTTA GGCTATGCGA TTGGCGGCTG GGCTGGGGCA
TTATTGCTTG GTGGGCTGTT TTTGAGCATT GGCGAATTAA TCAGTCCACC GATTTGGCAA
GCGCTTTTTC TGGGGCTGGT TGGTTGGTCG AGCTTGTGGG GCTATGTGGT TGCCCGCTCG
GAAAGCCGTG CTGCGCATAA CAATTGGCGG CTTTATCCGC AACTGTATGT AGCCAAGAAT
GGCTGGTTGT CGGTGTATCT TGGGATGCTC CTGCTGCTCT TTATTGTGCC ATTGGGTCTG
TTGACCGCCT ATGGTCAAAG TCTGGTTGGG CTTATCTTGA GTTCAGGTTT GGGCAGCGCA
GCGATGGGGG TAACTGCGCT GACCATGATC TACAGTTTTG ATCGTGAACG GCGGGCTAAG
CTGGGTTAG
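
The GC content field in the Gene Information table can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as a string in the same layout as above (blocks of ten bases separated by spaces and line breaks); `gc_fraction` is a hypothetical helper name, and only the short final line of the sequence is used here as sample input.

```python
def gc_fraction(block: str) -> float:
    """Return the fraction of G and C bases in a whitespace-grouped sequence."""
    # Remove the spaces and line breaks used for display grouping.
    seq = "".join(block.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Sample input: the last line of the gene sequence (9 bases, 5 of them G or C).
excerpt = "CTGGGTTAG"
print(round(gc_fraction(excerpt), 3))  # 0.556
```

Passing the full 2049 bp sequence instead of the excerpt gives the value to compare with the reported GC content.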
 
Protein sequence
MQYVDLVIKG YLNASNYFQM NVQMRGLVSD GGAVDNIPIS LETITKLRER YQRLIKDFHH 
SIYNHAKPED PSMSRLLTQA AAVAVYHSAD SIESLRALRS ACGEFMTVWD TSIKPPALFQ
EHYHQLWQLG NQMLQMLPGT AKTILRDSIW QTQSRQAKQG LRVILDVAEN ARSLLDLPWE
LLVIPTAHQT LAAQGIEPLN EAEFSTRFLF LHHHMVLIRQ VSSMLPYQPI TIDRNLALQV
IAAPLVRSPI DTRSFIAELA PLFAGETLER WWSSDPNTIE TLHKRLLEHQ PQIVQLLCHG
HSPKPEAKLQ RHDMLVTYIL NNQQVVYRVS SHDLWPILSA SARLQLVVLT VCHSSGAQQT
EENTATVSNI AYDLVRAGVP MVIGMQGAIA QHAAARFCGV LYTALREGHT IEWAITAARA
ALSGNRWFID WTIPVVYRQA DQRERPAWHT RLADFLDARL LSPSYRRGFR AAVIVLALGL
IIGGLSRAIF WPSQLRVNLE LLRTGAFLWA IIGVTCTLLV DHFMTSWRPP HLAPHEIVAR
RYASRGGMLL GYAIGGWAGA LLLGGLFLSI GELISPPIWQ ALFLGLVGWS SLWGYVVARS
ESRAAHNNWR LYPQLYVAKN GWLSVYLGML LLLFIVPLGL LTAYGQSLVG LILSSGLGSA
AMGVTALTMI YSFDRERRAK LG
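
The protein sequence corresponds to a direct translation of the gene sequence. The sketch below illustrates this on the first and last lines of the gene sequence only. It assumes the standard codon assignments, which translation table 11 shares with the standard genetic code for all codons other than alternative start codons; since this gene begins with ATG, that difference does not matter here. The names `CODON_TABLE`, `clean`, and `translate` are illustrative.

```python
# Codon table in the conventional TCAG ordering (standard assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(block: str) -> str:
    """Strip the display grouping (spaces, line breaks) from a sequence block."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


first_line = "ATGCAATACG TGGATCTGGT GATCAAGGGC TATCTTAATG CTTCAAATTA TTTTCAGATG"
last_line = "CTGGGTTAG"

print(translate(first_line))  # MQYVDLVIKGYLNASNYFQM
print(translate(last_line))   # LG
```

The two outputs match the first twenty and the last two residues of the protein sequence; the final TAG codon is the stop codon and contributes no residue.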