Gene Haur_4189 details

Gene Information

Locus tag: Haur_4189
Symbol:
ID: 5736051
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5342515
End bp: 5346255
Gene Length: 3741 bp
Protein Length: 1246 aa
Translation table: 11
GC content: 50%
IMG OID: 641281344
Product: hypothetical protein
Protein accession: YP_001546949
Protein GI: 159900702
COG category: [S] Function unknown
COG ID: [COG4995] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
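
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the function names are illustrative, not part of any database API):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(5342515, 5346255)
print(length_bp)                  # 3741
print(protein_length(length_bp))  # 1246
```

Both values agree with the Gene Length and Protein Length fields of this record.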


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCACCG CTTGGTTGAC TCAATGGCTG GCCAAACAAC GTTTAAATCC CCAATATCAA 
CAAGGGGCAA CCTTGCACCA AGCCTTGCAA TTGGTGCTCT ACGATTGGAT TCCCGCCGTA
TTCGAGGGTT TGGAGCAACA AGCCAAGGCC AATTGTTGGC CTGATCAAGA TTTTAGCCAA
CGCATAACTG CCAGCGCCAA GCAACTGGCA CTGTGGCATG GCAATCCACA ACGTTATTGC
GACTTGCAGC AACAGCAAAC CAACTTACCT TTGCAACTTC CCAGCAGCTG GCAAGCCGAC
CAACCAACCC TGAGCGCCAA ACAAATCGAT GCTTTGCGCT CGTTTTGTAG CAGCGTGAGC
AGCGATCCGC GCTCGGCCAG CTGGACAATT TGCGTTGCCG ATGAGGCTGA ATTGTATGCT
TGCGAGTTGC ATGCCCATGC CCTGTTTGGC GATATTGCCG CGCTGTGGCA GCCAATTATC
CAAGGTTTGG CCAAACTAAA TGAACTTGAA GCAGCGATTA GTTTGATTTT GCTTCAGCGC
GATTTGGCCG ATTATGCTGG GATTATTGCG AGTTTCAAGC GTGCTCAACA GGCGCATCCC
AAGGCCGCTG AGCAGCTTGT GCCTTGGCTC GACGATCAGC TAAAACCAGC CTTGATCGAT
GATTTGCGGG CGATGCTTGA GGCAGTTTTG CTGGCTTATC GGCTGCACAA CCCTGATGAT
CAAACCTTGC TTGAGGCCCT TCAGCAAGCA ATTATGGCGT TAACGCCAAT CGAAGTTGCC
CCCGACGATG GTAAAGGCGA GGAAGCGCCT GCTCCAACTC CGCAACGTGG CGTGCCGTTG
CCAGCGGTTA ATTTGCCAAT GCCGAATATT TTGCGCTTTG CTACGCCCAA AAACCTACTT
TTGAACCAGC TAAAAACCCA TTGGCAAGCC TTGTTGGCAG GGGTTTTGGT CGATGAGCAG
CTTGGGGCAC GCCATCGGAT GGTACTGCAA TGGTGGCTGG CGTGGACTGA ACATCAATTA
TTTAACGTGA CTCAAGCCCA AGCGTTGCTC GCCGAAATTC CCAGCACAAC TGCGCCATTG
GGTTTATTGA CCGAGGCTCT GCGGGGCGAA TTGGCTTTTT TAGCAGGCCA ACCTGAATTA
GCCCAACACT GTTTTGCCAC AACCCTTGCA GCCAGCCAGC AATTAATCAG CGAGCCAACT
TTTGCTAGCC AAACTAGCCA AAATCAGGCT TTTGGTGGCT TATTAGCCCA ATGGCAGGGC
AATAGTTTGG TGATGTTGGG CGATTACACG GCGGCTGAAC AATGCTATGT CAAGCTTGCC
AATTTGCTGG CCGCTGATGA TCGTGAGGGC CAATGTTTGG CAGCGATCAA TCGGGGCAAT
ATCGCCTTTG TCCGCAATAA TTTGCTTGAT CATCGCGGCT ATGTGACCTA CGACAAAACC
GAGCAAGTGC TATTGCGCGG CGATAAACCA CCCGAACATT TTCTAGGGCT GAAATATACC
AAGCATCGTC AAGCGCTTGT TCAAGCGCAA CAAGCTTATG CCGAGGCATT AAACTTGGCT
GAGCAAGAGC CGAATTTGGC TCAATATCGC AGCTTGATCA TCGCCAACCA AGCCAATATT
GTTTGGCTCC AAGCTAATTT GCTGCTTGAG TCTAGTTGGT TTTCCAGCGA TTTGGCGGCA
AGCTTGCAGG ATTTAGGGGA GCCAGCCCAG CTTTATCAAC AAGCCTTGAG CTTATTGCAA
CAAGCCTTGA CATTGCTCAA CCAAACCACG GCGGATCGGG TGCTGCAAGC GGTGTTGTTT
GCCAATATCA GCGAAGTTCA ATTGTTGCTG AAGCAGCCAG CCGAAGCCTT GAATTCCGCC
CAAAACTGCT TGAAAGCGCT CAATCTGAGC GATCTCAAGC CCCAAGCAGC CCTGAAACAA
GCCCAAATGC GCGGCGTGTT GATTCCCGAT GCTGGCTGGC GAGTTTGGTT GACCATGGCA
CGAGCCTACG AAGCGCTCAA CGATCTGGCT AAAGCCCAAC AAGCCTACGC CAATGCCTGC
GACTTGGTGC AAGTGTTGCG CAATAATGTG CAGCAAAGCG ATTGGCAAAT GGCCGCGCTG
CAAGATAAAT TTCAGGTGTT CGAATGCGCG ATGCATTTTC ATTTCAAGCA TTCCAGCGAC
CGCACTAGCT TATTGCAATT GAGTGAAAAT TTGCGTGCCC ATGGCTTTGA ACAATTGCTT
GAGGCCAGCC AAATTGATCG TGAGCGCGAA CTCAGCCCTG AGTTAAAGCA GCAACGGCTA
GCTCTGGCCG CCGCCTTGGC CGCGCGAAGC ATGGCAATTC GTTTAGCCAT GCAACAGCCC
GCTGGCGACG ATTTAGTTAA ATTATTGAAC GATCAGCGAG CTGCATTTGC CGATTGGCAA
GCGCTCCAAA GCAGCATTGC CCAAGCCTTG GAAAGCCAAA ATCAAGAACT CCAGCCGCAA
CTAATAACCT GGCCAAACTT ACAAAAAGCC TTGGCCGAAC GACCCAACAC GGTGATTTTG
AGCTTTACGA TTGGCAGCGA ATGGAGCTAT TTGCTATATA CCGATGGTCA GCAGTTGCAA
GCATTTGAGC TAGCCTGCCG CGCCAAAATT GAATATGCCG TGGCCCGTTT GATTTGGTAT
GCCCAGCGTG GGGCGGCGCG TTGGCAAGAA TTTGTGCGAG CTAATCGCCA TGTTGTTGAG
TGTTTGTTAG GGCCGCTCTG GGCGGCGGGG CTAAAAGAGC AGCTACGTGG CAAACAATTA
ATCATCATTC CAGATGGGAT TTTGTATTAT TTACCGTTTG ATTTGCTGTT TTTCGATGAC
CCAGTTGATG CCAACCAGCA GCCACTTGAT CCTGCGACCT ATCGCCAAAA AGCGCCTGAC
CAGGCTACGC CTGAGCAAAT TTGCCGTGAT CTCGTGCCAT TTTATTGGCT GAATCACGCC
ACAATTTCCA CTGCCCAATC GATTAGCCTT TGGCTGCAAC TTCAGCAACA AGCAGCAACC
CAAGCAGCAA ATCTGGCGCT GGGCGTTTAC AACATCAACT ATCAAACCAA TGTGCCAAGC
GTCTACCCAG GCCATATCTA CGCCCAAGAA TTGATGATTA GCTACAACGA TTTGAGCCAA
ACCAGTGTGC TCAGTAAGGT ATTGGGCGAT TTGGCTGCAC ATGGAACCAC CCTGCAATTA
ACCGCTTGGC AAGCCGATCA CACGCCCTAT CAAGCTGAAT TTCAATCCAA CGAAGCCAAT
TTCAAGCGAA TTTTGGCTGA GCAGGCCATG CGTTACATCA TTTTTGCGGG CCATGGCGTG
TTCAACGATA AATATCCGCA ATTTTCGGGC TTGGTGTTTA ATTTGGCAGC GCCTGATGGC
AGCAGCGATC AGAGCGGGCA AGATGGATTT TTGGGCATTC ATGACCTTTT TGAATTACGC
ATGCCCAATA CCGAATTGAT TTTTTTGGCA GCATGCCAAG GTGGTTTGGG TCTAATCTCA
CGTGGCGAGG GCATCAATGG CCTGACCCGC GCCTTGATGT TTCGCGGTAG TCCGACAATT
ATTGCTAGCT TATGGTCGGT TGATGTGTTG GCAACCATGG ATTTAGTTGA GGCTTATTTC
GAGTTGCTGA GCCAACAACC AACCGCTGAT AAAGCCGAAA TCTTGACCAA AGCTAAGCAA
AAAATGCTGG CCCAGCCCAA TAAACCACAT TTAGTCCATC CATTTTATTG GGCCGCTTTT
ATTCCAATTG GAAAGCGTTA G
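
The GC content field can be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as shown here, in space-separated blocks of 10 bases, and that GC content is the percentage of G and C bases over the full length; `gc_content` is an illustrative helper, not a database function.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base block separators and line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Example on the first line of the gene sequence only; paste the full
# sequence to compare against the GC content field of the record.
first_line = "ATGACCACCG CTTGGTTGAC TCAATGGCTG GCCAAACAAC GTTTAAATCC CCAATATCAA"
print(round(gc_content(first_line), 1))
```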
 
Protein sequence
MTTAWLTQWL AKQRLNPQYQ QGATLHQALQ LVLYDWIPAV FEGLEQQAKA NCWPDQDFSQ 
RITASAKQLA LWHGNPQRYC DLQQQQTNLP LQLPSSWQAD QPTLSAKQID ALRSFCSSVS
SDPRSASWTI CVADEAELYA CELHAHALFG DIAALWQPII QGLAKLNELE AAISLILLQR
DLADYAGIIA SFKRAQQAHP KAAEQLVPWL DDQLKPALID DLRAMLEAVL LAYRLHNPDD
QTLLEALQQA IMALTPIEVA PDDGKGEEAP APTPQRGVPL PAVNLPMPNI LRFATPKNLL
LNQLKTHWQA LLAGVLVDEQ LGARHRMVLQ WWLAWTEHQL FNVTQAQALL AEIPSTTAPL
GLLTEALRGE LAFLAGQPEL AQHCFATTLA ASQQLISEPT FASQTSQNQA FGGLLAQWQG
NSLVMLGDYT AAEQCYVKLA NLLAADDREG QCLAAINRGN IAFVRNNLLD HRGYVTYDKT
EQVLLRGDKP PEHFLGLKYT KHRQALVQAQ QAYAEALNLA EQEPNLAQYR SLIIANQANI
VWLQANLLLE SSWFSSDLAA SLQDLGEPAQ LYQQALSLLQ QALTLLNQTT ADRVLQAVLF
ANISEVQLLL KQPAEALNSA QNCLKALNLS DLKPQAALKQ AQMRGVLIPD AGWRVWLTMA
RAYEALNDLA KAQQAYANAC DLVQVLRNNV QQSDWQMAAL QDKFQVFECA MHFHFKHSSD
RTSLLQLSEN LRAHGFEQLL EASQIDRERE LSPELKQQRL ALAAALAARS MAIRLAMQQP
AGDDLVKLLN DQRAAFADWQ ALQSSIAQAL ESQNQELQPQ LITWPNLQKA LAERPNTVIL
SFTIGSEWSY LLYTDGQQLQ AFELACRAKI EYAVARLIWY AQRGAARWQE FVRANRHVVE
CLLGPLWAAG LKEQLRGKQL IIIPDGILYY LPFDLLFFDD PVDANQQPLD PATYRQKAPD
QATPEQICRD LVPFYWLNHA TISTAQSISL WLQLQQQAAT QAANLALGVY NINYQTNVPS
VYPGHIYAQE LMISYNDLSQ TSVLSKVLGD LAAHGTTLQL TAWQADHTPY QAEFQSNEAN
FKRILAEQAM RYIIFAGHGV FNDKYPQFSG LVFNLAAPDG SSDQSGQDGF LGIHDLFELR
MPNTELIFLA ACQGGLGLIS RGEGINGLTR ALMFRGSPTI IASLWSVDVL ATMDLVEAYF
ELLSQQPTAD KAEILTKAKQ KMLAQPNKPH LVHPFYWAAF IPIGKR