Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4189 |
Symbol | |
ID | 5736051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5342515 |
End bp | 5346255 |
Gene Length | 3741 bp |
Protein Length | 1246 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641281344 |
Product | hypothetical protein |
Protein accession | YP_001546949 |
Protein GI | 159900702 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACCG CTTGGTTGAC TCAATGGCTG GCCAAACAAC GTTTAAATCC CCAATATCAA CAAGGGGCAA CCTTGCACCA AGCCTTGCAA TTGGTGCTCT ACGATTGGAT TCCCGCCGTA TTCGAGGGTT TGGAGCAACA AGCCAAGGCC AATTGTTGGC CTGATCAAGA TTTTAGCCAA CGCATAACTG CCAGCGCCAA GCAACTGGCA CTGTGGCATG GCAATCCACA ACGTTATTGC GACTTGCAGC AACAGCAAAC CAACTTACCT TTGCAACTTC CCAGCAGCTG GCAAGCCGAC CAACCAACCC TGAGCGCCAA ACAAATCGAT GCTTTGCGCT CGTTTTGTAG CAGCGTGAGC AGCGATCCGC GCTCGGCCAG CTGGACAATT TGCGTTGCCG ATGAGGCTGA ATTGTATGCT TGCGAGTTGC ATGCCCATGC CCTGTTTGGC GATATTGCCG CGCTGTGGCA GCCAATTATC CAAGGTTTGG CCAAACTAAA TGAACTTGAA GCAGCGATTA GTTTGATTTT GCTTCAGCGC GATTTGGCCG ATTATGCTGG GATTATTGCG AGTTTCAAGC GTGCTCAACA GGCGCATCCC AAGGCCGCTG AGCAGCTTGT GCCTTGGCTC GACGATCAGC TAAAACCAGC CTTGATCGAT GATTTGCGGG CGATGCTTGA GGCAGTTTTG CTGGCTTATC GGCTGCACAA CCCTGATGAT CAAACCTTGC TTGAGGCCCT TCAGCAAGCA ATTATGGCGT TAACGCCAAT CGAAGTTGCC CCCGACGATG GTAAAGGCGA GGAAGCGCCT GCTCCAACTC CGCAACGTGG CGTGCCGTTG CCAGCGGTTA ATTTGCCAAT GCCGAATATT TTGCGCTTTG CTACGCCCAA AAACCTACTT TTGAACCAGC TAAAAACCCA TTGGCAAGCC TTGTTGGCAG GGGTTTTGGT CGATGAGCAG CTTGGGGCAC GCCATCGGAT GGTACTGCAA TGGTGGCTGG CGTGGACTGA ACATCAATTA TTTAACGTGA CTCAAGCCCA AGCGTTGCTC GCCGAAATTC CCAGCACAAC TGCGCCATTG GGTTTATTGA CCGAGGCTCT GCGGGGCGAA TTGGCTTTTT TAGCAGGCCA ACCTGAATTA GCCCAACACT GTTTTGCCAC AACCCTTGCA GCCAGCCAGC AATTAATCAG CGAGCCAACT TTTGCTAGCC AAACTAGCCA AAATCAGGCT TTTGGTGGCT TATTAGCCCA ATGGCAGGGC AATAGTTTGG TGATGTTGGG CGATTACACG GCGGCTGAAC AATGCTATGT CAAGCTTGCC AATTTGCTGG CCGCTGATGA TCGTGAGGGC CAATGTTTGG CAGCGATCAA TCGGGGCAAT ATCGCCTTTG TCCGCAATAA TTTGCTTGAT CATCGCGGCT ATGTGACCTA CGACAAAACC GAGCAAGTGC TATTGCGCGG CGATAAACCA CCCGAACATT TTCTAGGGCT GAAATATACC AAGCATCGTC AAGCGCTTGT TCAAGCGCAA CAAGCTTATG CCGAGGCATT AAACTTGGCT GAGCAAGAGC CGAATTTGGC TCAATATCGC AGCTTGATCA TCGCCAACCA AGCCAATATT GTTTGGCTCC AAGCTAATTT GCTGCTTGAG TCTAGTTGGT TTTCCAGCGA TTTGGCGGCA AGCTTGCAGG ATTTAGGGGA GCCAGCCCAG CTTTATCAAC AAGCCTTGAG CTTATTGCAA CAAGCCTTGA CATTGCTCAA CCAAACCACG GCGGATCGGG TGCTGCAAGC GGTGTTGTTT GCCAATATCA GCGAAGTTCA ATTGTTGCTG AAGCAGCCAG CCGAAGCCTT GAATTCCGCC CAAAACTGCT TGAAAGCGCT CAATCTGAGC GATCTCAAGC CCCAAGCAGC CCTGAAACAA GCCCAAATGC GCGGCGTGTT GATTCCCGAT GCTGGCTGGC GAGTTTGGTT GACCATGGCA CGAGCCTACG AAGCGCTCAA CGATCTGGCT AAAGCCCAAC AAGCCTACGC CAATGCCTGC GACTTGGTGC AAGTGTTGCG CAATAATGTG CAGCAAAGCG ATTGGCAAAT GGCCGCGCTG CAAGATAAAT TTCAGGTGTT CGAATGCGCG ATGCATTTTC ATTTCAAGCA TTCCAGCGAC CGCACTAGCT TATTGCAATT GAGTGAAAAT TTGCGTGCCC ATGGCTTTGA ACAATTGCTT GAGGCCAGCC AAATTGATCG TGAGCGCGAA CTCAGCCCTG AGTTAAAGCA GCAACGGCTA GCTCTGGCCG CCGCCTTGGC CGCGCGAAGC ATGGCAATTC GTTTAGCCAT GCAACAGCCC GCTGGCGACG ATTTAGTTAA ATTATTGAAC GATCAGCGAG CTGCATTTGC CGATTGGCAA GCGCTCCAAA GCAGCATTGC CCAAGCCTTG GAAAGCCAAA ATCAAGAACT CCAGCCGCAA CTAATAACCT GGCCAAACTT ACAAAAAGCC TTGGCCGAAC GACCCAACAC GGTGATTTTG AGCTTTACGA TTGGCAGCGA ATGGAGCTAT TTGCTATATA CCGATGGTCA GCAGTTGCAA GCATTTGAGC TAGCCTGCCG CGCCAAAATT GAATATGCCG TGGCCCGTTT GATTTGGTAT GCCCAGCGTG GGGCGGCGCG TTGGCAAGAA TTTGTGCGAG CTAATCGCCA TGTTGTTGAG TGTTTGTTAG GGCCGCTCTG GGCGGCGGGG CTAAAAGAGC AGCTACGTGG CAAACAATTA ATCATCATTC CAGATGGGAT TTTGTATTAT TTACCGTTTG ATTTGCTGTT TTTCGATGAC CCAGTTGATG CCAACCAGCA GCCACTTGAT CCTGCGACCT ATCGCCAAAA AGCGCCTGAC CAGGCTACGC CTGAGCAAAT TTGCCGTGAT CTCGTGCCAT TTTATTGGCT GAATCACGCC ACAATTTCCA CTGCCCAATC GATTAGCCTT TGGCTGCAAC TTCAGCAACA AGCAGCAACC CAAGCAGCAA ATCTGGCGCT GGGCGTTTAC AACATCAACT ATCAAACCAA TGTGCCAAGC GTCTACCCAG GCCATATCTA CGCCCAAGAA TTGATGATTA GCTACAACGA TTTGAGCCAA ACCAGTGTGC TCAGTAAGGT ATTGGGCGAT TTGGCTGCAC ATGGAACCAC CCTGCAATTA ACCGCTTGGC AAGCCGATCA CACGCCCTAT CAAGCTGAAT TTCAATCCAA CGAAGCCAAT TTCAAGCGAA TTTTGGCTGA GCAGGCCATG CGTTACATCA TTTTTGCGGG CCATGGCGTG TTCAACGATA AATATCCGCA ATTTTCGGGC TTGGTGTTTA ATTTGGCAGC GCCTGATGGC AGCAGCGATC AGAGCGGGCA AGATGGATTT TTGGGCATTC ATGACCTTTT TGAATTACGC ATGCCCAATA CCGAATTGAT TTTTTTGGCA GCATGCCAAG GTGGTTTGGG TCTAATCTCA CGTGGCGAGG GCATCAATGG CCTGACCCGC GCCTTGATGT TTCGCGGTAG TCCGACAATT ATTGCTAGCT TATGGTCGGT TGATGTGTTG GCAACCATGG ATTTAGTTGA GGCTTATTTC GAGTTGCTGA GCCAACAACC AACCGCTGAT AAAGCCGAAA TCTTGACCAA AGCTAAGCAA AAAATGCTGG CCCAGCCCAA TAAACCACAT TTAGTCCATC CATTTTATTG GGCCGCTTTT ATTCCAATTG GAAAGCGTTA G
|
Protein sequence | MTTAWLTQWL AKQRLNPQYQ QGATLHQALQ LVLYDWIPAV FEGLEQQAKA NCWPDQDFSQ RITASAKQLA LWHGNPQRYC DLQQQQTNLP LQLPSSWQAD QPTLSAKQID ALRSFCSSVS SDPRSASWTI CVADEAELYA CELHAHALFG DIAALWQPII QGLAKLNELE AAISLILLQR DLADYAGIIA SFKRAQQAHP KAAEQLVPWL DDQLKPALID DLRAMLEAVL LAYRLHNPDD QTLLEALQQA IMALTPIEVA PDDGKGEEAP APTPQRGVPL PAVNLPMPNI LRFATPKNLL LNQLKTHWQA LLAGVLVDEQ LGARHRMVLQ WWLAWTEHQL FNVTQAQALL AEIPSTTAPL GLLTEALRGE LAFLAGQPEL AQHCFATTLA ASQQLISEPT FASQTSQNQA FGGLLAQWQG NSLVMLGDYT AAEQCYVKLA NLLAADDREG QCLAAINRGN IAFVRNNLLD HRGYVTYDKT EQVLLRGDKP PEHFLGLKYT KHRQALVQAQ QAYAEALNLA EQEPNLAQYR SLIIANQANI VWLQANLLLE SSWFSSDLAA SLQDLGEPAQ LYQQALSLLQ QALTLLNQTT ADRVLQAVLF ANISEVQLLL KQPAEALNSA QNCLKALNLS DLKPQAALKQ AQMRGVLIPD AGWRVWLTMA RAYEALNDLA KAQQAYANAC DLVQVLRNNV QQSDWQMAAL QDKFQVFECA MHFHFKHSSD RTSLLQLSEN LRAHGFEQLL EASQIDRERE LSPELKQQRL ALAAALAARS MAIRLAMQQP AGDDLVKLLN DQRAAFADWQ ALQSSIAQAL ESQNQELQPQ LITWPNLQKA LAERPNTVIL SFTIGSEWSY LLYTDGQQLQ AFELACRAKI EYAVARLIWY AQRGAARWQE FVRANRHVVE CLLGPLWAAG LKEQLRGKQL IIIPDGILYY LPFDLLFFDD PVDANQQPLD PATYRQKAPD QATPEQICRD LVPFYWLNHA TISTAQSISL WLQLQQQAAT QAANLALGVY NINYQTNVPS VYPGHIYAQE LMISYNDLSQ TSVLSKVLGD LAAHGTTLQL TAWQADHTPY QAEFQSNEAN FKRILAEQAM RYIIFAGHGV FNDKYPQFSG LVFNLAAPDG SSDQSGQDGF LGIHDLFELR MPNTELIFLA ACQGGLGLIS RGEGINGLTR ALMFRGSPTI IASLWSVDVL ATMDLVEAYF ELLSQQPTAD KAEILTKAKQ KMLAQPNKPH LVHPFYWAAF IPIGKR
|
| |