Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5124 |
Symbol | |
ID | 5737082 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 168633 |
End bp | 171488 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641282289 |
Product | hypothetical protein |
Protein accession | YP_001547880 |
Protein GI | 159901634 |
COG category | [S] Function unknown |
COG ID | [COG1262] Uncharacterized conserved protein |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGATA GCACTGATGG TTCCGCTTAT GTGCATGATT CGCATATTAA TGGGGTTGTT GTCGGAACGA ATCTTGGTAC GATTCTCTAT GGCCGTCCAC CAGAAGAAGC CGAGCGCGAT CTTTTAGTAC GATATTTGCA ACGAGTGACG GAGTTGCATA GCACGATGCC TGTGGTTGGG CTTGGATCAT CACGGCTCGA TGCTGGTCTT GACCTTGCGT CGGTCTATAT GATGCTCGCG GTGCAGGGCC GTTATCGGGC CGTGCGGACA CTCACGGCAG AAGAAACTGA AGCCTATCGC CAACAGCGCT TTGTCATTCC CAAGGAACTC AGTACGGATC GGTGTTTGCC CGATCAGGCT ATTGTGATGG TTAACACCAA TCAATCAGGT GAATTGGCCT TATTCCGCGC TGAACTCGCA ACTGAAACGG TCTTAGACCA CCCGCACCTT GTGCTGTGTG GCGCACCGGG GTGTGGGAAA TCGACCTTTG CGCATCATTT GGTGTGGGTC TTGGCCCAAC GTGGGTTGGA TCAGATTAAC CACCATACGG GCTTGCTCGG TTGGAATGAC ACACAGCGCC TGTTGCCGAT TGTGATGCCG CTGCGGCGTT TAGCAGGAGC CTTGGTGGGC ACCGATGTGG GGTTGACCGA TGCCATGCCA AATGTTGGGT TGCTGCGTGA TGCGGTGTGT GCCCATATGC AAACGAAATA TGGCATTGAG AAACCACACA CGCTGCTTGA TGCCGGATTA GCACGTTCGC TAAAAGTGCT GTTGGTGTTC GATGGCCTGG ATGAAGTGCC ACTCGAAGCC TCCTCCACGA GCCTTGATCA GAGGACGGTG CTGCGGTTTA TCCGTCGGTG TGCCGGGTTG AACGTCCGCA TCCTGATTAC GTGTCGCTCA CGGGCATGGA CGGATGAGTA TCGCCAGATC ACGCAGTGGC CGATGGTTAA ATTGGCTCCA TTGACGGGCG GCCAGATGAC CCAGTTTATT CACACGTGGT TTCCGCAGTT GCGTACCAAG GGGGTGATTG CATACGAGGC GATTGCGCGA TTGAGCCAGC GCTTAGTGCA GGCATTGCGT GACCCCCACC GTGAAAAATT ACGCAAAATG GCCGAAAATC CCTTATTGCT GAGCATGATC ATCTTTGTAA TGGCGGATAC CGGCAATTTG CCCCACGACC GCGCCAAGCT CTACGAGCAG ATCTTAGAGC AATTGCTGGA ACAGTGGGAT GCCAAACGGA ATGGGCACGA CCTGGCGCAG GCGATTGGTG ATGAACGCAT TACGGGGAAA AAACTCCGTG ATTTCGTGCT GGATCGGCTG TGCTATCAGG CCCATCTCAC AACGACATCG AACGATGGTC GTGGGCAAAT TGACGCAATG CAGCTCAAAA AGGCATTAAT GGACTATTTT GCCAAAATCA AGATCAAAAC GAACGATCCC TATTGGGCGG CTGAACGCTG TGTTGCCTAT ATTGATCAGC GGAGTGGCTT GCTTCAACCT GACGATACAG GCAATGTCTA CACGTTTGCC CACCTGACGT TGCAAGAACA TTGCGCTGGC CGTCATTTGT TGTTTGAGGA GCCGCTCCAG CAGATGTTAG CCGTCCGCCG TGATGATCGC TGGCGTGAGC CGATCTTTTT GGGCGTTGGT TGCCTAGCGG ACGATAAACG GGCATCGAGC AAAATTGGCG AGGTATTATC AGCGTTGATC AACCCTGATG AATTTCGGAG TAAACCGCGC AAACCGAAAC ATCGTTATGA ATGGTATCGC GATCTGGTGC TTGCTGCTGC GATTGGAGCC GATTGCGATT GGGATACGCT GAATGGGACT GATCTGGATG TTGAGTATTT TCACGGTGCG TTGCGGTACG GCATTGTCAT CCTGCTTGAA GACCGTGCGC ATGCCCAAGC AGCACTGGAT TACTACCACG GTCAACTCAT GGAGCCAGCG CCATTATTGG TGCGCGAACG GCAAAAAGCT GCTGAGCTAT TAGCAGGTCT GGGCGACTCG CGTTATCCGG TGAATAGTGA CCAATGGCAA CAGGAGACGC GTCAGCTTTC CACCCAGTTT AGTCGCGAGG GCACCCACTA TTGGCGCTTT GTGCCTGCGG GCCACTATCA GGTTGGCGGT TGGTATTACG ATGAACAACC TCCAACCGTC GTACTTCAAC CCTACTGGGT CGGGCGGTTT ATGATTACGG TGGAGCAATA TCAGGCATTT ATTGAGGCAG GTGGCTATAC CAACGAGGAT TACTGGACGA AGCATGGTCG TGCCTATAAA AAGCGTTCTA ACAAAATAGT GCCTCGCTGG TGGGATGATC AAACCGAGCA GGAATACCGC AATCAGCCTA TTTATGGGGT GAGTTGGTAT GAGGCGGTGG CCTATTGCCA GTGGTTGACC CAGCAGCTTA GCCCATTCTT ACCGCAGGGG TATGGTATTC GCCTTGCCAG CGAAGCTGAA TGGGAGGTGG CAGTGGCTTA TACCACCGAT GGACAGCGCC AACCCTCTCC GTGGGGTGCA CAACCTGTTA CGCCGGAACA TGCGATCTAT GATTGGAGTA CGAAAAACCG CCCCTTATCG GTTGGTGTAG GATTGCTGGG GCAAGCGGCC TGTGGTGCAC TGGATAGCGT TGGCAACCTG TGGGAATGGA CAGCTACGCC CTATCAGCAA AACCATGGTG CGGTGCAGCT GACGCTTGCA GATAGTGACG ATGATATGGC GGTGCGAGGT GGCGCATATT ATAGTAGGAG TACAGATATT CGTTGCACGG CGCGGCACAG GCTTCGTCCC GACTTCGACG ACTATCACCG AGGATTTCGT TGTATGCTCG CCCCTGTTGT GAATGCTGAG TCCTGA
|
Protein sequence | MADSTDGSAY VHDSHINGVV VGTNLGTILY GRPPEEAERD LLVRYLQRVT ELHSTMPVVG LGSSRLDAGL DLASVYMMLA VQGRYRAVRT LTAEETEAYR QQRFVIPKEL STDRCLPDQA IVMVNTNQSG ELALFRAELA TETVLDHPHL VLCGAPGCGK STFAHHLVWV LAQRGLDQIN HHTGLLGWND TQRLLPIVMP LRRLAGALVG TDVGLTDAMP NVGLLRDAVC AHMQTKYGIE KPHTLLDAGL ARSLKVLLVF DGLDEVPLEA SSTSLDQRTV LRFIRRCAGL NVRILITCRS RAWTDEYRQI TQWPMVKLAP LTGGQMTQFI HTWFPQLRTK GVIAYEAIAR LSQRLVQALR DPHREKLRKM AENPLLLSMI IFVMADTGNL PHDRAKLYEQ ILEQLLEQWD AKRNGHDLAQ AIGDERITGK KLRDFVLDRL CYQAHLTTTS NDGRGQIDAM QLKKALMDYF AKIKIKTNDP YWAAERCVAY IDQRSGLLQP DDTGNVYTFA HLTLQEHCAG RHLLFEEPLQ QMLAVRRDDR WREPIFLGVG CLADDKRASS KIGEVLSALI NPDEFRSKPR KPKHRYEWYR DLVLAAAIGA DCDWDTLNGT DLDVEYFHGA LRYGIVILLE DRAHAQAALD YYHGQLMEPA PLLVRERQKA AELLAGLGDS RYPVNSDQWQ QETRQLSTQF SREGTHYWRF VPAGHYQVGG WYYDEQPPTV VLQPYWVGRF MITVEQYQAF IEAGGYTNED YWTKHGRAYK KRSNKIVPRW WDDQTEQEYR NQPIYGVSWY EAVAYCQWLT QQLSPFLPQG YGIRLASEAE WEVAVAYTTD GQRQPSPWGA QPVTPEHAIY DWSTKNRPLS VGVGLLGQAA CGALDSVGNL WEWTATPYQQ NHGAVQLTLA DSDDDMAVRG GAYYSRSTDI RCTARHRLRP DFDDYHRGFR CMLAPVVNAE S
|
| |