Gene Information |
Locus tag | Haur_0099 |
Symbol | |
ID | 5731992 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 127983 |
End bp | 129485 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277221 |
Product | hypothetical protein |
Protein accession | YP_001542879 |
Protein GI | 159896632 |
COG category | [S] Function unknown |
COG ID | [COG3372] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000355367 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCATTTA CACCAGCCGA TTTCAAATAT ACCAGCCGCA CCGGTGAGCT TGGCCGCCAA CTCTACCCGC ATCAATTGCG CGATGATCGC TATTTGGCAG CGATTGACTA TGCGATTGGC TATTACGAGC AGATGCTTGG CCGTGCGCGG CGTGAATTTG AAGCCGCTAC CTTGCTGGAG TTTTTTGGCG ATCCCAAGTT GGCACGCGGC TTGGTAGCCT GTTTAAGCCA AACCTATCGC TGGCATCAAC CGCAACTAGC CGAAGTGCTC GATCAACCTA CCTATGCCCA ATTGGTTGAA CGCGGCTTAC GCAATTCAGC CGATTTTCGG GCCTTGCTGT ATGCCCATGC CAATCAAACT CATGGCTTCA TTTTGCCAAG CGAACGCTGG CCTGCGGTAA GCCAACTAGC CGCTGAATTG GGATTAACGC CCAGCCAATT TGAGCGGGTG CTCTACCTTG ATGATGAGCA AGAAGCATTG TTGGTGCGCA GCGCCGAACG CCCAGAACCA AGCGCAATCG TGGCTTTATA CAATTTTCAT TCGCTCGAAA CTGGTTTGCG CAACTGTCGT TCGTTGCAAC TACGGCTTGA TGGCGATATT AATGCCTTGG CCGTTTCAGC CCATAATTTA GCCCAACGCT ATAACCTGCG CTACGAATTA AGCGAACCAG AAGATTGCAT AGCGACGTTC GTAACCCTGA CCTTGCATGG AGCCAAGGAT GCGCTAGGCA ATTGGACGCG CACAGGCCGC CGAATTGCCC GTTGTGCCTT GCGTCTACTA GCGGCCCATC CCAATGCGGC CAGTGAAGGC CTAATTCAGG TGCATATGCA GGGCAAAAAT AGCCTAATCA AGCTGGCAAA ACGCGAGTTG ACGGTGCTTG GTGGCACTGC TCGTCAGCAA CCAGACAACC TTGGCGATAG CTGGGAAACC ACGCTTGAAC AGCAATTTAG TCAAGCCTGG AGCCGCTTGG TTAGCAAAAG CCAAAATGCA GGCTGGCGCA TTCGACGTGA TCCTGTGCCA TTTAGTTTGG CTCATCGGTT ACTTGTCCCC GATTTCATTG CCCAACGCGG CAGCGAACGT ATTCCGATCT TTGTGCCTGC GACCGAAGCG ATGGCAGCGA GTTTGGCGCA GCGCTTGGTT GGTCAACCGA AGGTCTTGGT CGTAATTGCC AAAAGCTATC AAAACCTCTT GCGCAATTGC CAGGTTGCCA AAGTTATGTA TCAAACCACG CCCGATATGC TGACGGTGCT TGCGCAACTC GAACAACTTA CCCCAGCCCA AGCACCATTA GATCGCTGGA GTCGCCTAGC GTTGCGCTTC GATCAAGCTG GATTTGTGGC CGAAACCGAG CTTTTAGAAA TTTTGGAATG CCGTAATCCT GTTGAAATTA GCTTAGCGCT GCGTGGTTGG CGCGAAGGCA CAGCCCAGTA TGTGCCAAAT CTTGGCCTAT TTACGCCGCA AAAACTGCGT GAATTAGGTA GTATGCTTGG CAAAGCCGCC TAG
|
Protein sequence | MSFTPADFKY TSRTGELGRQ LYPHQLRDDR YLAAIDYAIG YYEQMLGRAR REFEAATLLE FFGDPKLARG LVACLSQTYR WHQPQLAEVL DQPTYAQLVE RGLRNSADFR ALLYAHANQT HGFILPSERW PAVSQLAAEL GLTPSQFERV LYLDDEQEAL LVRSAERPEP SAIVALYNFH SLETGLRNCR SLQLRLDGDI NALAVSAHNL AQRYNLRYEL SEPEDCIATF VTLTLHGAKD ALGNWTRTGR RIARCALRLL AAHPNAASEG LIQVHMQGKN SLIKLAKREL TVLGGTARQQ PDNLGDSWET TLEQQFSQAW SRLVSKSQNA GWRIRRDPVP FSLAHRLLVP DFIAQRGSER IPIFVPATEA MAASLAQRLV GQPKVLVVIA KSYQNLLRNC QVAKVMYQTT PDMLTVLAQL EQLTPAQAPL DRWSRLALRF DQAGFVAETE LLEILECRNP VEISLALRGW REGTAQYVPN LGLFTPQKLR ELGSMLGKAA
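The coordinate, length, and GC fields in this record can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon; both assumptions are consistent with the values listed above. The function names are hypothetical helpers introduced for this example.

```python
# Minimal consistency check for the fields in this record.
# Assumptions (not stated by the record itself): coordinates are 1-based
# and inclusive, and the protein length excludes the stop codon.

def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of a sequence.

    Whitespace (such as the 10-base block spacing used in the record)
    and any other characters are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return gc_count / len(bases)


if __name__ == "__main__":
    # Values copied from the Gene Information table.
    start_bp, end_bp = 127983, 129485
    gene_length_bp, protein_length_aa = 1503, 500

    print(span_length(start_bp, end_bp) == gene_length_bp)
    print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions both checks hold for this record: 129485 - 127983 + 1 = 1503 bp, and 1503 / 3 - 1 = 500 aa. The reported GC content can be compared the same way by passing the Gene sequence string to gc_fraction.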
|
| |