Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3345 |
Symbol | |
ID | 5735215 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4217602 |
End bp | 4219722 |
Gene Length | 2121 bp |
Protein Length | 706 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280492 |
Product | RNA-binding S1 domain-containing protein |
Protein accession | YP_001546109 |
Protein GI | 159899862 |
COG category | [K] Transcription |
COG ID | [COG2183] Transcriptional accessory protein |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTATG CCCATTTAAT TGCCAAAGGC TTGACGGTGC GTGCCGAGCA AGTTACCGCT GCGATTCAAT TATTCGATGC AGGCAATACT TTGCCGTTCG TCGCTCGTTA TCGTAAAGAG CAAACTGGTG GCTTGGATGA AGAACAATTA CGCAGCATTC AAAGCCAAAT TGCCCGTTTA CGTGAGCTTG ACGAGCGGCG CGAGGCGATT TTGAGTGCTC TGCGCGAGCA AGGCAATTTG AGTGATGAAT TGGCCCAAGC CTTGGCCGCC GCTACTGATA AAACCACCCT CGAAGATTTG TATGCGCCGT TCAAGCCCAA ACGCCGTACC CGCGCCAGCA TAGCCCGCGA ACGTGGCCTC GAAGGCTTGG CTGAAATTAT TCAAATGCAG CCGAATGATC CGATTGATGC TACCGCTCGC CAGTTTCTCA ACGAACAGGT TGCCAGCATT GAAGAGGCCT TGGCTGGAGC ACGCGATATT GTAGCCGAGC AAATCAGCGA TCATCCTGAG GTGCGCCGTC AAACTCGCGA ACGCGCTTTG CGTTGGGGCG TGGTTCGCAG CGAGTTAATC GCCGATGCCG AGGATAGCAA GGGCGTATAT CAAACCTACT ATCAATTTGA GAGCACGGCC AGCCGCCTCA AGCCTTACCA AGTGTTGGCG CTCAACCGTG GTGAAACCGA ACATATTTTG CGTTTTAAAA TTCAGATGGA TCAACGTGAT TGGTTTGATG TGGTTGCCAA GTATTTTCCG CTTGATCAAC GTTCAGCTTG GGCCGAGCAA CTGCGCTTGG CGATCCACGA TGGCGCTGAG CGATTGCTCT TGCCAGCAAT TGAACGCGAT GTGCGCCGCG CCTTGACTGA GCAAGCCGAA AGCCATGCGA TCACCGTCTT TGCCAAGAAT GTGCATTCGT TGTTGCTGCA AGCGCCAATC GCCAATAATG TGGTGCTCGG GCTTGATCCA GGCTACCGCA CTGGTTGCAA AGTGGCGATT ATCGGCCAAA CTGGCAATGT GCTCACGACT GCCACGATTT ATCCTCACAG CGGCGCGGCA GCGCGTGAAC GGGCGTTTCA AGAATTGCAA AGCTTGATCA AACGCTATGC TGTGAGTTTG ATTGCCATTG GCAATGGCAC GGCCTCGCGC GAAACTGAGC AATTAGTCGC CGATGTGATT CGCCACCAAA CTGGTTTGCA CTATTTGATT GTCAGCGAAG CAGGAGCCAG TGTTTACAGT GCTAGTACGC TTGCCCGCAG CGAATTGCCT GATCTCGATG TCAGCTTGCG CGGCGCGGTT TCGATTGCAC GGCGGGTGCA AGATCCCTTG GCCGAGTTGG TCAAAATCGA GCCAAAAGCG ATTGGCGTAG GCATGTATCA ACACGATGTT GATCAATCGG CCTTGGGCAA TGCGCTTGAT GGCGTGGTTG AGAGCGCAGT TAATAATGTT GGGGTTGATG TCAACACCGC TTCGCCTGCG CTTTTACGCT ATGTTGCCGG GATTGGCCCC AAACTTTCGG CTCAAATTGT CAGCCATCGC GAGGAACATG GCCCATTTCG TTCGCGGGTT GCACTCAAAA AGGTCAAAGG GCTTGGGCCA AAAGCCTTTG AACAGGCCGC CGGATTTTTG CGAATTCGCG ATGGCGATGA AGCCTTGGAT GCCAGCGCAA TTCACCCCGA AAGTTATACG GTTACCCGTA ATTTGTTGGA CAAGCTGAAT ATTAACGCCA AAACAGGCCG CAACGAACGA ATCAAACGCT TGGAAGATTT GAAAAATCAG CCATTGCATA GCCTTGCGGC GGAATTGGGC ACAGGCGTAC CAACCCTGAG TGATATTATT GACCAACTGC TGCGGCCAGG TCGCGACCCC CGCGAGGATG TGCCAGCACC AATTTTGCGC AGCGATGTGC TGGCTTTTGA AGATTTGCAG CCGGGCATGC AGCTCAAAGG CACAGTGCGC AATGTCGTCG ATTGGGGCGC ATTTATCGAT TTGGGGGTTA AGCACGATGG CTTATTGCAC CGTTCGCAAA TTCCCCGTGG CCTGAGTTTG AGTGTTGGCG ATATTGTCGA TGTTAGCATT CAATCGATCG ACCCAGATCG CAAACGGATT GCCTTAGTTT TAGCACAATA A
|
Protein sequence | MDYAHLIAKG LTVRAEQVTA AIQLFDAGNT LPFVARYRKE QTGGLDEEQL RSIQSQIARL RELDERREAI LSALREQGNL SDELAQALAA ATDKTTLEDL YAPFKPKRRT RASIARERGL EGLAEIIQMQ PNDPIDATAR QFLNEQVASI EEALAGARDI VAEQISDHPE VRRQTRERAL RWGVVRSELI ADAEDSKGVY QTYYQFESTA SRLKPYQVLA LNRGETEHIL RFKIQMDQRD WFDVVAKYFP LDQRSAWAEQ LRLAIHDGAE RLLLPAIERD VRRALTEQAE SHAITVFAKN VHSLLLQAPI ANNVVLGLDP GYRTGCKVAI IGQTGNVLTT ATIYPHSGAA ARERAFQELQ SLIKRYAVSL IAIGNGTASR ETEQLVADVI RHQTGLHYLI VSEAGASVYS ASTLARSELP DLDVSLRGAV SIARRVQDPL AELVKIEPKA IGVGMYQHDV DQSALGNALD GVVESAVNNV GVDVNTASPA LLRYVAGIGP KLSAQIVSHR EEHGPFRSRV ALKKVKGLGP KAFEQAAGFL RIRDGDEALD ASAIHPESYT VTRNLLDKLN INAKTGRNER IKRLEDLKNQ PLHSLAAELG TGVPTLSDII DQLLRPGRDP REDVPAPILR SDVLAFEDLQ PGMQLKGTVR NVVDWGAFID LGVKHDGLLH RSQIPRGLSL SVGDIVDVSI QSIDPDRKRI ALVLAQ
|
| |