Gene Information |
Locus tag | Hhal_2245 |
Symbol | |
ID | 4709500 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 2458603 |
End bp | 2460999 |
Gene Length | 2397 bp |
Protein Length | 798 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639856721 |
Product | RNA-binding S1 domain-containing protein |
Protein accession | YP_001003811 |
Protein GI | 121999024 |
COG category | [K] Transcription |
COG ID | [COG2183] Transcriptional accessory protein |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region |
|
|
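The coordinate and length fields above agree with each other if Start bp and End bp are read as 1-based, inclusive positions and the CDS is taken to end in a single stop codon that is not counted in the protein length. Both readings are assumptions rather than something the record states, and the helper names `gene_length` and `protein_length` are hypothetical. A minimal sketch of the arithmetic:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2458603, 2460999)
print(length, protein_length(length))  # prints: 2397 798
```

The printed values match the Gene Length (2397 bp) and Protein Length (798 aa) fields.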
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.659597 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCAGTG AGTCGATCCC GCAGCGCATC GCCCAGGAGC TGGGCGTGGG CAACCGGCAA GTCGAGGCCG CCATCGAGCT GCTCGACGGG GGCGCCACGG TCCCCTTCAT CGCCCGCTAC CGCAAGGAGG TCACCGGCGG CCTCGACGAC AGCCAGCTCC GCTCGCTCGA GGAACGGCTG ACCTACCTGC GTGAGCTCGA CGAGCGCAAG GAGACGGTGC TCAAGGCCAT CGACGAACAG GGCAAGCTCA CCGACGAGCT GGCCGAGGCC ATCCGGACCG CCGAGACCAA GACCCGGGTC GAGGACCTCT ACCGCCCCTA CCAGAAGAAG CGCCGCACCA AGGCGCAGAT CGCCCGCGAG GCCGGGCTGG AGCCCCTCGC CGACACCCTC CTCGCCGACC CGAGCCAGGT CCCCGAGGAG GCGGCGGCGC CCTACGTCAC CGGACCCGCC GAGGAAGGGC AGGAGACCCC GGCCGTGGCC GATGTCGAGG CCGCCCTCGA CGGCGCGCGG CAGATCCTCA TGGAGCGCTT CGCCGAGGAT GCCGACCTGC TTGAGGAACT GCGCGACTAC GGCTGGCGCA AGGGCTATCT GGTCAGCCGC GTGGCCGAGG GCAAAGAGAA GGACGGCGCC CGCTTCCGCG ACTACTTCGA GCACGCCGAG CCGCTGCGCA AGATCCCCTC CCACCGCGCC CTGGCCATGC TCCGCGGGCA GAGCGAGGAG ATCCTGCGCC TGGAGATCGC CTGGTCCGAC GCCGAGGCCC GCGGCGAGAG CGACGAGCGC AGCGTCGGTG AGGCGGCCAT CGCCCGGCGG TTCGCCATCG CCGACCACGG CCGACCGGCC GACGCCTGGC TGGCGCGCAC CGTTCGCCTG GCCTGGCGGG CGAAACTCTC CACCCACCTC GACCTGGCCC TCAAGCGCCA GCTGCGCGAG CAGTCCGAAG AGGAAGCCAT CCGCGTCTTC GGCGCCAATC TGGAGGATCT GCTGCTGGCG GCGCCGGCCG GCCCCAGGCC GACCATCGGC CTCGATCCGG GGCTGCGCAC CGGGGTCAAG GTCGCGGTCA TCGACGCCAC CGGGGCCGTG GTCGATACCG CCACGATCCA CCCCTTCACC AGCCGCAACA AAGACCCCGA GGGATCGCTC AAGAGCCTCG CTGACCTGGC CCGCAAGCAC GAGGTGGGGC TGGTGGCCAT CGGCAACGGC ACCGCCTCCC GCGAGACCGA CGCCCTGGTC GGTGAGCTGA TCAAGCGCCA CCCGGAGCTG GGCCTCCACA AGGTGGTGGT CTCGGAGGCC GGCGCCTCGG TCTACTCCGC CTCCGAGCAC GCCTCGCGAG AACTCCCCGA GCTGGACGTC TCCCTGCGCG GCGCGGTCTC CATCGCCCGA CGCCTGCAGG ACCCCCTGGC CGAGCTGGTG AAGATCGAGC CCAAGTCCAT CGGCGTCGGT CAGTACCAGC ACGACGTCAA TCAGAGCCAC CTCGGGCGAA AGCTCGATGC GGTCGTGGAG GACTGCGTCA ACGCCGTCGG CGTGGACGCC AACACCGCCT CGGCGCCGCT GCTCGCCCGC GTCTCCGGCC TCGGCCCGGG CCTGGCCGAG AAGATCGTCC AGCACCGCTT CGACAACGGC CGCTTCCGCA CGCGCAAGGA TCTCCAGGGG GTCCCGCGAC TCGGCCCCAA GGCCTTTGAA CAGGCCGCCG GCTTCCTGCG CATCCCCCAG GGCGACAACC CGCTGGACGC CTCCGCCGTC CACCCGGAGG CGTACCCGGT GGTCGAGCGG ATCTGCGCCG AGACCGGCCG GAGCGTGGCC 
GATCTGATCG GCGACGAGGG CTTCCTCGGC GGCCTCGACC CGAAGGCGTA CACCGACGAG CGCTTCGGCG AGCCCACGGT GCGCGACATC CTCGGCGAGC TCGCCAAACC GGGCCGGGAC CCGCGCCCCG AGTTCCGTAC CGCCGCCTTC CGCGAAGGGG TGGAGAAAAT CCAGGATCTG GAACCGGGCA TGGTCCTCGA GGGCACGGTG ACCAACGTCG CCAACTTCGG CGCCTTCGTC GATATCGGGG TCCACCAGGA CGGGCTGGTG CACATCTCCG CCCTCGCCCA CGAATTCGTC CGCGACCCGC GCGACAAGGT CCGCACCGGG GACGTCGTCC AGGTCAAGGT CATGGAAGTC GACCTGGAGC GCCAGCGGAT CGGCCTGTCC ATGCGCCTCG ACGACGACCC CAACGCCCAG GCCGAGGGGG GCCGCAAGGG GGCCAATGGC AAGGGCGCCT CGGCCGCCCG GGGCAAGGGC GATGGCTCCG GCAACAAGGC GACCGGGCGC GGATCGAAGA AGGGCAAGAA GCAGGAAAAG GCCGAACCCG CCACCGCCAC AGCGCTGGCC GAGGCCTTCC GCAAGGCCCG CTCCTGA
|
Protein sequence | MVSESIPQRI AQELGVGNRQ VEAAIELLDG GATVPFIARY RKEVTGGLDD SQLRSLEERL TYLRELDERK ETVLKAIDEQ GKLTDELAEA IRTAETKTRV EDLYRPYQKK RRTKAQIARE AGLEPLADTL LADPSQVPEE AAAPYVTGPA EEGQETPAVA DVEAALDGAR QILMERFAED ADLLEELRDY GWRKGYLVSR VAEGKEKDGA RFRDYFEHAE PLRKIPSHRA LAMLRGQSEE ILRLEIAWSD AEARGESDER SVGEAAIARR FAIADHGRPA DAWLARTVRL AWRAKLSTHL DLALKRQLRE QSEEEAIRVF GANLEDLLLA APAGPRPTIG LDPGLRTGVK VAVIDATGAV VDTATIHPFT SRNKDPEGSL KSLADLARKH EVGLVAIGNG TASRETDALV GELIKRHPEL GLHKVVVSEA GASVYSASEH ASRELPELDV SLRGAVSIAR RLQDPLAELV KIEPKSIGVG QYQHDVNQSH LGRKLDAVVE DCVNAVGVDA NTASAPLLAR VSGLGPGLAE KIVQHRFDNG RFRTRKDLQG VPRLGPKAFE QAAGFLRIPQ GDNPLDASAV HPEAYPVVER ICAETGRSVA DLIGDEGFLG GLDPKAYTDE RFGEPTVRDI LGELAKPGRD PRPEFRTAAF REGVEKIQDL EPGMVLEGTV TNVANFGAFV DIGVHQDGLV HISALAHEFV RDPRDKVRTG DVVQVKVMEV DLERQRIGLS MRLDDDPNAQ AEGGRKGANG KGASAARGKG DGSGNKATGR GSKKGKKQEK AEPATATALA EAFRKARS
|
| |
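The GC content field can in principle be recomputed from the gene sequence. The sketch below assumes the sequence is pasted exactly as displayed, in space-separated blocks of ten bases, and that GC content means the percentage of G and C among all bases. The function name `gc_percent` is hypothetical, and the call at the end uses only the first two blocks of the gene sequence as a short demonstration.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# First two ten-base blocks of the gene sequence, for illustration only.
print(gc_percent("ATGGTCAGTG AGTCGATCCC"))
```

Passing the full gene sequence instead of the two-block excerpt would give a figure to compare against the GC content field.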