Gene Information |
Locus tag | Hhal_0421 |
Symbol | |
ID | 4711453 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 488709 |
End bp | 490190 |
Gene Length | 1482 bp |
Protein Length | 493 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639854879 |
Product | hypothetical protein |
Protein accession | YP_001002012 |
Protein GI | 121997225 |
COG category | [R] General function prediction only |
COG ID | [COG2251] Predicted nuclease (RecB family) |
TIGRFAM ID | [TIGR03491] RecB family nuclease, putative, TM0106 family |
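
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the gene length includes one terminal stop codon (the listed gene sequence ends in TGA). The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 488709
END_BP = 490190
GENE_LENGTH_BP = 1482
PROTEIN_LENGTH_AA = 493


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count, assuming the CDS ends with exactly one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 490190 - 488709 + 1 = 1482 bp, and 1482 / 3 - 1 = 493 aa, which agrees with the Gene Length and Protein Length fields.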
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.805168 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCGCC TGTCCAAATC CCGCTTCATA GCCGGCTGGC AGTGCCCGCT TCGGCTCTGG TACACGGTCC ACGAGCCCTG GCGAGCCACG CCGCCTGATC CCCAGTTGCA AGCGATCTTC GACCAGGGCC ACCGACTCGG GCAATTGGCT CAGCAGCGCT ATCCATCCGG GGTGCTCGTC GAAGCGGATT ACCGACACAC GCGCCAGGCC CTCGAGCAAA CGCAGGTGCT GATGGCCGAC CCGGGCGTGC CGGCGATCTT TGAAGCCGCT CTGGAGCACG ACCAGGTCCT GACCCGAGTG GACGCGCTCG TTCGAAACGG GGACGGGTGG GACCTGGTCG AGGTCAAGGG CGCCACGCGT GCCAAGGAGG TTTTCCAGCT CGACGTGGCG ATCCAGTATT GGATCCTCAC CGGTGCCGGC ATCCCGGTAC GCGATGCCGG GCTGCTGTTG ATCGACCGTG ACTACGTCTA TCCGGGCGGC GCGCACGACC CGCAGGAGTT CTTCCGGTTC GAACCGCTGA CCAGAACCTG CGAGCACTGG CTTGAGTGGA TCGAGGCGCA GGCCGTCTCC TTCCAGGAGG TTGCCGCCCG TACAACACCA CCCCAGATCG AAATCGGGCA GCAGTGCTTC AGCCCTTACC CCTGCCCGTT CTACAACCAC TGCTCTGCTG ACAGGGAGTG GCCCGAGCAC CCGATCGAGG AGCTACCGTA TCTGGCCGGC GAGCGTTACC AGGGCCTCCT CGAGCGGGGT GTCACAACCA TCGACGCCAT CCCCGACGAT TACCCGCTTA CCGTCGCTCA GAAGCGGATC CGAGACGCCG TTGCCAAGGG GCATCCCTGG ACCGATCCCG ACCTGGGTGA AGCCGTTCAG GGTGTGGAGT GGCCTCTCTT CTTCCTGGAT TTCGAGACCG CACAGCCGGC CTTACCCCGT TACCCCGGAA CCTCCCCGTT TCAGGCCTTG CCGTTCCAGT TTTCCTGCCA TATCCAACGA GCCCCCGGGG TCGCCCCGGA GCACACGGAC TTTCTGGCCA CCGCCGATAC CGACCCCCGC CGCCCACTTG CCGAGGCACT GCTCAACGCA CTCGGTGACC GCGGATCGAT CGTCGTTTAC TCCAGCTTTG AGCGGCGGGT CCTCGGCGAA CTCGCCGATG CGCTGCCGGA TCTCGCTGAG CCCTTAGCCG CCCTTCAGGA GCGGCTGTGG GATCTGCTAC GGGTCCTGCG GAACCACTAC TACCACCCGG CTTTCAAGGG CTCCTACTCC ATCAAGCGGG TCCTGCCCGT ACTGGTTCAA GACCTCGACT ACACCGCCCT GGAAGTGGCC GATGGCCAGG CCGCCGCGCA GGCCTGGATC CAGATGCTCG ACACGGATTG CGACGAGGAA CGCGATCGGC TCGCCCAAGC CCTCCGCGAA TATTGCTTCA CCGACACGCT GGCCATGGTG CGACTGCGCG AGGCGTTACT GCGTGCCGCC GGAGAGCTGT GA
|
Protein sequence | MPRLSKSRFI AGWQCPLRLW YTVHEPWRAT PPDPQLQAIF DQGHRLGQLA QQRYPSGVLV EADYRHTRQA LEQTQVLMAD PGVPAIFEAA LEHDQVLTRV DALVRNGDGW DLVEVKGATR AKEVFQLDVA IQYWILTGAG IPVRDAGLLL IDRDYVYPGG AHDPQEFFRF EPLTRTCEHW LEWIEAQAVS FQEVAARTTP PQIEIGQQCF SPYPCPFYNH CSADREWPEH PIEELPYLAG ERYQGLLERG VTTIDAIPDD YPLTVAQKRI RDAVAKGHPW TDPDLGEAVQ GVEWPLFFLD FETAQPALPR YPGTSPFQAL PFQFSCHIQR APGVAPEHTD FLATADTDPR RPLAEALLNA LGDRGSIVVY SSFERRVLGE LADALPDLAE PLAALQERLW DLLRVLRNHY YHPAFKGSYS IKRVLPVLVQ DLDYTALEVA DGQAAAQAWI QMLDTDCDEE RDRLAQALRE YCFTDTLAMV RLREALLRAA GEL
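
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of how the GC content field could be recomputed from the gene sequence. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first two blocks of the gene sequence are used here as a short demonstration.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence listed in this record.
    first_blocks = "ATGCCCCGCC TGTCCAAATC"
    print(clean_sequence(first_blocks))
    print(gc_percent(first_blocks))
```

Passing the complete Gene sequence value to `gc_percent` would give a figure that can be compared with the GC content field.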