Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0422 |
Symbol | |
ID | 4711520 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 490262 |
End bp | 491926 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639854880 |
Product | hypothetical protein |
Protein accession | YP_001002013 |
Protein GI | 121997226 |
COG category | [R] General function prediction only |
COG ID | [COG3972] Superfamily I DNA and RNA helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCATG TTCACCCTTC CAACATCGAC ACCCTTCGGC TGGCCGGTGC GCCGGAGCGC GAGCTTAGGA CGCTGGAGTG GCTGGGCGAC TCGCTGCCGC AGAGCTACAC CGTCTACCAC GGCGTGCACT GGAGTGCCGG GTCCGGGCGG GGCGCCGTGT TCGGTGAGGT GGATTTTGTC GTCGTTAACG CCGCCGGCGA GGTCCTGCTG ATTGAGCAGA AGAACGGGGC TTTGGCTGAG AGCGACGGCC TGCTCGGCAA GGACTACGGG CATGAGCGCG GCCCCAAGGA CGTGGTCCGG CAGCTGCACC GCAGCCGCGA AGGGTTGCTC GGAGCCCTCG AGCGCGGTCT CGGTGGCCGC AAGCCGCCGG GTATGAGCCT GCTGCTGTAC TGCCCGGCGC ACCGGCTGCA GGGGGACGCA CCGGCTGGGC TCTCTCGCGA GCAGATCGTC GATGCTTCAA GAGTCCAAGA ACTCCCCGCA TCCGTCGAGG CGCAGCTCGG GCCGGGGCAG GACGCGCCGG ATACAGCCCG GACGGTTCGG CGCGTGCTCG CGCAGGAGCT GGATTTGGCA CCGGATCTCG GCGACGAGGT CACGCTTCAG GAGCAGACCT TCCAGCGTCT CGCTGGTGGC ATCACCGAGC TGGTCCAGGG GCTGGAGATG AGCCCATGGC GCCTTCGGGT TATCGGTGCT GCGGGTAGCG GGAAGACGAT TGCCGCGATC GAGTTTTTCG AGGCTGCGCA AGCCCGGGGA GAACGACCGG CGCTGGTCTG TTTCAACCGC GTCCTGGGCG ACCGGCTGCG CGCACGACTG GAGGGCAACG CTGACGTCGG CAATTTCCAC CGCCTTTGTC ATGCCTGGCT TGAGGCCGTT GGTGAGAGCT TCGATGCGCA GCGGGCGCGG CGGGAGCCCC AGGAGTATTG GAGCGAGGTG GCGGACCGGC TCATCGAGCA TTCCGAGCGA TTGCCCTGTT TCGACCGATT GATCGTCGAT GAAGGGCAGG ACTTCTCCGA AGAGTGGTGG GAGCTCCTGC GCATCTGTCT CGTCGACGAT GATGCACCGG TCTTGTGGCT GGAAGACCCC CAGCAGGATC TCTACGGGCG CAACGACCAG CAACAGTCCG CGTTCGTCAC CTACCGAACG GGCAAGGCTT TTCGGACGCC ACGCCGGATT GCGCAGTTCG TCCGTCGCCT GCTCGAGGTC GATATCGACT GGCGTAATCC GCTCGACGGC CATAAGCCGC GGGTTACCCG GTACGCAACG GCCGATGAGC AGCGCGAGGC CCTTCTTCAG GCTGTGGAAC ACCTCGAGAG CGAGGGCTTT CGCAAGGATC AGATGGTGTT GCTGAGCCTG CACGGTCACG GCCGTGATCC ACTGGCGGAG ACGGCCCGGC TGGGTCGTTA CCGCCTCAAG CGGTTTACGG GGGATTTCAC CGAAGACGGC CAGCCGGTGT ACTCGAAGGG CGATTTACGC GTCGAGACGG TCTATCGCTT CAAGGGCGAG CAGCGCCCCG CCGTGATCCT GATGGACGTC GATTTCGACG GTAGCCGGCC CGAGCGTGAG CAGCGTCTGC TCTACTGTGC GCTCACCCGG GCCTCGGTGG CCTGCGAGGT GCTGGTCGCT GAGGGTTCCG CGTGGCGGAA ACGGCTGGAG AACGCGGCAT CGTGA
|
Protein sequence | MAHVHPSNID TLRLAGAPER ELRTLEWLGD SLPQSYTVYH GVHWSAGSGR GAVFGEVDFV VVNAAGEVLL IEQKNGALAE SDGLLGKDYG HERGPKDVVR QLHRSREGLL GALERGLGGR KPPGMSLLLY CPAHRLQGDA PAGLSREQIV DASRVQELPA SVEAQLGPGQ DAPDTARTVR RVLAQELDLA PDLGDEVTLQ EQTFQRLAGG ITELVQGLEM SPWRLRVIGA AGSGKTIAAI EFFEAAQARG ERPALVCFNR VLGDRLRARL EGNADVGNFH RLCHAWLEAV GESFDAQRAR REPQEYWSEV ADRLIEHSER LPCFDRLIVD EGQDFSEEWW ELLRICLVDD DAPVLWLEDP QQDLYGRNDQ QQSAFVTYRT GKAFRTPRRI AQFVRRLLEV DIDWRNPLDG HKPRVTRYAT ADEQREALLQ AVEHLESEGF RKDQMVLLSL HGHGRDPLAE TARLGRYRLK RFTGDFTEDG QPVYSKGDLR VETVYRFKGE QRPAVILMDV DFDGSRPERE QRLLYCALTR ASVACEVLVA EGSAWRKRLE NAAS
|
| |