Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0238 |
Symbol | |
ID | 4711450 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 274064 |
End bp | 275353 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | 639854698 |
Product | hypothetical protein |
Protein accession | YP_001001834 |
Protein GI | 121997047 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00701852 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTCACCC ACGCCCGACA ACCGCCCATG TCCCACTGGC GAACGCTCTC CGGACTCTTC GGCCCGCGCT GGCGTCACCC GGACCCGGCC ACCCGCCGGC GGGCCGCCAT GGATCTCGAC CCCGACGACC CGGAGACCGC CGAGGCGCTG GCCGCCCTGA TCGACGACCC GGCCCCGCCG GTGCGCGAGG CCGCCGCCAA GCGCTGCCGA GACCTGGCCA CCCTGCGCCG GCTGCGCCGC CACGACCCGG ACGCCAGCGT GCGCACCGCC GCCACGGTCC GCTACCGCCA GCGGCTGGTG GGCGACGTAC CGCCGGAGGC GGTAGCCACT GAACTGCGCC ACTGCGACGA CCCCACCGTG GCCGCCCATA TCGCCCAGCA GGGCCGCACC CCGGCGGTGC GCCGCGCCGC CCTGGAGCAT CTGGACCAGG CCCCGGTCCT GCAGGAGGTG GCCCTCCACG ACGACGACCC GGCCACCCGG CTGTACGCCG TGCAGCAGCT CTCCGACCCA CAGACCCTGC GGCGCCTGGC CGAGCGTGCC CGCAGCCTGG CCCCGGACGT GGCCGACGCC GCCGAGCATC GCCTGCACGC CCCGGCGCCG ACCACTCCCG AGGCCGACCC CGAGACCGAC ACCACCGCTG AACCGGAGCC GGCGGCCGAT CCCGCGCCCC CCTGCGAGGA CGAGGCGGCC GCCCCATCCA CCGGCGCCGC TGCCTCGCTG GCCAACGCCA TGACCGGCCT CGCCGAGGGC GGCTGGTGGC CCGGCTACCA CGCCCGCCGG CGGGCGCTGA TGGCGCGCTG GCGGGAGCTG GAGGAACCAC GCCCCCCGGC GCTGAGCGAG CGCTTCCGCC AGGCCACCCT GGCGGCCCTG ATGCAGCAGC CGCGCAACCA CGGCGGCGAG GCCACCCGCA CCCGCCTGCA GCTGGAGAAC CTGCTCGCCG AGACCCGGGG GGAGGACGAT CCGGCCAGCG CCGTGGGGCA GCGCCTGGAG GCCCTCAGCC GCAGTGTTGC CGAGCTGACC GGCGACCACC CCGATGAGGC CCGCGCCCGC ATCCACCTGC AGCGCCTGGC CCGTCGGGTG CCGCAGATCC GCCCGCGGCG CCCGGTCCGC GCCGGGGTGC GCCCGCCGCA TCGCCCTTCG GCGCCGCACG ACCTCCCCGC CCTGGCCCGG AGCCTGGCCG ACGCCGAGGC GGCCATCGCC GCCGGCGACC TGCACCGGGT CCGCGCGGCC CTCGGCGAGG CCCGGCGGCT GGTGGAACCC GCCGAGGCCG GCGAGCCGCC GCAGGGCTAG
|
Protein sequence | MLTHARQPPM SHWRTLSGLF GPRWRHPDPA TRRRAAMDLD PDDPETAEAL AALIDDPAPP VREAAAKRCR DLATLRRLRR HDPDASVRTA ATVRYRQRLV GDVPPEAVAT ELRHCDDPTV AAHIAQQGRT PAVRRAALEH LDQAPVLQEV ALHDDDPATR LYAVQQLSDP QTLRRLAERA RSLAPDVADA AEHRLHAPAP TTPEADPETD TTAEPEPAAD PAPPCEDEAA APSTGAAASL ANAMTGLAEG GWWPGYHARR RALMARWREL EEPRPPALSE RFRQATLAAL MQQPRNHGGE ATRTRLQLEN LLAETRGEDD PASAVGQRLE ALSRSVAELT GDHPDEARAR IHLQRLARRV PQIRPRRPVR AGVRPPHRPS APHDLPALAR SLADAEAAIA AGDLHRVRAA LGEARRLVEP AEAGEPPQG
|
| |