Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_2338 |
Symbol | |
ID | 4709261 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 2563347 |
End bp | 2565176 |
Gene Length | 1830 bp |
Protein Length | 609 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639856813 |
Product | cytochrome c biogenesis protein, transmembrane region |
Protein accession | YP_001003903 |
Protein GI | 121999116 |
COG category | [C] Energy production and conversion [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4232] Thiol:disulfide interchange protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.369158 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGGCA GGATGCGATT CGACCGAGGT GGCCTGGTCG CCGCGCTGTG GCTGGCGTTG GGTCTGCTGG TGGCGGCCAC CCCGGCACAG GGAATGGTGG ATCAACAGGA GATCTTCCCG TTCACCGTCG AGCAGGTGGG CGAGGATCGC CTGGTCGCTC GCTGGGATGT GGCCGAGGAC CACTACCTCT ATCGCCACGC CTTCGACTTC GAGCTGCGCG ACGGCGACAA CGAGATCATC GAGGTCTCCT ACCCCAGTGG CGAGGCCTTC TCCGATGAGT TCTTCGGGGA TGTGGTCATC TACCGGGGTA CGCTGCAGGT GCACCTGCAG ACCGCCGAAC CGGTCGGCGA GGGGGCGCAG CTCTACGCCG AGTTCCAGGG CTGCAACGAG CCCGAATCCC TCTGCTATCC GCCGAGCAGT CTGGAAGCCG AGATCGCCTC CGGCGCGGTA CAGTACAGCG ATGGCTCCGG CGGTGCCGGT GCCGCCGGGG GCGGCGGCCA GGGCCCCATG GGCGAGCTGG AGGCGCTGCT CGGTGGGGGG AATCTGTGGG CCATCCTCGG CGGTTTCTTT GCCGCCGGGC TGTTGCTGGC CTTCACCGCC TGCATCTACC CGATGATCCC CATCCTCTCG GGGCTGATCG TCGGTAGCCA GCCGGGCGGC GGGCGGCCGG GTACCGGTCG AGCGCTCTGG CTGTCCTTCG TCTATGTCCA GGGCATGGCG ATCACCTACG CCCTGGCCGG TGCATTGGCC GGACTCTCCG GGCGGGCCAT CCAGGCCGAT CTGCAGGGGC CGGTGGTTAC CGTGGCCTTC AGTGCCCTGT TCGTCGCCCT GGCGCTGGCG ATGTTCGGCC TCTACAACCT GCAGATGCCG GCGTCGGTCC AGGGGCGGCT GCAGGCCGCC TCCAGCCGTC TGCCCGGCGG GCAGGTGGCG GGCGTGGCAG CCATGGGTGT GCTCTCCACG CTGATCGTCG GTGCCTGCTC GGGGCCGGCG CTGGTGGCCG CCCTGGCATT CATCGGCAAC ACCGGTGAGG TGGTGCTGGG CGCCGGCGCG CTCTACATCA TGGCCCTGGG TATGGGGGCG CCGTTGCTGG CGGTCGGCAC GGCCGCCGGG CGCTGGATGC CGCGCTCGGG TCCGTGGATG GAGTCGGTCA AGCAGGTCTT CGGCTTCATC TTCCTCGGCG TGGCGTGGTG GATGTCGTCC CGGCTGATGC CGGATGGGCT GGTGCTCGCT GGTTGGGCGG TGTTGCTGCT GGCGGCTGCG GTGTGGCTGG CCTGGCGGCT GCTGCGCAGT CGCGGCACCG GTGGTTCGGT GCCGGCCCGG GGCGCCGGGG CCACGCTGGC GGCCGTGCTG GCGGTAGCCG GTGCTGCCCA GGTCCTCGGG GCGGTCACCG GTGCCGGTGA CCCGCTGCGC CCCTGGGTGG GGATCACCGG CGACCCATAC GCCCAGGCCC GCGCCCAGGT GCTCGAGGAG TGGCGGCACG TCGAGACCCT CGATGAGCTT GAGGAGCTGC TTGACGAGGC CCGCGAGGCG GGGCGCCCGG TGGTGATCGA CTTCTCGGCC GAGTGGTGCG TCTACTGTGT CCAGCTTGAG GAGCGTACGC TTCCGGATGA CCGGGTCCAG GCGGCCCTTG AAGGCGCCGA GAAGGTACGC ATCGACGTCA CCGACATGAC CGATGCTGAT CGGGAACTGA TGGAGGCCTA CGGCGTCTAT CTGCCTCCCG CGATCCTGTT CTACAACGGC GAGGGCGAGG AGCAATCGGA GTACCGGGTG GCCGGCTTCA AGGACGCCGA GGCGTTTGCC GAGCGCACGC GCGAGGCCTT CGGTGGGTGA
|
Protein sequence | MTGRMRFDRG GLVAALWLAL GLLVAATPAQ GMVDQQEIFP FTVEQVGEDR LVARWDVAED HYLYRHAFDF ELRDGDNEII EVSYPSGEAF SDEFFGDVVI YRGTLQVHLQ TAEPVGEGAQ LYAEFQGCNE PESLCYPPSS LEAEIASGAV QYSDGSGGAG AAGGGGQGPM GELEALLGGG NLWAILGGFF AAGLLLAFTA CIYPMIPILS GLIVGSQPGG GRPGTGRALW LSFVYVQGMA ITYALAGALA GLSGRAIQAD LQGPVVTVAF SALFVALALA MFGLYNLQMP ASVQGRLQAA SSRLPGGQVA GVAAMGVLST LIVGACSGPA LVAALAFIGN TGEVVLGAGA LYIMALGMGA PLLAVGTAAG RWMPRSGPWM ESVKQVFGFI FLGVAWWMSS RLMPDGLVLA GWAVLLLAAA VWLAWRLLRS RGTGGSVPAR GAGATLAAVL AVAGAAQVLG AVTGAGDPLR PWVGITGDPY AQARAQVLEE WRHVETLDEL EELLDEAREA GRPVVIDFSA EWCVYCVQLE ERTLPDDRVQ AALEGAEKVR IDVTDMTDAD RELMEAYGVY LPPAILFYNG EGEEQSEYRV AGFKDAEAFA ERTREAFGG
|
| |