Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1372 |
Symbol | |
ID | 4711386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 1479652 |
End bp | 1480548 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639855839 |
Product | TPR repeat-containing protein |
Protein accession | YP_001002941 |
Protein GI | 121998154 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4235] Cytochrome c biogenesis factor |
TIGRFAM ID | [TIGR03142] cytochrome c-type biogenesis protein CcmI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGGAG CCTTCGTCGG GGCCATTGTG GTCCTCTGCA TCATCGCCTT GATCTTCGTG CTCTTCCCGC TGCTGCGGGA TGTGGCACAG ACGCGGACGC AATCCCGCCG CCAGATCAAC GCGGCGATCC ACCGGGACCG CATCCGCGAG CTCGACCAAG ACCTGGATAA CGGCACGCTG AGCCGGGCGC AGTACGATGC GGCCATCGCC GACCTGGACC GGGATCTGGT CCAGAGCGGG GCCATCGACA GCGAGGAGGA TCAGGCCGGT TATATGCCCC GGGCCCGCCG CGGCGTGGTC GCCGCGGCGG CCACGCTAAG CGCGGTGGCG GTGCCGGTGC TGGCCCTGTC CATGTACCAC TCCCTGGGCG ACGAGCGTGC CTTCACCCAG GCCGGCACGC CCACCACGCC GGATCGCCAG CAGCAGGGCG AGGCCCCGGG GCAGCCACAG CAGCACGATC CGGACGAGAT CGAGGCCATG GCGCAGCAAC TGCGCGAGCG CCTGGAACAG AGCCCCGACG ATCCGACCGG GTGGGTGCTC TACGGCCGCA CCATGATCTA CCTGGAGAAC CTGGACGAGG CCGAGAACGC CTTCCGGCGG GCCCTGGACC TGGGGGCCGA CGACGACCCG AGCCTGCTCG CCGAGTACGC CGATATCCTG GCGGCCACCA CCGGCAATCT CCAGGGCGAA CCCATGGAGT ATCTGGAACG GGCGCTGGAG ATCGATCCGG GCCATGTCCG GGCATTGTGG CTGGCCGGAA CGGCGGCCTA CAACCAGGCG GACTACGATC AGGCGCGCTC CTACTGGGAG GATCTCTTGG AGGTCGTGCC GCCGGAGTCC CAAGAGGCCC AGGCCATCCA GTCGAACCTG CGACAGCTCC CCGAAGGCGA GGGCTGA
|
Protein sequence | MSGAFVGAIV VLCIIALIFV LFPLLRDVAQ TRTQSRRQIN AAIHRDRIRE LDQDLDNGTL SRAQYDAAIA DLDRDLVQSG AIDSEEDQAG YMPRARRGVV AAAATLSAVA VPVLALSMYH SLGDERAFTQ AGTPTTPDRQ QQGEAPGQPQ QHDPDEIEAM AQQLRERLEQ SPDDPTGWVL YGRTMIYLEN LDEAENAFRR ALDLGADDDP SLLAEYADIL AATTGNLQGE PMEYLERALE IDPGHVRALW LAGTAAYNQA DYDQARSYWE DLLEVVPPES QEAQAIQSNL RQLPEGEG
|
| |