Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3167 |
Symbol | |
ID | 5735039 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3999819 |
End bp | 4000739 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280310 |
Product | cytochrome c biogenesis protein transmembrane region |
Protein accession | YP_001545932 |
Protein GI | 159899685 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0785] Cytochrome c biogenesis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00198959 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACAAG CGCAAACACA GCGCCAACCC TCCCGACTGT TTACCATGGC TATCATTGGG CTAGCCGTGT TTGTGGTAGC CGCAATTTTG CTGATTGGGG TCGCTCAAGG TGGACAAAAT GTGATCACTT TGGCGGTTCC AGCCTTTATG GCGGGAGTTT TATCGTTTTT ATCGCCCTGT ACCTTGCCCG TGCTACCAGC CTATTTCGCT TGGACGTTTG GAATCAATAG TGCTGGTGAT GAAAGCCTGG CGCAAAAACG CCAACGTACA ATTATCTCAT CGTTGGCCTT TTTTGCGGGC TTGGCAACCA CGATGGTGGT GTTGGGTATG GCGATTACGG CCACTAGCCA AGCCATTCGC GGCATTTCAT CCAACAAAGA TTTGTTCGCC CAAATTGGCG GGGTAATTAT CATTGTCATG GGTGTGATGA GCATTTTTGG GATTGGCTTC ACCGGAGCGA ATATCAAACG TAGCTCCAGC GCCACCGTTG GCAGTGCATA TCTGTATGGA TTAACCTTTG CGCTGGGCTG GACGGCTTGT ATTGGGCCAA TTCTTGGGGC AATTTTGACC CTGTTGATTT CAACCGGAGC TTCGATTATT GCTGGCGCAA CCTTGACCTT TATCTATGTG CTTGGCTTGG CTTTGCCCTT GATGATCGTA GCGACCTTCT TCAACAAACT TGGCCAAGGC TCCAAGGGCT GGAAGCTTCT GCGTGGGCGG GCTTTGGAGT TCAACATTGG CAAGCGCGTC GTGATTTTGC ACTCCACCAG CGTGATCAGC GGTATTTTGT TGATTGCCGT GGGGATTTTG CTGGTAACTG GACAAATGAG CACATTGAAT GATATTGCCC TCGATAACCC CATTAGCAAA TGGGCCAACA ATGTGCAATA CGATATTCAA ACCTTTTTTA CTGGCGAGTA G
|
Protein sequence | MQQAQTQRQP SRLFTMAIIG LAVFVVAAIL LIGVAQGGQN VITLAVPAFM AGVLSFLSPC TLPVLPAYFA WTFGINSAGD ESLAQKRQRT IISSLAFFAG LATTMVVLGM AITATSQAIR GISSNKDLFA QIGGVIIIVM GVMSIFGIGF TGANIKRSSS ATVGSAYLYG LTFALGWTAC IGPILGAILT LLISTGASII AGATLTFIYV LGLALPLMIV ATFFNKLGQG SKGWKLLRGR ALEFNIGKRV VILHSTSVIS GILLIAVGIL LVTGQMSTLN DIALDNPISK WANNVQYDIQ TFFTGE
|
| |