Gene Information |
Locus tag | Dret_0167 |
Symbol | |
ID | 8417971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | - |
Start bp | 209672 |
End bp | 211345 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 645036732 |
Product | cytochrome c family protein |
Protein accession | YP_003197047 |
Protein GI | 258404305 |
COG category | |
COG ID | |
TIGRFAM ID | |
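The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon while Protein Length does not. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 209672
END_BP = 211345
GENE_LENGTH_BP = 1674
PROTEIN_LENGTH_AA = 557


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one terminal stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(GENE_LENGTH_BP)

print(computed_length == GENE_LENGTH_BP)       # coordinates agree with Gene Length
print(computed_protein == PROTEIN_LENGTH_AA)   # Gene Length agrees with Protein Length
```

Under these assumptions both checks agree with the listed values: the span 209672 to 211345 covers 1674 bp, which is 558 codons, or 557 residues once the stop codon is excluded.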
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.333474 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.1597 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCACTTCA ACCGACCAGC ACCACTGCGC GTGGCGTGGC TTGGCCTGGC AGTCCTGCTC CTCTGGAGCA CCACGGGGCT GGCCGCTGTG GAGGACCAGC AGCAAAGCGT GGACAAAGAC TACGGCGCGG CCGAAGCCCG CAAAGTCGTC GAAGCGCCCA AGCGATGGAT CACCGCTGAC CACTCAAAGC ACAAAATCCT GCAAAAAGAT TTCGAATCAG GCCCCGAAGT CACAGAGGCG TGTCTGACCT GTCACAACGA GGCCGCTCTC CAGTTCCACG AAACCATCCA CTGGACCTGG ATTTGCCCGG CCGATCCCAA TAAGGAGATG GGCAAGAACG GCGTCAGTAT GAACAACTTC TGAATGTCCA TCAAAAGCAA CGAGGCTCGT TGCACATCCT GTCACGCCGG GTACGGCTGG GAAGACAAGA ATTTTGATTT CACCTCGGAA AAAAATGTCG ACTGTCTCGT CTGCCACGAA CAGACCGGGA CCTATAAAAA ATTCCCCACC GGGGCTGGCC ATCCAGTCTC GGAGCCCAAA AAGTTCGGGG GCAAAATGTT CTATCCGCCG AACTGGAACA AGGTCGCCCA GAGCGTGGGG CGGCCGGACC GTAAAAACTG CGGCACCTGC CACTTCTACG GCGGGGGCGG CGACGGCGTG AAGCATGGCG ACCTGGACAG TTCCCTGTAC AATCCCTCCC AGCAACTCGA CGTCCACATG AGCGAAGAGG GCGGGGATTT TACTTGTGTT CGTTGCCACA CCACGGAAGC CCACTCCATT GCCGGCCGGT GCTATAAAAA GCCCGCCTAC GAAGAACGCA AAAGCCTTAT CGATGACGAC CAGATCAAAC GCATCTCCTG CGTCTCCTGC CACACGACCA CGCCCCACAA GCCGGGGCAC AAGGCCAACG ACCACACGGA CATGGTTGCC TGCCAATCTT GCCATATCCC GGAATTTGCC CGGGTCAACC CGACCAAGAT GTGGTGGGAC TGGTCCAAGG CCGGTCGCCT CAACGAAAAA GGCAAACCGA TCCTCACCGA GGGCAAATAC GGCAAGCACT CCTACCACGG CAAGAAAGGG GAGTTTCGCT GGGAAAAGAA TGTGGTTCCG GAATACGACT GGTTCAACGG CTCCCTGGAA TACCAGTTGT TCACCGAAAA ATTCGATCCG GCCAATGCGC CCATTGAATT GAACAAAGTC CAGGGATCTC GTGAAGATCC CAGGGCTCTG ATCTACCCCT TCAAGATCCA CCGCGCTAAA CAGCCCTATG ACACCAAACT GAACAAGTTC GTCAATGTCC ACCTTTTCGG CAAAGACAAA AACGCCTACT GGAAATCCTA TGATTGGCAG CGGGCCATTA CGGCGGGCAT GGACTACATG GGATTGCCCT ACAGCGGGGA ATTCGATTTC ATTGAAACCG AATACCACTT CCCCATCACC CACATGGTCG CCCCTAAGGA AGACTCCTTG GCTTGCAACC AATGCCACGC GGACAACGCT CGCTTGGCCC AGCTGACCGG CTTCTACATG CCCGGACGCG ATAAAAATGT CCTCGTTGAC ACCATCGGCT GGTTAGCCGT GATCGGCTCG CTGGGCGGGG TTGGCATCCA TGCGGCGTTG CGGCGCAGAT TCGCCAAGAA ACGCCAAACC AACGGAGGAG CCGGGCATGA ATAA
Protein sequence | MHFNRPAPLR VAWLGLAVLL LWSTTGLAAV EDQQQSVDKD YGAAEARKVV EAPKRWITAD HSKHKILQKD FESGPEVTEA CLTCHNEAAL QFHETIHWTW ICPADPNKEM GKNGVSMNNF UMSIKSNEAR CTSCHAGYGW EDKNFDFTSE KNVDCLVCHE QTGTYKKFPT GAGHPVSEPK KFGGKMFYPP NWNKVAQSVG RPDRKNCGTC HFYGGGGDGV KHGDLDSSLY NPSQQLDVHM SEEGGDFTCV RCHTTEAHSI AGRCYKKPAY EERKSLIDDD QIKRISCVSC HTTTPHKPGH KANDHTDMVA CQSCHIPEFA RVNPTKMWWD WSKAGRLNEK GKPILTEGKY GKHSYHGKKG EFRWEKNVVP EYDWFNGSLE YQLFTEKFDP ANAPIELNKV QGSREDPRAL IYPFKIHRAK QPYDTKLNKF VNVHLFGKDK NAYWKSYDWQ RAITAGMDYM GLPYSGEFDF IETEYHFPIT HMVAPKEDSL ACNQCHADNA RLAQLTGFYM PGRDKNVLVD TIGWLAVIGS LGGVGIHAAL RRRFAKKRQT NGGAGHE