Gene Dret_1963 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1963 
Symbol 
ID8419808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2246979 
End bp2248202 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content55% 
IMG OID645038551 
Productquinone-interacting membrane-bound oxidoreductase complex subunit C 
Protein accessionYP_003198825 
Protein GI258406083 
COG category[C] Energy production and conversion 
COG ID[COG1150] Heterodisulfide reductase, subunit C 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.00829701 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCGAAC CCGTGAAAAT TCAGCCGGAT CTGGATTTTA TCAAGGAGCT GCAGGAGGTC 
GGGGGCGACA CCCTGAAGAA GTGCTACCAA TGCGCGACCT GTAGCGTTGT CTGTCCCTTG
TCGCCCATTG ACAAGCCCTA TCCCCGCAAG GAGATGGTCT GGGCCCAATG GGGACTGCGT
GACAAACTTC TTAACGATAT CGATATTTGG TTGTGCCACA ATTGCGGACA GTGCTCTGAT
CTCTGCCCGC GGGGCGCCAA GCCCGGTGAC CTGCTGGCCG GATTGCGCAA TATGGCCTAC
CGCAAACTCG TCGAGCCAAC GATTCTGGGC AAGTGGATGA GTTCGGCCAA GTATCTGCCG
CAACTGATCG GGATTCCGGC GATCATTTAT CTGATCATCT GGATCATCCG CGCCGGGATG
CTCGGCACCG CATTCCCGCT CAACGAACAG GGCAACGTCG AATACGGCGT CCTTTTCCCT
GGTGATTACA CCATCGACAC CGTTTTCGGG TTGGTGGCCC TGTTTGTGAT CTACACCTTC
TACAAAGGGA TACGAAAACT GATCGCGTCA TTCCAGGATC AGCCCAAAAC CTTTGTCATT
GGCTATGAGC CCAAGGGCAT CTGGGCCTCG CTGTGGGATA CGGTCAAAGA TGAAATCGTC
ACCCATCGCA AATGGAAAGA TTGCGGAGAG GATTCCGAGG CCGACGAGCA GAAGTTCAAG
GGCCACTTGT TGACCTTCTA CGCGTTCGTG GCCTTGTTCA TTGTCACCAG TGTCGTGGCT
GTGACCCATT GGGGCGGCAA GATCATTCCT TTCCTGGCTC CCATTGGACA CACCCCGATG
CCGTTGTGGC ACCCAGTCAA GATTCTGGCC AATGTCGGCG CTATTGCCCT CCTTGTCGGC
CTGACCTACT TGACCAAACG CCGCTTGAAC CAGGACCCGA ACAAACACGC CTCGACTTTT
TATGACTGGT ATCTGCTTGG CGTGATCTGG GCCGTGGCAG TGACTGGGGT TTTGTGTGAA
CTGCTCCGTC TGGCCGGCGT GGCTGGTCTG GCGTACCCCA TGTACTACCT GCACCTGATC
TCGGTGTTCA TGCTTATCGC GTACCTGCCG TGGTCAAAAT TGGGGCACAT GGTCTATCGG
ACCGCCGCCC TGGCCTATGC CCGGCATATT GGCCGACTGC CCATGGCTTC TGAGAGCCAA
AAGAAAAAAA TATTTGTCAT CTAA
 
Protein sequence
MSEPVKIQPD LDFIKELQEV GGDTLKKCYQ CATCSVVCPL SPIDKPYPRK EMVWAQWGLR 
DKLLNDIDIW LCHNCGQCSD LCPRGAKPGD LLAGLRNMAY RKLVEPTILG KWMSSAKYLP
QLIGIPAIIY LIIWIIRAGM LGTAFPLNEQ GNVEYGVLFP GDYTIDTVFG LVALFVIYTF
YKGIRKLIAS FQDQPKTFVI GYEPKGIWAS LWDTVKDEIV THRKWKDCGE DSEADEQKFK
GHLLTFYAFV ALFIVTSVVA VTHWGGKIIP FLAPIGHTPM PLWHPVKILA NVGAIALLVG
LTYLTKRRLN QDPNKHASTF YDWYLLGVIW AVAVTGVLCE LLRLAGVAGL AYPMYYLHLI
SVFMLIAYLP WSKLGHMVYR TAALAYARHI GRLPMASESQ KKKIFVI