Gene Dret_0872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0872 
Symbol 
ID8418691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1029906 
End bp1031291 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content57% 
IMG OID645037441 
Productprotein of unknown function DUF224 cysteine-rich region domain protein 
Protein accessionYP_003197741 
Protein GI258404999 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.301742 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.724229 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGAAG GAACATTGTG CAACAAACGC CCCATCGAAA CCAAAGAAGA GCTCGATGCT 
CTTCTTTCGG ACAAGGGGGG CAAGCTATAC TACCAGGAAA TGGAAGAACT CGATGTGGAC
AAAGCCAAGC TGGCCGCATC GCTGGAAAAA ACCTGCAATT CCAAAATGCG CACCTGGCTC
AATATGTGCG CCCGGTGCGG CCTTTGTGCA GAAAGCTGTT TTCTGTATCA GGTCAACAAC
CGCGACCCGA AGCAGGTGCC CTCCTACAAG GTCCAGTCCA CGCTGGGGGA AATCGTCAAA
CGCAAGGGCG ATGTCGACAA CGCCTTCATG CGCATGTGCA TGGAAACCGC CTGGTCCAAA
TGCACCTGTT GCAACCGCTG CGCCATGTAC TGCCCCTATG GCATCGATAT GGGGGTCATG
ATCAGTTATC TGCGCGGGTT GCTCTTTTCA CAGGGATTCG TGCCCTGGGA GCTGAAGGTC
GGGTCTGGCA TGCACCGGGT CTACCGGGCC CAGATGGATG TGACCTCCGA AGACTTCACC
GAAACCTGCG AATGGATGTG CGAAGAATCC GAGGACGAAT GGCCCAACCT TGAAATCCCG
GTGGACAAGG AAGACGCGGA CATCTTGTAC ACCATCAATG CCCGCGAAGT GAAACACTAC
CCCGAAGACA TCGCCGAGGC CGCCATCCTG TTCCACCTCG CCGGCGAGAA TTGGACTGTC
TCCAGCCAAG GATGGGAACA GACCAGCCTG ACCATGTTCG CTGGCGACTG GGAAGGCTGC
AAGATGCAGG TCAAGGAAGT CTACGACGCC ATGGAGCGCC TGCGTCCCAA ACGGATGATC
GGTACCGAAT GCGGTCACGC CCACCGGGCC ACGGTCATTG AGGGGCCGTA TTGGGCCGGG
CGCAAGGACG GCAAGCCCCC TGTTGAATGC ATCCACTACG TGGAATGGGT GGCCGAGGCG
CTGCGCACCG GCAAATTGAA GATCGACCCG GCCAAAAAGA TCAAAGAACC GGTCACGCTA
CAGGATTCCT GCAACTATGT CCGCAATCAT GGTCTGGCCG AAACCACCCG TGAAATCATG
AGCTATATCG CCGAGGATTT CCGGGAAATG GCGCCGAACA AGGAACACAA CTATTGCTGC
GGCGGTGGAG GCGGCTTCAA CGGTATCGGC TTGTACCGCG AGCAACGCAA TGTGGCCCTG
CGCAAAAAAC GGGACCAGAT CCTGGCCACT GGATGCAAAC TGGTCATCGC CCCCTGCCAC
AATTGCTGGG ACGCCATCCG GGACCTGGAC GAGGAATACG AAATCGGTAT CCGTTGGTCT
TTTCTCAAAC CGTTACTCAT CAGCATGGTC GAGGTTCCCG AGCATCTGAA GCCCGCAGAC
GAATAA
 
Protein sequence
MPEGTLCNKR PIETKEELDA LLSDKGGKLY YQEMEELDVD KAKLAASLEK TCNSKMRTWL 
NMCARCGLCA ESCFLYQVNN RDPKQVPSYK VQSTLGEIVK RKGDVDNAFM RMCMETAWSK
CTCCNRCAMY CPYGIDMGVM ISYLRGLLFS QGFVPWELKV GSGMHRVYRA QMDVTSEDFT
ETCEWMCEES EDEWPNLEIP VDKEDADILY TINAREVKHY PEDIAEAAIL FHLAGENWTV
SSQGWEQTSL TMFAGDWEGC KMQVKEVYDA MERLRPKRMI GTECGHAHRA TVIEGPYWAG
RKDGKPPVEC IHYVEWVAEA LRTGKLKIDP AKKIKEPVTL QDSCNYVRNH GLAETTREIM
SYIAEDFREM APNKEHNYCC GGGGGFNGIG LYREQRNVAL RKKRDQILAT GCKLVIAPCH
NCWDAIRDLD EEYEIGIRWS FLKPLLISMV EVPEHLKPAD E