Gene Dret_1037 details

Gene Information

Locus tag: Dret_1037
Symbol:
ID: 8418860
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 1224904
End bp: 1226202
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 58%
IMG OID: 645037607
Product: protein of unknown function DUF224 cysteine-rich region domain protein
Protein accession: YP_003197903
Protein GI: 258405161
COG category: [C] Energy production and conversion
COG ID: [COG0247] Fe-S oxidoreductase
TIGRFAM ID:
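
The coordinate and length fields in this record are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes 1-based, inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed for Dret_1037.
length = gene_length(1224904, 1226202)
print(length, protein_length(length))
```

For this record the two values come out as 1299 bp and 432 aa, matching the Gene Length and Protein Length fields.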


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.106795
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.550231
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGCGG ATTTGCAAGA ACTTGCAAAA CTCTTGCGAG ATATAGACGA CCAGTTGGTC 
AGCTGCATGA AATGCGGTAT GTGCCAAGCG GCCTGCCCCC TGTTTGCTGA GACAGGCCGT
GAGGCTGACG TCGCGCGCGG CAAGATCGCC TTGCTCGAGA ACCTGGCCAA CGAAATGATA
GAAGACCCCA AAGGCGTCAA AGACCGCCTT GACAAGTGCC TGCTGTGCGG ATCCTGTGCG
GCCGCCTGTC CCAGCGGGGT CAAGGTCCTG GACATCTTCA TCAAGGCCCG AGCCATCATC
ACCGGCTATA TGGGGCTGTC CCCGGCCAAG AAAGCCATCT TCCGGGGCAT GCTCCAGCAC
CCCGAACTCT TCAACAATGT CGTGGGTGTG GCCTCGAAAT TCCAAGGGCT GTTTACCAAG
CCGGTAAACG ACATGATCGG TTCCTCCTGT GCGCGGTTCA TGTCCCCGCT CATTGGGGAC
CGCCACTTCC AGCCCTTGGC CAAGGAGCCG CTGCACAAAA AATACGGCAA AGTGGACACG
GCCGCCGGCA AAAGCGGCAT CAAGGTCGCC TTGTACCCCG GCTGCCTCGT GGACAAAATC
TTCCCGCGCG TCGGCGATGC GGTCATGAAA ATCCTGGAAC ACCACGGTGT CGGAGTCTAC
ATGCCGCTGA AGCAGGCCTG CTGTGGCATC CCGGCCATCT CCTCCGGAGA CAAGCAGACC
TACGACAAGC TGGTCAAACA AAACCTCGAG GTGTTTGAAA AAGGCGACTT CGATTACCTC
CTGACGCCGT GCGCGACGTG CACCTCGACG ATCAAAAAGA TCTGGCCCCT TATGGCCGAA
GATTACGAAG GCGCCCTGCG CAACCGGGTC AACCTCCTCT CTGACAAGAC CATGGACGTC
AACGCATTTC TGGTCGACGT CCTCGGAGTG AAGGGAATCG CGGAGCCCAA TGCTACGGCT
AAATCCATCA CCTATCATGA CCCCTGTCAC CTCAAGAAAT CCCTTGGCGT GGCCGCACAG
CCCCGGACGC TGCTGCATAC CAACCCCGGA TATGAGCTCA AGGAAATGTC CGAGTCCGAT
CGGTGCTGTG GCATGGGCGG CAGCTTCAAC ATCCAGCACT ACGACCTTTC GGAAAAGATT
GGCGGGCACA AACGCGACAG CATCCTGGCC ACCAAGGCTC AGGTTTTGGC CACTGGATGC
CCGGCTTGCA TGATGCAGAT CTCCGATCTG CTTTCGCACA GCGACGCTCA GATCGCCATC
AAGCACCCGG TTGAAATCTA CGCCGAAACA TTGCCCTAA
 
Protein sequence
MSADLQELAK LLRDIDDQLV SCMKCGMCQA ACPLFAETGR EADVARGKIA LLENLANEMI 
EDPKGVKDRL DKCLLCGSCA AACPSGVKVL DIFIKARAII TGYMGLSPAK KAIFRGMLQH
PELFNNVVGV ASKFQGLFTK PVNDMIGSSC ARFMSPLIGD RHFQPLAKEP LHKKYGKVDT
AAGKSGIKVA LYPGCLVDKI FPRVGDAVMK ILEHHGVGVY MPLKQACCGI PAISSGDKQT
YDKLVKQNLE VFEKGDFDYL LTPCATCTST IKKIWPLMAE DYEGALRNRV NLLSDKTMDV
NAFLVDVLGV KGIAEPNATA KSITYHDPCH LKKSLGVAAQ PRTLLHTNPG YELKEMSESD
RCCGMGGSFN IQHYDLSEKI GGHKRDSILA TKAQVLATGC PACMMQISDL LSHSDAQIAI
KHPVEIYAET LP
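
The gene and protein sequences can be cross-checked by translating one into the other. The sketch below is a minimal illustration, not the tool that produced this record: it strips the 10-character grouping, computes a GC fraction, and translates with the standard codon assignments, which translation table 11 shares for elongation. The alternative start codons of table 11 are not handled, and all function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (seq must be non-empty)."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_blocks = clean("ATGAGTGCGG ATTTGCAAGA ACTTGCAAAA")
print(translate(first_blocks))
```

These 30 bases translate to MSADLQELAK, the first block of the protein sequence. Passing the full gene sequence through `clean` and `translate` would extend the same comparison to the whole record.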