Gene Dret_1031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1031 
Symbol 
ID8418854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1213128 
End bp1215281 
Gene Length2154 bp 
Protein Length717 aa 
Translation table11 
GC content58% 
IMG OID645037601 
Productprotein of unknown function DUF162 
Protein accessionYP_003197897 
Protein GI258405155 
COG category[C] Energy production and conversion 
COG ID[COG1139] Uncharacterized conserved protein containing a ferredoxin-like domain 
TIGRFAM ID[TIGR00273] iron-sulfur cluster-binding protein 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0202233 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGAAG CTAAGAATCT GAAAGAATAT AGAGAAAAGA TCCACGGCAC CCTGGAAAAC 
GACTTTTTGC GTTCTACCCT CGACAATTTC GCCGTGGCGT ACCGTGCCGG CCGAGCCAAG
GCCTTCGCCG GTATGGATGA AAAGGCCCTG ATCAATGAAA TCGCGGCGGC CAAGGACGAT
TCCATCAGCC GCATGGATCA GCTTTTCGAT GAATTCAAAA CCAAGGCCGA ACAAATCGGC
ATCAAGGTCC ATCTGGCGCA CACCGCGCAC GAGGCCAACG AGATCATCGC CAAAATCGCC
GCGCAAAACG ATTGCCAGAA AATCGTCAAA TCCAAATCCA TGACCGCTGA GGAGACCCAC
CTCAACCACC ACCTCGAAAA AGAGGGGCTC AAGGTCACAG AGACCGACCT CGGGGAATGG
ATCATTCAGT TGCGCCATGA AGGGCCGACC CACATGGTCA TGCCCGCCAT CCACCTTTCG
CGTTACCAGG TCGCCGACCT CTTCACCGAT GTGACCAAGC AGAAGCAGGA GTCCGAGATC
GAGAAGTTGG TCAAGGTCGC CCGCCGGGAA CTGCGTCAGC GCTATGTCGA GGCTGACATG
GGCATCAGCG GAGCCAACTT CGCTATTGCC GACACCGCTT CCCTGGGCAT CGTCACCAAC
GAAGGCAACG CCCGTTTGGT GACCACGCTG CCGCGGGTCC ACGTGGCCAT GGTCGGTTTG
GACAAGCTGA CGCCGAATCT GCACGACGCC CTGCGGATGT TGCAGGTTCT GCCCCGCAAC
GCCACCGGCC AGACCATCAC CTCCTACGTG AGCTGGATCA CCGGACCATC CGAGTGCCAG
TCCGCCGAAG ACGGGCAAAA GCAGATGCAC ATCGTCTTTT TGGACAACGG CCGCCGGGCC
CTGGCCAAGG ACGACACCTT TTCCGAGGTC CTGCGCTGCG TGCGCTGCGG TGCCTGCGCC
AACGTCTGCC CTGTTTACCG TATGCTGGGC GGACACGATT ACGGCCACGT CTACATCGGC
GCCATCGGCC TGATCCTGAC CTACTTTTTC CACGGCCGCG AGTACGCCAA AAACCTGGTC
CAGAACTGCA TCAACTGCCA GGCGTGCAAA GAGGTTTGCG CCGCTGGCAT TGATCTGCCG
AGCATGATCA AAGAGATCCA TGCCCAAATT CTGGACGAAG AAGGGCATCC GGCCAGTTCG
ACAATGCTGG CCAAGGTCTT GCGCAACCGC AAGCTGTTCC ACGGCTTGCT GCGCTCTGCC
CGTTACGCCC AGCGTCCGGT CACCGGCGGA ACCCCGTACC TGCGCCACTT GCCGCAGATG
TTCGCCAAGG ACCACAACTT CCGCGCCCTG CCGGCCATCG CCAAAAAGCC GTTCCGGGAC
CAATGGGAAG ACATCAAGCC CACCGAAGCC AAGACCCGCT ACAAAGTGGC CCTTTTCTCG
GGATGTGTGC AGGACTTCGT CTATCCCGAA CAGCTCAAGG CCGCGGTCGA TGTCATTGCA
GGGCACGGTG TCGAACTGGA CTACCCCATG GACCAATCCT GCTGTGGTCT GCCGGTGCAG
ATGATGGGCG AAAAGCAGGC GGCCAAGGAT GTGGCCATGC AGAACATCAC GGCCATGGAC
CCGTCAGAAT ACGACTACAT CCTGACCCTG TGCGCCTCCT GTGCCTCGCA CTTGAAACAC
AACTACCCCA AATTGCTCAA AAACGAATCG GCCATCTCGC AGGTCAAGGT CGAGCAGTTT
GCGGACCGGG TCATCAGCTT CAGCTCATTT ATCCATGATG TGCTCGAACT GGATGAAAGC
GAATTCAAAA AACAGGGCAA AAAGACGACC TACCATGCCC CGTGCCACCT CTGCCGCGGC
ATGGGTGTCA CTGAAGCCCC GCGGGAAATG ATCAGCAGAT CGGGTCTCGA TTTTGCCCCG
GCGGATGAAG AGCAGACCTG TTGCGGGTTT GGCGGAACCT ATTCCAGCAA ATTCCCCCAA
CTCTCACGGG AAATTCTGAA CAAAAAACTC GACGACTTCA AAAAAACTGG TGCTGAGCAA
TTGGTCACCG AATGTCCTGG TTGCGTCATG CAGCTGCGTG GCGGGGTTGA TAAACGGGGC
GACTCGATAG AAGTCCTGCA TATTGCTGAG GCCTTGGCCA AGCAAAAACT GTAG
 
Protein sequence
MQEAKNLKEY REKIHGTLEN DFLRSTLDNF AVAYRAGRAK AFAGMDEKAL INEIAAAKDD 
SISRMDQLFD EFKTKAEQIG IKVHLAHTAH EANEIIAKIA AQNDCQKIVK SKSMTAEETH
LNHHLEKEGL KVTETDLGEW IIQLRHEGPT HMVMPAIHLS RYQVADLFTD VTKQKQESEI
EKLVKVARRE LRQRYVEADM GISGANFAIA DTASLGIVTN EGNARLVTTL PRVHVAMVGL
DKLTPNLHDA LRMLQVLPRN ATGQTITSYV SWITGPSECQ SAEDGQKQMH IVFLDNGRRA
LAKDDTFSEV LRCVRCGACA NVCPVYRMLG GHDYGHVYIG AIGLILTYFF HGREYAKNLV
QNCINCQACK EVCAAGIDLP SMIKEIHAQI LDEEGHPASS TMLAKVLRNR KLFHGLLRSA
RYAQRPVTGG TPYLRHLPQM FAKDHNFRAL PAIAKKPFRD QWEDIKPTEA KTRYKVALFS
GCVQDFVYPE QLKAAVDVIA GHGVELDYPM DQSCCGLPVQ MMGEKQAAKD VAMQNITAMD
PSEYDYILTL CASCASHLKH NYPKLLKNES AISQVKVEQF ADRVISFSSF IHDVLELDES
EFKKQGKKTT YHAPCHLCRG MGVTEAPREM ISRSGLDFAP ADEEQTCCGF GGTYSSKFPQ
LSREILNKKL DDFKKTGAEQ LVTECPGCVM QLRGGVDKRG DSIEVLHIAE ALAKQKL