Gene Dret_1028 details

Gene Information

Locus tag: Dret_1028
Symbol:
ID: 8418851
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 1209531
End bp: 1210886
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 55%
IMG OID: 645037598
Product: protein of unknown function DUF224 cysteine-rich region domain protein
Protein accession: YP_003197894
Protein GI: 258405152
COG category: [C] Energy production and conversion
COG ID: [COG0247] Fe-S oxidoreductase
TIGRFAM ID:
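
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon. Both are assumptions, although the values in this record are consistent with them. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1209531
end_bp = 1210886
gene_length_bp = 1356
protein_length_aa = 451

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes the terminal stop codon,
# which is not counted in the protein length.
codons = gene_length_bp // 3
coding_codons = codons - 1

print(span_bp == gene_length_bp)           # span matches the reported gene length
print(gene_length_bp % 3 == 0)             # length is a whole number of codons
print(coding_codons == protein_length_aa)  # codons minus stop match the protein length
```

Under these assumptions all three checks print True for this record: 1210886 - 1209531 + 1 = 1356 bp, and 1356 / 3 - 1 = 451 aa.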


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCAATA AAATGGACGA CAACGCTGCG GCCTTCGACC GCTATGTGGC TGACGCCGGG 
CTTGATGCGG GGGTCGAAAA GCTCAAGGAA AACCCGGAAA AGATCAAAAA GGCCGTCAAT
CAGGTTCTTA ACGGCGAAGG TGGAGCCCGC CTCAAAGGAT ACGTCGAGAC CTGCGTGCAC
TGTGGATTGT GCTCCGAGGC CTGTCACTAT TTCCATTCCC ACGACAGGGA TCCGCAATAC
TCGCCTGTCG GCAAGGTTAA ACAGACGTTG TGGGAGTTGA TACGCAAAAA AGGCGACGTC
AGTCCGGAAT TCATTCGGAA TTGCGCCCAA ATCGCCTACA CCGAATGCAA CCTGTGCAAG
CGGTGTGTCA TGTACTGCCC ATTCGGCATC GATACCGCCT ATCTCATGTC CAATGTCCGG
CGGATCTGCC ACCTTCTTGG TGTCACCCCG CAATACCTCC AGGACACGGC CCACAGTCAT
GCCGCGACCT TCAACCAGAT GTGGGTCAAA GAGGACGAAT GGATTGACAG CCTCCAATGG
CAGGAAGACG AAGGCCGCGA CGAGATCCCG AATCTGCGCA TCCCCCTGGA AAAAGAGGGC
GCAGATGTCA TGTACTCCGT TATCGGCCCC GAGCCGAAAT TCCGGACCCA GCTCATCCTG
CAGGCCGGGG TTTTGATGCA CGAATGCGGC ATCAATTGGA CAATGCCAGC GACCACAGGC
TGGGATAACA GTGATATGGC CATGTATTCC GGAGACTCGG AACTCATGGG CCGCTTGAAG
CGGCAGCATT TCGAAACGGC CAGCCGCTTG AAAGTCAAAC GCATCGTCAT GGGCGAATGC
GGCCACGCCT TTCGTTCTGT CTACGATACC GGCAACCGCT GGCTGGCCTG GCAAAAACCG
CCCATCCCCA TTGTCCACGC CATCCAGTTC TACTGGGAAT TGCTGCGGGA CGGAAAGCTC
AAGGTGGCCA AACAATTCGA CAAGCCGGTG ACCATCCACG ACCCCTGCAA TATCATCCGC
GGTATGGGGC TGCATGAAAA GCTCCGGGAG GTCACGCACG CCTTTTGTTC CAATGTGACC
GAGATGTACC CGAATCGGGA GCATAATTAT TGCTGCTGTG CCGGAGGGGG CGTCATCAAC
TGCGGGCCTC CTTTCCGGAA CACCCGAGTC GAGGGCAACC GGATCAAAGC CGAGCAGATC
AAGGAAACCG GTGCAGAAGT CGTTATCTCT CCCTGTCACA ACTGCCACGG TGGCCTTGAA
GACATCATCC ACAAATACCA TCTGGGAACG GAACTGAAAT TCCTCATCGA TATCATCTAC
GAATGCATGG AAAAACCGAA TTCTATCGAG GAATAG
 
Protein sequence
MLNKMDDNAA AFDRYVADAG LDAGVEKLKE NPEKIKKAVN QVLNGEGGAR LKGYVETCVH 
CGLCSEACHY FHSHDRDPQY SPVGKVKQTL WELIRKKGDV SPEFIRNCAQ IAYTECNLCK
RCVMYCPFGI DTAYLMSNVR RICHLLGVTP QYLQDTAHSH AATFNQMWVK EDEWIDSLQW
QEDEGRDEIP NLRIPLEKEG ADVMYSVIGP EPKFRTQLIL QAGVLMHECG INWTMPATTG
WDNSDMAMYS GDSELMGRLK RQHFETASRL KVKRIVMGEC GHAFRSVYDT GNRWLAWQKP
PIPIVHAIQF YWELLRDGKL KVAKQFDKPV TIHDPCNIIR GMGLHEKLRE VTHAFCSNVT
EMYPNREHNY CCCAGGGVIN CGPPFRNTRV EGNRIKAEQI KETGAEVVIS PCHNCHGGLE
DIIHKYHLGT ELKFLIDIIY ECMEKPNSIE E
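
The gene and protein sequences above are related through translation table 11. The following is a minimal sketch of how that relationship could be checked, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and it assumes the sequence starts with ATG, as this one does. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and introduced here only for illustration.

```python
from itertools import product

# Codon table in the conventional T, C, A, G base order.
# Amino-acid assignments are those of the standard code, which
# translation table 11 shares (only start-codon rules differ).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence shown above.
first_blocks = "ATGCTCAATA AAATGGACGA CAACGCTGCG"
print(translate(first_blocks))  # MLNKMDDNAA
```

The printed peptide matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and the reported GC content.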