Gene Dret_0461 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0461 
Symbol 
ID8418267 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp565346 
End bp566524 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content61% 
IMG OID645037023 
Productprotein of unknown function DUF399 
Protein accessionYP_003197336 
Protein GI258404594 
COG category[S] Function unknown 
COG ID[COG3016] Uncharacterized iron-regulated protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.978142 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATTGA TGCCGATCAT CTCTGTTGTG GTTGGCGCTT TGTTGCTGGC GGCCTGTGGC 
GGGCCGCCAG CCCTGCAAAG CGATCCGGCC GCGGGTTTGC GCAAAGGGAC ATTGGTCACC
CACGCCGGGG AGCCAATGTC CCTTTTTGCT TTTGCGCAGC AGGCCCGTCA CGTCGACTAC
CTCCTCATGG GCGAGGCCCA CACCAACGCC TGCGATCATG CGGTGCAGGC GGACGTGCTG
CAGGTCTTGG CCTCCCAGGA CCTGCAGCCT GTGCTTGGCC TGGAAATGGT GCCGGCAGCG
AAACAGCCTG TTTTGGACCG TTTCAACCAG GGCCGGCTCT CCGTGGAGGA GTTGCCTGAG
GCCCTTGATT GGCAGACAAG CTGGGGCCAC CCGTATTCGC TGTACAAGCC GGTTTTCCAA
GTCGCTTCCG ACGCCGGGGT CGATTTGTAC GCGCTGAATA TCGAGCAGGC CGTCCTGGAC
GAGGTGCGGG AAAAGGGACT TGAGGGGATG GCGCCTGAGA AACGGGCGCG GCTCCCCGTA
ACGATCCTGG ATCCGCCAGA ACCTCAGCGG CAGGCCCTCG AGGAAGAATT TTCCGGGCAC
CAGAAACTGT TCCAGCAAAT GGGCAACGCT ACCAGCGGTT CATTGGAGCG GTTCATGTTG
ATCCAGTCCA TCTGGGATAC CCAGATGGCT AGTCGGGCCC GGCGGGTGCA CGCGCAGACC
GGGCGACCGG TGGTTATCCT GACCGGGACC GGGCATGTTG AATACGATTG GGGGATTGTC
TCCCGACTGC ACCGCTGGGA CCCCCAAGCG AAGATCGTAA GCGTGATAGG ATGGCGTGGC
GGGACGCTGC CCGAAGCTGA GGCCGCCGAC TGGTTTTTTT ATTGCCCCTT GCAGCATACC
AGCCGCCTGG GGTTTACCAT GGAGATGCGT CCTGAAGGGG CTCGAGTGAT GACTGTGGAG
CCCGGCAGTC GTGCCGCTCG GGGCGGACTG CAAAGCGGTG ATCTTCTGGT GAAAGTGGGC
GGTGAGCCAT TCACCGGGTT GTGGGACTTG CATCAGGCCG CCATGGACGC AGTCCAGGCC
GAAGAACCGA TGCAGATCAC AGTCAAGCGT GAGGGCGCCT CTGTTGAACT GAGCTTGGAT
TTGCAACGCT CAGGGACTAC AGACGGTCCA ACAGATTGA
 
Protein sequence
MGLMPIISVV VGALLLAACG GPPALQSDPA AGLRKGTLVT HAGEPMSLFA FAQQARHVDY 
LLMGEAHTNA CDHAVQADVL QVLASQDLQP VLGLEMVPAA KQPVLDRFNQ GRLSVEELPE
ALDWQTSWGH PYSLYKPVFQ VASDAGVDLY ALNIEQAVLD EVREKGLEGM APEKRARLPV
TILDPPEPQR QALEEEFSGH QKLFQQMGNA TSGSLERFML IQSIWDTQMA SRARRVHAQT
GRPVVILTGT GHVEYDWGIV SRLHRWDPQA KIVSVIGWRG GTLPEAEAAD WFFYCPLQHT
SRLGFTMEMR PEGARVMTVE PGSRAARGGL QSGDLLVKVG GEPFTGLWDL HQAAMDAVQA
EEPMQITVKR EGASVELSLD LQRSGTTDGP TD