Gene Dret_0414 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0414 
Symbol 
ID8418219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp505923 
End bp507278 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content62% 
IMG OID645036975 
Productprotein of unknown function DUF224 cysteine-rich region domain protein 
Protein accessionYP_003197289 
Protein GI258404547 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.137763 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGAAA ACGCGAATAC CACTCCGCCG ACGCCGGGCC ATCACCCGCC CCATGTATTG 
GACGGTCCCC CCGTCTCGAC CTTCACCGCT GCCCTGCGGG ACCTCCTCCC CGAGGCCGGC
GGACTGGATA TGTGTCTGAC CTGCGGCCTG TGCGCCTCCG GGTGCCCCGC TTCCGGTCTT
GAGGGGCTTG ATCCCCGCAA ATTTCTACGT CTTATCGCCC TGGGGCTGGA GGAGGAGGCC
ATCAACTCCC CCTGGGTCTG GATGTGCACC ATGTGCCAGC GGTGCATCAG GGTCTGTCCC
ATGCAAATCG ACATCCCCCA GCTTGTCTAC CAGGCCCGGG CCCGGTGGCC CCGCGAGGAC
CGACCCAAGG GCATCCGCGG CTCCTGCGAT CAGGCGATCA ACAAGGAGAC CAATTCGGCC
ATGGGCGCCG GGGCGGAAGA CTTCGAATTC GTAGTCGCCG ATGTCCTCGA AGAGATCCAG
GAGAACCAGC CTGACTGCGC TGAACTCCAG GCGCCGATGA ACAAGAAGGG GGCCTACTTT
TTTCTCAACC AGAACTCCCG GGAGCCGGTC ACCGAACCCG ACGAGATGGT CCCGCTGTGG
AAGATCCTCA ACTACGTCGG CGCGGACTGG ACCTATTCCT CCGTGGGCTG GGCCGCGGAA
AATTACTGCA TGTTCCTGGC CGATGACGAG GCCTGGGAAT CCATTGTCCG CAACAAAGTC
CAGGCTGTGG AGAATCTCGG GGCCAAGGTC TGGCTCAACA CCGAGTGAGG CCACGAATTT
TACGCGGTCC GGGCCGGACT GCACAAATTC GGGATCACCC CCAATTTCGA ACTCGATTCC
ATCATTCGCT GGTACGCCCG CTGGATCCGC GAGGGGCTGC TGCCGGTGAA TTCGGACTGG
AACAACGATC TGGGCATCAC CTTCACCGTC CAGGACCCCT GCCAGCTGGT CCGCAAATCT
CTGGGCGATC CGGTGGCCGA GGATTTGCGC TACGTAGTCC GTTCCGTGGT CGGCGAAGCC
AACTTCCGGG AGATGTACCC CAACCGCTCC AACAATTATT GTTGCGGCGG CGGGGGCGGC
TTTCTCCAGT CCGGGTATAC CGAAGCCCGA CATGCCTACG GACAGCGTAA ATACGAACAG
ATTATGGCCA CGGGCGCCCA ATACTGCATC GCGCCCTGCC ACAACTGCCA TTCGCAGATC
CATGACCTGA GCGACCATTT CCAGGGCGGC TGGCACACCG TGCACCTCTG GACCCTGATC
TGCCTCTCCC TGGGCATTCT TGGCGAAAAC GAGCGGGAAT ATCTGGGGCC GGAACTGGCC
GAAGTCGCCC TCCCGTCCCC AAGCACTGAG CCGTGA
 
Protein sequence
MLENANTTPP TPGHHPPHVL DGPPVSTFTA ALRDLLPEAG GLDMCLTCGL CASGCPASGL 
EGLDPRKFLR LIALGLEEEA INSPWVWMCT MCQRCIRVCP MQIDIPQLVY QARARWPRED
RPKGIRGSCD QAINKETNSA MGAGAEDFEF VVADVLEEIQ ENQPDCAELQ APMNKKGAYF
FLNQNSREPV TEPDEMVPLW KILNYVGADW TYSSVGWAAE NYCMFLADDE AWESIVRNKV
QAVENLGAKV WLNTEUGHEF YAVRAGLHKF GITPNFELDS IIRWYARWIR EGLLPVNSDW
NNDLGITFTV QDPCQLVRKS LGDPVAEDLR YVVRSVVGEA NFREMYPNRS NNYCCGGGGG
FLQSGYTEAR HAYGQRKYEQ IMATGAQYCI APCHNCHSQI HDLSDHFQGG WHTVHLWTLI
CLSLGILGEN EREYLGPELA EVALPSPSTE P