Gene Dret_1008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1008 
Symbol 
ID8418830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1181772 
End bp1182998 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content60% 
IMG OID645037577 
Productpeptidase M24 
Protein accessionYP_003197874 
Protein GI258405132 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.998563 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0392371 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATGCTG CCTTGGAGTG CATTCCTGAT ACTGAGTTGG CCTGGCGGTG GCAACGGTGC 
CAGGCATTGC TTCGCGAACA GCACCCCGAA TGCGGGGGGA TGCTGGTTTT TTCGCGCCCG
AATATCTATT ATTTAAGCGG CCATTGGGGA AACGGCGTTT TTTGGCTGCC AGCTGAGGGC
GACCCCGTTT TTCTGGCCCG TAAGGGGGCA GATCGAGCCG CTTTGGAAGC GCCGGGGCTC
CGCCTGGGAC GGTTCGGGTC TTTTCGGGAC ATCCCAGGGC AGTTGCAGCA CTTCGGGACC
CCGTTGTCCG AAATTCTTGC TGTTGAAAAA AGCGGTCTGA CCTGGACCTT GGGGGAGATG
TTCGCTAGCC GCCTTCCGGA CCACCGGTTT GTGGATGGCG ACCAGATCTT GCATCAGGCC
AAGGCCGTCA AGTCGGAGTG GGAATTGCGG AAAATGGCGC TGGCTGGGAG CCGTCACGAA
GCGGTGTTGC TCAATGACCT TCCGGGGATT TTGCACCCTG GGATGAGCGA GCGCGACATC
GCAGTTGCGC TGTGGCAGGC GATGTTCGAG CGGGGGCACC AAGGGATGAT GCGCATGCAA
AATCCCGGGG AGGAAATCTT TCTCGGGCAT GTCGCGGCTG GCGACTCCGC CAATTACCCG
TCGGTTTTCA ATGGACCGGT CGGGTTGCGC GGCGCCCATC CCGCGATACC GCACATGGGG
TATGCGGGCC AGACCTGGCA ACACGGCAGT CCGCTGGTGA TTGATGTGGG GTTTTGTCTC
GAAGGCTACC ACACCGACAG AACCCAAGTG TATTGGGCGG GGCCGGAAAA TTCGGTCACG
GAGCAGGCCC ACAAGGCCCA CGTCTTTTGC ATAGCCGTTC AGAAATGGCT CGCTGCTCGA
TTATGTCCCG GCGCTGTGCC CAGCGCCCTT TTTCGTGAGG TCTGGAATTG GGCACGCGAA
GAGGGATGGG AAGAAGGTTT CATGGGCCTT GGGGCGAACA AGGTGCCTTT CCTGGGCCAC
GGGATCGGTC TGGCCATTGA TGAACCGCCG GTGATCGCTT CCAGATTCGA TCGACCTTTG
GAGACGAATA TGGTCCTGGC CCTGGAACCG AAGATCGGCC TGGATGGTCT CGGGATGGTG
GGGGTTGAAA ACAGTTTTGT GGTGACCAGT GAAGGGGGGC GCTCCTTGAC CGGCTCCAGA
TGGGACATCT GTTGTGTTGG CCGCTGA
 
Protein sequence
MYAALECIPD TELAWRWQRC QALLREQHPE CGGMLVFSRP NIYYLSGHWG NGVFWLPAEG 
DPVFLARKGA DRAALEAPGL RLGRFGSFRD IPGQLQHFGT PLSEILAVEK SGLTWTLGEM
FASRLPDHRF VDGDQILHQA KAVKSEWELR KMALAGSRHE AVLLNDLPGI LHPGMSERDI
AVALWQAMFE RGHQGMMRMQ NPGEEIFLGH VAAGDSANYP SVFNGPVGLR GAHPAIPHMG
YAGQTWQHGS PLVIDVGFCL EGYHTDRTQV YWAGPENSVT EQAHKAHVFC IAVQKWLAAR
LCPGAVPSAL FREVWNWARE EGWEEGFMGL GANKVPFLGH GIGLAIDEPP VIASRFDRPL
ETNMVLALEP KIGLDGLGMV GVENSFVVTS EGGRSLTGSR WDICCVGR