Gene Dret_1305 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1305 
Symbol 
ID8419134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1528136 
End bp1529194 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content64% 
IMG OID645037881 
Producthypothetical protein 
Protein accessionYP_003198171 
Protein GI258405429 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.122473 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0347421 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTTCG ATATCCCGTT TCTCCCCGAT CCCGAGTATC TTCAGGCCCT GACCCGCTTC 
GAACCAGACC TGGCCACCCT CCATTTCAGC CTGCATGCTC CCGGCATCCC TGACGCCCGG
GCGCGACTCC GCGAGGTCGA AACCGACCGG CTGATCGATC TCTTGCGGCT GGGCCCGGGC
TGTGACCGCC TTGCGCTCCT GAACAGCCGC TTGCACACCC CTGCTTTTCT GCGTGAGCCA
GCGGCAAGGC GCCCGCTGCT CAATGCCCTG GAGCGGCTGC TTGCGGCTGA CGTCTGCCAG
GGGATCGTCG TGGCCGATCC GTACCTGCTC AATGTCCTGG GCGACGATTC GCCCCATCTG
GCCGGGGCGC TTCAGGCAGT GCCCAGCGTC AACGCTCGGC TGGCATCGGT CCACGCGATC
CGGGCCTGGC GCACTGTCAT CGAGGAGGCC GGGTTTTTGC CCCCGGCCCG GCTGGTTCTG
GATCGTGAAC TCAACCGGGA TCCAGGGCAG TTGGAAACCT TGAGCGCTGC CTGTCGCCGG
TTCTGGCCGG ATCAAGAGGT GTTTCTGCTC GCCAATGAGG GATGTCTTCC CCATTGCCCC
TACAAGCCGG CCCACGATGG CCAGATCGCT CTTGCGGCTT GCGACTTGAC GGCTGAGGCC
ACATTCGAAC TCAATGCCAC GGCGGGATGT GTCCGGCATT TGTTGCGCGA TCCCGCAGCT
GTGTTGCGCT CGCCGTTTAT CCGGCCCGAG GACGGCGAAC GCCTTGCCGG GATGGTCGAC
GGGCTCAAGC TGTGCGGCCG GAATATGGGG AGTCGTTTCT GCCTGCGCGT GCTCAGGGCC
TATGTACAAG GCCGATTTTC AGGCAATCTG CTGCAGTTGC TCGACAGCGT GGACTGGATG
GCCGAGGGAC TGGTCATTGA CAACGACACC CTGCCTTCAG ATTTTTGGGA CCAGTTGACC
GGGTGCGACG GGGACTGCCA GGAATGCGGA TATTGTCAGG CCCTGTTGCA GAGTGCTGGC
AGACCGACCC CTCTTGGGCC CTGGGGAAAC CATGCCTGA
 
Protein sequence
MRFDIPFLPD PEYLQALTRF EPDLATLHFS LHAPGIPDAR ARLREVETDR LIDLLRLGPG 
CDRLALLNSR LHTPAFLREP AARRPLLNAL ERLLAADVCQ GIVVADPYLL NVLGDDSPHL
AGALQAVPSV NARLASVHAI RAWRTVIEEA GFLPPARLVL DRELNRDPGQ LETLSAACRR
FWPDQEVFLL ANEGCLPHCP YKPAHDGQIA LAACDLTAEA TFELNATAGC VRHLLRDPAA
VLRSPFIRPE DGERLAGMVD GLKLCGRNMG SRFCLRVLRA YVQGRFSGNL LQLLDSVDWM
AEGLVIDNDT LPSDFWDQLT GCDGDCQECG YCQALLQSAG RPTPLGPWGN HA