Gene Dret_1567 details


Gene Information

Locus tag: Dret_1567
Symbol:
ID: 8419397
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 1815968
End bp: 1816987
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 60%
IMG OID: 645038140
Product: Tetratricopeptide TPR_2 repeat protein
Protein accession: YP_003198429
Protein GI: 258405687
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4235] Cytochrome c biogenesis factor
TIGRFAM ID:
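
The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; both are assumptions, although they agree with the values in this record. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons; the stop codon,
    if present, does not contribute a residue.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Dret_1567.
length = gene_length(1815968, 1816987)
print(length)                  # 1020
print(protein_length(length))  # 339
```

Under these assumptions the three fields are mutually consistent: 1816987 - 1815968 + 1 = 1020 bp, and 1020 / 3 - 1 = 339 aa.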


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.00816314
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0470804
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCAATG CGTCGCGTTC AAAGGTCCTC GCCGGGGTGG CAATCTGCCT CGGATGGATT 
TTCCTGGCGG GCTGCGCCAA TCTGCAAACT CGATTCGACA GTGTCCTGAC CTCGTATCAG
GGACAGCGCT ACCTGGAAGA ACACGAATAC GCCCTGGGGG TCGAGGACTT GTCCCACCGG
CTCAAGCAGC AGCCGGACAA CGGTGCGGCC GCCTATTGGC TGGGCCGGCT TTATCTGGCC
CAGGAGCACC CCAGCAAGGC CCTGCCTGCT TTACAAAAAG CGGTGGAACT CAAACCGCAA
TACGCAGACG CCCATTTCTG GCTCGGTGTC GCCCATTGGG CGATGATGGA TTTCGAGAAG
GAGCGGTTGG CCTATGAACG GGCTCTGGCT CTCGAGCCTG ACCACACCCA GGCACGGGTC
TATCTCGGCC ACCATTATGT GGATCGGGAG CAATGGTCTC TGGCCCTGAT CCATTATCGG
CGTGTCCTGG ATGAAGAGCC CGGCCATCCT TCCGCTCTTT TTTATACGGC CGAATGTCTG
GAACAACTGG GGCGGGAACA AAGCGCCCGG CAGGCCTGGA AAGCGTATCT GGACCGCTAT
CCCGACGGTG GGCGGGCCCT GGAGGCGACC CGGCGTCTGA ACGGGTTCGG CGACTTCAGT
TATCGCAACA TCATTCTCGG CAAGCGGCAG GTGACCATCG AAAAAATCCG TTTCGAACAG
GGCACAGCCA CATTGAAGTC CTCCAGTCTG CCCTCACTGG ATCTGATCGG GGCGAACCTG
GAGCGCCGTT CTGATCTCCG TTTGCACGTG ATTGTCTATG TCCAGGAGGA TGCAGCGCTG
GCCCGGAAAC GGGCGCAGGC CATTGAGGAG GCGATTGTCC AGCGCACGAG CGGAGCCGAT
TCCGAACAAC TCCTCTTGAG CTGGTTCGGT CAGGCCGAAA CGATCACAGT GGATGGCGAA
CGTTTCCAAG AACCGCGGTC GGTCCATTTT GTCACCGAGG CCGGGCCGGA CGTATCCTGA
 
Protein sequence
MPNASRSKVL AGVAICLGWI FLAGCANLQT RFDSVLTSYQ GQRYLEEHEY ALGVEDLSHR 
LKQQPDNGAA AYWLGRLYLA QEHPSKALPA LQKAVELKPQ YADAHFWLGV AHWAMMDFEK
ERLAYERALA LEPDHTQARV YLGHHYVDRE QWSLALIHYR RVLDEEPGHP SALFYTAECL
EQLGREQSAR QAWKAYLDRY PDGGRALEAT RRLNGFGDFS YRNIILGKRQ VTIEKIRFEQ
GTATLKSSSL PSLDLIGANL ERRSDLRLHV IVYVQEDAAL ARKRAQAIEE AIVQRTSGAD
SEQLLLSWFG QAETITVDGE RFQEPRSVHF VTEAGPDVS
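
The sequences in this record are printed in space-separated blocks of 10 characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup together with a GC-fraction calculation; the helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence from the record, in its 10-base block layout.
first_line = "ATGCCCAATG CGTCGCGTTC AAAGGTCCTC GCCGGGGTGG CAATCTGCCT CGGATGGATT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Applying the same two functions to the full gene sequence would allow the listed gene length and GC content to be compared against the sequence itself.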