Gene Dret_2045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_2045 
Symbol 
ID8419890 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2343132 
End bp2344439 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content53% 
IMG OID645038633 
Productpreprotein translocase, SecY subunit 
Protein accessionYP_003198907 
Protein GI258406165 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0201] Preprotein translocase subunit SecY 
TIGRFAM ID[TIGR00967] preprotein translocase, SecY subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000027617 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00154147 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCTGCCAA AAGCGGACAA CCTCCCAGGC TCCAGTGAAC TGAGTAAGCG GATCATAACC 
ACCTTTTTGT TGCTCGTTGT CTATCGGTTG GGGATTCACA TTCCCATACC GATGGTTGAC
GGCCAGGCAC TGGCTGAATT TTTTGCCAAC GCCCAAAATA CCCTTTTCGG GTTGTTTGAC
ATGTTTTCCG GCGGCGGACT GAGTAAACTC TCCATTTTCG CCCTCGGAAT CATGCCCTAT
ATTTCGGCGT CGATTATCAT TCAGCTCTTG ACGGTCGTTA GCCCCAAGCT TGAGGAATTG
AAAAAGGAAG GGGCCTCGGG GCAAAAGAAG ATCACAGAGT ACACCAGATA TCTGACAGTG
CTTATAACCC TGGTCCAAGG GTTCGGGATC TCTTTTGGCC TGGAGCGGAT GACCAGTCCC
ACTGGGGCTC CAGTTATTCC TGATCCGGGC TGGATGTTCC GGCTGACCAC TGTGATCACC
CTCACAACCG GGACCGTTTT TCTCATGTGG CTTGGCGAGC AGATCACTGA CCGGGGAATC
GGCAACGGCA TTTCCCTGAT CATCTTTGGT GGCATTGTCG CCCGGCTCCC CGGGGCTGCT
GGCAATACCT TCCGGCTCAT GTCCGCCGGG GAGATGTCGC TCTTTCTGGT CTTGCTGCTT
TTGACCGTAA TGCTCGGCGT GTTGGGTTTT ATCGTTTTCA TGGAGCGCGG GCAGCGTCGG
TTGCCCATCC ATTATGCCAA GCGACAGATG GGCAGGAAGA TGTATGGGGG GCAAACGAGT
CATTTACCAC TTCGGGTGAA TACCGCCGGG GTCATCCCGC CAATTTTTGC CTCCTCGATT
CTGATGTTCC CGGCGACGGT GGCGAATTTT TCGCAAGTGC AGTGGATGCA GGAGTTCTCA
AATTATTTTC GTCCAGGCTC AACCATCTAC ACACTGGTTT TCGTCGCCTT GATCCTGTTT
TTCTGCTACT TCTATACCGC GATCATTTTC GATCCCAAGG ATATAGCTGA CAATCTGCGC
AAGCAGGGAG GCTTTATTCC CGGGATCCGT CCGGGCCTGA AAACAAAACA GTATATCGAC
AAGGTCCTGT CGCGGATCAC ATTGTGGGGC GCCTTGTATG TGGCAGCGAT ATGTGTACTG
CCCATGCTCC TGATCTCTCA ATTTAATGTC CCCTTTTACT TTGGCGGCAC AGCGTTGCTC
ATTGTCGTTG GGGTGGCCAT GGACACGATG GGCCAGATCC AGTCCCATGT GATCTCGAAT
CAGTATCAGG GACTGATGCA AAAAGCCGGT CTCCGAGGAA GGCGCTAG
 
Protein sequence
MLPKADNLPG SSELSKRIIT TFLLLVVYRL GIHIPIPMVD GQALAEFFAN AQNTLFGLFD 
MFSGGGLSKL SIFALGIMPY ISASIIIQLL TVVSPKLEEL KKEGASGQKK ITEYTRYLTV
LITLVQGFGI SFGLERMTSP TGAPVIPDPG WMFRLTTVIT LTTGTVFLMW LGEQITDRGI
GNGISLIIFG GIVARLPGAA GNTFRLMSAG EMSLFLVLLL LTVMLGVLGF IVFMERGQRR
LPIHYAKRQM GRKMYGGQTS HLPLRVNTAG VIPPIFASSI LMFPATVANF SQVQWMQEFS
NYFRPGSTIY TLVFVALILF FCYFYTAIIF DPKDIADNLR KQGGFIPGIR PGLKTKQYID
KVLSRITLWG ALYVAAICVL PMLLISQFNV PFYFGGTALL IVVGVAMDTM GQIQSHVISN
QYQGLMQKAG LRGRR