Gene Dret_1996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1996 
Symbol 
ID8419841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2293439 
End bp2295142 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content53% 
IMG OID645038584 
Producttype II secretion system protein E 
Protein accessionYP_003198858 
Protein GI258406116 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGACAAA AGATCAGGCT TGGGGAAATG CTCGTAGAGC AAGGCATGCT TACCAAAGAA 
CAGCTTGACA GCGCTTTGGC TGAGCACAAA AAACAAGGCC TTAAGCTAGG TCAGTATCTT
GTTCGGTTCA ATATCATCCG GGAAGACAAG GTTGTTGAAC TTTTAAGCCA ACAAATGCGA
ATCCCCAGGT ACACTCCCTC CAAGTATCCC TTGGATATGC ATTTGGCCCA GATTGTCCCC
TCGGAGATAG CCCAAAAATA TCAGGCAATA CCTTTGATTC GCAGGGGGAA TGTGCTTGGT
GTTGCGACTT CTGATCCCTT GGATATAGAA GCTTTGGACG GCCTCGAAGT ACACACCAAC
ATGGAGATTG AGCCTGTCGT CTGCACCGAA AACGAATTTG AACAAATATA TACGACTCTG
TACGGCATAG ACAGCGGCAT GGAAGAGGTC ATGGAAGATG TGGAGCACAT GGACCTCACC
CATGAAAGCC AAGACACAGA GCAAGCGACG GATGTTGATG TAAGTTCCCT CGAAGACCAG
GCGGATCAGG CTCCAGTTGT CAGGCTGGTT AACTCCATAC TGGCGCAGGC AATCAAAGAG
AGGGCCAGCG ATATCCATCT CAGCCCTGAA AAAGAGAAGG TCCAAACCAG ATTCCGCATA
GATGGCAAAC TGCGGGAAAT GTCCACCCCT CCCAAGAATC TTTTTCCTGC CCTTGTTTCC
CGGATTAAAA TTTTGGCGAA CATGGATATC TCCGTGACCA GAATTCCTCA GGATGGACGA
TTTACAATCA TTTTGCAAAA AAAAGAAATC AATGTCCGAG CCTCTTGCAT CCCCACCATT
TATGGGGAGA ATGTCGTCCT CAGGCTTTTG GATATGAGCG CCAAGGCCTT TACCTTGGAT
GACCTGGGTA TGCAGGAGGA CGATCTGGCC AAAATGGAAG CTGCTATTCA CAAGCCTTAC
GGCTTGATCT TGTCCACAGG CCCCACAGGC AGCGGCAAGA GTACCAGCCT CTACGCCAGC
CTGCGCCGGA TAAACCACCC GGATATCAAT ATCGTTACCT TGGAGGATCC GGTGGAGTAC
CGGGTGGAGG GTGTCCGACA GGTGCAACTC AATCGCCGGG CCGGGATGAC TTTTCCCTCG
GGGCTGCGCT CCATCCTGCG CCAGGACCCG GACGTCATCA TGGTGGGCGA GATCCGGGAC
TCTGAGACCG CCCATACTGC GGTGCAGTCC GCCATGACCG GGCACAGGGT GTTCTCGACG
GTTCACACCA ATGACGCAGC CGGGGCTATT ACCAGGCTCA TAGATATGGG CATAGAGCCC
TTTCTTGTTT CATCAGTGCT CTTGGTCTCC TTCGCTCAGC GCCTGATGCG TCGGGTCTGC
CCCAATTGCG CTGAGAAATA TCAGCCGACT GGCGAGGGCC TGCGCGCCTT GGGCCTGGAG
CAATCCTCCG GTTGCACTTT TCTCCAGGGC CGCGGATGCA ACATGTGCAT GAATTCCGGG
TACAGGGGAC GAATAGCAGT CTTTGAGATA CTCAACATAG ATGAAGAAAT CCAGGACATG
ATTAACCGAC GCGCCACCAC CCGGGAAATT ACTGCCGCCG CAGTCCAGTC GGGCAAGCTC
AAAACATTGA GACATGATGC CGCCAGCAAG GTCTGCCAAG GTTTGACCAC AGTTGAGGAA
GCCGTATCCG TGGTGATGAC CTAA
 
Protein sequence
MRQKIRLGEM LVEQGMLTKE QLDSALAEHK KQGLKLGQYL VRFNIIREDK VVELLSQQMR 
IPRYTPSKYP LDMHLAQIVP SEIAQKYQAI PLIRRGNVLG VATSDPLDIE ALDGLEVHTN
MEIEPVVCTE NEFEQIYTTL YGIDSGMEEV MEDVEHMDLT HESQDTEQAT DVDVSSLEDQ
ADQAPVVRLV NSILAQAIKE RASDIHLSPE KEKVQTRFRI DGKLREMSTP PKNLFPALVS
RIKILANMDI SVTRIPQDGR FTIILQKKEI NVRASCIPTI YGENVVLRLL DMSAKAFTLD
DLGMQEDDLA KMEAAIHKPY GLILSTGPTG SGKSTSLYAS LRRINHPDIN IVTLEDPVEY
RVEGVRQVQL NRRAGMTFPS GLRSILRQDP DVIMVGEIRD SETAHTAVQS AMTGHRVFST
VHTNDAAGAI TRLIDMGIEP FLVSSVLLVS FAQRLMRRVC PNCAEKYQPT GEGLRALGLE
QSSGCTFLQG RGCNMCMNSG YRGRIAVFEI LNIDEEIQDM INRRATTREI TAAAVQSGKL
KTLRHDAASK VCQGLTTVEE AVSVVMT