Gene Dret_1659 details

Gene Information

Locus tag: Dret_1659
Symbol:
ID: 8419490
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 1911242
End bp: 1913599
Gene Length: 2358 bp
Protein Length: 785 aa
Translation table: 11
GC content: 60%
IMG OID: 645038233
Product: type II secretion system protein E
Protein accession: YP_003198521
Protein GI: 258405779
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB
TIGRFAM ID:
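
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon (the record states the gene is not spliced). The function names are hypothetical.

```python
# Values copied from the record above.
start_bp = 1911242
end_bp = 1913599
gene_length_bp = 2358
protein_length_aa = 785


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the assumptions above match the record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions the reported gene length and protein length are consistent with the coordinates.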


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.233969
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.903835
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTGGG AAGGGTTCCT CTCGTTTCTC CGCCAGCGGC GCCGGCTCGT GCAGGAGACT 
GCTGAGCAGG GGGATCTGCG TTCCAGCGCC CGCGCCGCGC AGCGAACCCT TTTTGTCCTC
AGGCAGACCA GACCCCTGGC CGCCCGAAAG GGATTGACCA AGCAATGGCG CAAAATGGAG
GACTATTTTG CTGGCCGCAG TGAACCCGGC CTCGGAGCGG AGGAACCCGC AGGAGGCGAA
GTGGCCTTAG GCACCGGCCA GGGTGTCTTG GAGCGGGCTC GCAAGCGGTG TGCCAATGGT
GATTATGAAG GCGGCCTGAC GGTATTGTTG CACGAGTACC GCAACGGCAA ACGGGATCCG
GTGTTGTTGC GCTTTGCCGC TGAATGTTTC GAGCACAGCA ATCGCTACCA GGAGGGGGCG
GATTTTATCG CCAGCGCCCT TTTGGAGCAG GAATTCCCTC CCGCCGACCA AGCCCATCTC
CAATTCGTCT TGAGCGGGTG GTATCTCCAA CTCGGCAAGA CGAATGAGGC CCGGCAGGCG
TTGTGGAAGG TCCAACGCCT CGATCCCCAT TACCCAGGAC TGCAAGGCCG CCTCCAGCAT
CTTTTGCCCG CCCAGACCGA GAAGCCGAAA AGCCGCTACG GGCTGCTGCT CGCCTCGGGC
CAGCTCAGTG AAGAACAACT GCAGCAGGCC ACCGAGGAAG CCAATAAGCG TGACGGTGAT
GTGGACCAGG TCCTGCTTCG CGATTACGGC ATCGACCGTG ACGTCCTGGG GGCCTCCCTG
AGCGCTTTCT ACGATGTCGA TTTTGTCGCT TTTGACCGCG AGATTGACCC GCCGTTCGGG
CTCTTTGAAA AACGGAGTCT GGACCCGGAC TTCCTCAAAC GCTACGGCTG GGTCCCCTTT
GCAGAAGAAG GGCAGGACAT CGTGGTCCTG ATGAGCAATC CTTTTGACCT TGGCCGCATG
GATGAGATCC GGTTTATTCT GGGCACCAGC CGGATCGTCC CCAAGGTCGC CCTTCAGGCT
GATATCCAGG CCTATATCGA CCATTTCTTC CAAAGCTTCG GGCCTTCTGA AGAAATTTTT
TCCTTTGATG AGGAGGTTGT TGCCCTGGAC GAGGACGACA GCCGTTTGGA AGGGGTTGAA
GAGGTCTCGG AGGAAGACAG CGAAGTCGTT CGGCTGGTCA ATTCCCTGCT TATCGAAGCC
TGGAAGCGGC AGGCCTCGGA CATCCATATT GAGCCCGATT CCCGCAACCG GTCCTGCACG
ATCCGGCTGC GCGTTGACGG GACCTGCCAT GAATTTCGCA AATTCCGCAT CGGGCTGGCC
CGCCCCCTGG TCTCGCGGGT CAAGATCATG GCCCATCTGG ACATTGCCGA ACGCCGCCTG
CCCCAGGACG GCAAGATCAA GCTCAAATTG CCGGGGATGA ACACGGTGGT CGAATACCGG
GTGGCGACCC TGCCCACGGT CGATGGGCAG GAGGATGTGG TCATGCGGGT TTTGTCCTCG
GGCAAACCGC TGCCTCTGGA GCAATTGGGG CTGCATTCCG GCGTCCTCGA GGCCTTTAAA
CGCATGGTCT ATCGCCCCTA CGGGCTGTTG CTGGTCGTCG GGCCCACGGG GTCGGGAAAG
ACGACGACCC TGCACTCTGC CGTCAGCTAT ATCAACAGCT CGGATCGGAA GATCTGGACC
GCCGAGGACC CGGTGGAGAT CACCCAGACC GGCCTGCGCC AACTCCAGGT CCAGCCGAAG
ATCGGTCTGA CCTTTGCCGC GGCGCTGCGA TCCTTTTTGC GCGCCGACCC GGACGTGATC
ATGATCGGGG AGATGCGCGA CGAGGAGACC GCCCATATCG GGGTGGAAGC CTCGCTGACC
GGGCACCTCG TCTTTTCAAC CCTGCACACC AATTCCGCTC CGGAAACCAT CACCCGTTTG
CTGGACATGG AACTGGATCC CTTCAATTTT GCCGATTCCC TGCTCTGTGT GCTGGCCCAG
CGGTTGGTCA AGACCTTGTG CCCGCGGTGC AAGAAACCAT ATACGCCGTC TGCCGAGGAG
ATAGAGGAGT TGCGTCTGGA ATTCGGCACG AATTGGGAGA CCGCGGTCCC CGAACAATGG
CGGGCTGAAC CGGTTCTTTA CCAGCCCCAC GGCTGTTCCT CGTGCCTGGG CGGGTATCGC
GGCCGGACCG GCATTCATGA ATTGATGCTC AATACCGGGG GGTTGAAGAC CTGCATCAAA
CACCGCAAGC CCACGGAGGA ATTGCGGCTC CAAGCGGTGG AGGACGGCAT GCTCTCGCTC
AAGGAGGATG GCCTGTTGAA GGTCATTGAG GGGTTGACGG ATGTGAGTCA GGTCCGCAAA
GCCGTCGGAG GAAGCTGA
 
Protein sequence
MSWEGFLSFL RQRRRLVQET AEQGDLRSSA RAAQRTLFVL RQTRPLAARK GLTKQWRKME 
DYFAGRSEPG LGAEEPAGGE VALGTGQGVL ERARKRCANG DYEGGLTVLL HEYRNGKRDP
VLLRFAAECF EHSNRYQEGA DFIASALLEQ EFPPADQAHL QFVLSGWYLQ LGKTNEARQA
LWKVQRLDPH YPGLQGRLQH LLPAQTEKPK SRYGLLLASG QLSEEQLQQA TEEANKRDGD
VDQVLLRDYG IDRDVLGASL SAFYDVDFVA FDREIDPPFG LFEKRSLDPD FLKRYGWVPF
AEEGQDIVVL MSNPFDLGRM DEIRFILGTS RIVPKVALQA DIQAYIDHFF QSFGPSEEIF
SFDEEVVALD EDDSRLEGVE EVSEEDSEVV RLVNSLLIEA WKRQASDIHI EPDSRNRSCT
IRLRVDGTCH EFRKFRIGLA RPLVSRVKIM AHLDIAERRL PQDGKIKLKL PGMNTVVEYR
VATLPTVDGQ EDVVMRVLSS GKPLPLEQLG LHSGVLEAFK RMVYRPYGLL LVVGPTGSGK
TTTLHSAVSY INSSDRKIWT AEDPVEITQT GLRQLQVQPK IGLTFAAALR SFLRADPDVI
MIGEMRDEET AHIGVEASLT GHLVFSTLHT NSAPETITRL LDMELDPFNF ADSLLCVLAQ
RLVKTLCPRC KKPYTPSAEE IEELRLEFGT NWETAVPEQW RAEPVLYQPH GCSSCLGGYR
GRTGIHELML NTGGLKTCIK HRKPTEELRL QAVEDGMLSL KEDGLLKVIE GLTDVSQVRK
AVGGS
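
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch, not part of the original record, of how one might do that and compute a GC percentage to compare against the reported values (2358 bp, GC content 60%). The helper names are hypothetical, and the demo uses only the first two blocks of the gene sequence; paste the full sequence to check the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence, used only as a short demo.
fragment = clean_sequence("ATGAGTTGGG AAGGGTTCCT")
print(len(fragment), gc_percent(fragment))
```

The same `clean_sequence` helper works for the protein sequence, whose joined length can be compared with the reported 785 aa.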