Gene Dret_1161 details

Gene Information

Locus tag: Dret_1161
Symbol:
ID: 8418989
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 1365926
End bp: 1367335
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 53%
IMG OID: 645037736
Product: type II secretion system protein E
Protein accession: YP_003198027
Protein GI: 258405285
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4962] Flp pilus assembly protein, ATPase CpaF
TIGRFAM ID:
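
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch illustrates. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1365926, 1367335)
print(length)                  # 1410
print(protein_length(length))  # 469
```

Both results agree with the Gene Length (1410 bp) and Protein Length (469 aa) values listed in this record.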


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCTTCA ACAATACCAA CAAAAAGTCA GCTGCCAAAG TAATTGATGC TGAGCCGGAT 
CCTCAGCTTC ACCATTCTCA TCCTACCCCC GATTCACCCG AATCCCGGCA ACGTAAACGG
GAGCTCTCGG ATGAGTACTT CCTGCTCAAA AACAGGATCC ATACCCGTTT GTTGGACATG
GTAGACCTGT CCATGATCGA CTCTTTGGAA CCCGAGGTCC TGAAAACCCA AATCAGGAGC
CTGGTAACCA AAATCCTGGA TACTGAGGAG CGAAACGCTC CCCTGAATAT GTCTGAAAGA
GAGCGGCTGT TCAGTGACAT TGAGGACGAA GTCATGGGCT TGGGACCGCT GGAGCCCTTT
CTCAAAGACG ATACGGTGGC CGATATTCTG GTCAATACCC ACAATCAGAT ATATGTGGAA
CGTTTTGGCA AGCTTGAACT CTCCGAGTCC ACTTTCAAGG ACGACGCCCA TTTGATGCGC
ATTATCGACA AGATTGTCTC CTCTGTGGGC CGGCGCATCG ACGAATCTTC GCCTATGGTT
GACGCCCGCT TAGCCGATGG CTCCCGGGTG AACGTGATTA TCCCTCCCCT GGCTTTGGAC
GGCCCTGTGA TGTCCATCCG CAGATTTGGC AAAGACCCCT TGAAGATGGA CGATCTGATC
ATGCTCCGTG CCTTTACCCA GGGCATTGGG GAAATCATGA AGGGTATCGT CCGCTCCGAG
CTGAATGTTG TCATTTCTGG GGGAACGGGC AGCGGGAAAA CCACCCTTTT GAATTGCCTC
TCCCAATTCA TTCCGGCCAC TGACCGGATT ATCACTATCG AAGACGCAGC CGAGCTGCAG
CTCAAGCAGG AGCACGTGGT CCGGCTGGAG ACACGTCCGC CGAATATTGA AGGCAAAGGC
GAAGTTACGG CCAGGGAACT GGTCCGTAAC AGCTTACGCA TGCGCCCGGA CCGGATCATT
GTGGGTGAGG TCCGTGGCTC TGAGTCCTTT GACATGCTCC AGGCTATGAA CACCGGGCAC
GACGGTTCTC TGACCACTAT CCACGCCAAC ACCCCCAGGG ACGCTTTGAT GCGCATCGAG
AGCATGGTCT CCATGGCCAA CCTGGACATC CCTATCGAAT TCATGCGCAG ATTCATCGCT
TCAGCTATCC ACATTATCAT CCAGGTATCG AGGTACTCGG ATGGCACGCG GAAGGTGAAC
AGCATCCAGG AAATCACCGG AATGGAAGGC AACGTGATCA CAACCCAGGA AATCTTCTCC
TTCAATCCCA CGGGAGTGGA CGAAAACGGC AAGGTCAAGG GGTACTTCCG GTTCAACGGC
GTCAGACCCC AGTTTGTGGA CAAGTTCCAT CAGGTGGGTA TTGAAGTGGA TCGAGAGATA
TTCAATCCGG ATAAGATTGT TGAGGTCTGA
 
Protein sequence
MFFNNTNKKS AAKVIDAEPD PQLHHSHPTP DSPESRQRKR ELSDEYFLLK NRIHTRLLDM 
VDLSMIDSLE PEVLKTQIRS LVTKILDTEE RNAPLNMSER ERLFSDIEDE VMGLGPLEPF
LKDDTVADIL VNTHNQIYVE RFGKLELSES TFKDDAHLMR IIDKIVSSVG RRIDESSPMV
DARLADGSRV NVIIPPLALD GPVMSIRRFG KDPLKMDDLI MLRAFTQGIG EIMKGIVRSE
LNVVISGGTG SGKTTLLNCL SQFIPATDRI ITIEDAAELQ LKQEHVVRLE TRPPNIEGKG
EVTARELVRN SLRMRPDRII VGEVRGSESF DMLQAMNTGH DGSLTTIHAN TPRDALMRIE
SMVSMANLDI PIEFMRRFIA SAIHIIIQVS RYSDGTRKVN SIQEITGMEG NVITTQEIFS
FNPTGVDENG KVKGYFRFNG VRPQFVDKFH QVGIEVDREI FNPDKIVEV
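
The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how they might be processed, the Python below strips the block spacing, computes GC content, and translates a coding sequence. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon-to-amino-acid mapping is the standard one, which translation table 11 shares for elongation; table 11's alternative start codons are not handled here, which is sufficient for this gene because it begins with ATG.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; '*' marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used to print sequence blocks."""
    return "".join(seq_block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(cds: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence listed above.
fragment = clean("ATGTTCTTCA ACAATACCAA")
print(translate(fragment))  # MFFNNT
```

The translated fragment matches the first six residues of the protein sequence above. Passing the full gene sequence through `clean` and then `gc_percent` or `translate` would allow the GC content and protein fields of this record to be checked the same way.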