Gene Dret_1104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1104 
Symbol 
ID8418929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1295864 
End bp1297069 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content56% 
IMG OID645037676 
ProductType II secretion system F domain protein 
Protein accessionYP_003197970 
Protein GI258405228 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1459] Type II secretory pathway, component PulF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.250932 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00579509 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCCGATTT TCATCTACAA GGCGATCAAA GGGAAACGAC GGGTCAAAGG CGATCTGGAC 
GCGGCCAATC TGGAAATGGC GGAAGCCGCA TTGCGGCGTC GTGGCATGAC CAACATCCGG
GTCAAGCCCA AGCCGAAAGA TCTGCTTGAA GGGACCTTTC TGGAAGGCCG GGTCAAAGAC
CGGGATATGG TCATCTTCTC CCGGCAATTC GCGACCATGA TCGATTCCGG AGTGCCCATC
CTGCAGGCGT TGCAGGTCAT GTGCGAGCAG ACGGAAAATG ACAAGCTCCG GCGCAAGCTC
TACGAAGTGC GCAACGACAT CGAGGGCGGG AGTTCCTTGT ACGAGGCCTT GCGCAAGCAC
CCGGATGTTT TTGACGATCT GTACACAAAC ATGGTCAATG CCGGGGAAAC CGGCGGCGTT
TTGGACGTGA TCCTGGATCG CTTGGCGACA TATATCGAAA AGGCGGCCAA TCTGAAATCG
AAGGTCAGGG CCGCGCTCAT TTATCCAGGA GTTGTCTCCT TTGTGGCCGT CGCAGTCATC
GCCATCATCC TGATCTTTGT CATCCCGACC TTCGAGCAAC TTTTCAATGA TTTCGGGAGC
GGCCTGCCCA CGCCGACCAA ACTGGTGATC GGATTGAGCC GTTGGGTCAA GGGCAATCTC
CTCTGGTTGA TCCTGGGGTT GGTGGCGGCG CTAATCGCCT TCCGGTTTTT TTACCGCTGG
GAACGAGGCC GGACCATGGT GGACCGCTTC TTCCTCACGG TGCCGGTCTT CGGTCCCTTG
ATGCGAAAAT TTGCCGTGGC CCGTTTCAGC CGTACCTTCA GCACTATGGT TTCCAGCGGG
GTTCCGATCC TGCAAGCCTT GGACATTGTG GCCAGGACCT CTGGCAACAA GATTGTCGAG
TCCGGGGTGA ATGAAGCGCG TACGTCCATT GCTGAAGGGC AAACCCTGGC CGATCCCCTT
GATGCCACCG GGGTTTTCCC GCCCATGGTC ATCCACATGA TCTCCATTGG CGAAACCACA
GGGTCACTGG ACACCATGCT CGGCAAAATC GCCGATTTCT ATGATGACGA GGTCGATGTG
GCGGTGACGA CCCTGACCTC GTTAATCGAA CCCATCCTGA TTGTCTTTTT AGGGGTCATT
GTCGGTGGCC TCGTGGTCAG CATGTACCTG CCCATTTTCA AGATCGCCGG GACTGTGGCC
GGATAG
 
Protein sequence
MPIFIYKAIK GKRRVKGDLD AANLEMAEAA LRRRGMTNIR VKPKPKDLLE GTFLEGRVKD 
RDMVIFSRQF ATMIDSGVPI LQALQVMCEQ TENDKLRRKL YEVRNDIEGG SSLYEALRKH
PDVFDDLYTN MVNAGETGGV LDVILDRLAT YIEKAANLKS KVRAALIYPG VVSFVAVAVI
AIILIFVIPT FEQLFNDFGS GLPTPTKLVI GLSRWVKGNL LWLILGLVAA LIAFRFFYRW
ERGRTMVDRF FLTVPVFGPL MRKFAVARFS RTFSTMVSSG VPILQALDIV ARTSGNKIVE
SGVNEARTSI AEGQTLADPL DATGVFPPMV IHMISIGETT GSLDTMLGKI ADFYDDEVDV
AVTTLTSLIE PILIVFLGVI VGGLVVSMYL PIFKIAGTVA G