Gene Dret_1994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1994 
Symbol 
ID8419839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2290231 
End bp2291730 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content45% 
IMG OID645038582 
Producttype II and III secretion system protein 
Protein accessionYP_003198856 
Protein GI258406114 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4796] Type II secretory pathway, component HofQ 
TIGRFAM ID[TIGR02515] type IV pilus secretin (or competence protein) PilQ 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAGGT CAACCACAAT ATGCATTGCA ATATTCACAA TTCTGCTTTT GTCCGGATGC 
GCTACCACCG ATCAGGAAAA GGCACCTGAA TTTATAGATA AGTGGCAGAA ACTGGCGGAA
AAATCCAAAG GCCATTCGCC TGTACCCCAA GAAGCGAATA CTCATGAGCC TGATACCTTT
ACCAGTTTAG AGCGTCATCA ACAAGAAAAA CACAAATTGC CTCGGGAGAA AGTCTCTTTA
AAATTTCGTG ATAACAAAAT ACAAGTTATT TTACGTACTC TAGCATCGGC AGCTAGCCAA
AATATTGTCA TGAGCAATAA TATTAGTGGC ACCATGAGCT TAGATGTCAA AGATATTCCT
TGGAGCCAAG CGTTCTTGAG TGTTATTACC ACCAACGGCC TGACTTATTC CTGGCAGGGA
GATATAATTC AAGTTCAAAG CCCAAAAGAT ATGCAAATGG AGAAAGAACT CCAGCAAATC
CAAAAGGAAA CCCAGATCCT GCAAACGACT GTTGTCGATA TTGACTATGC CCACATTGTA
GACAAAGGCG TAAAGAGCGG AAATACCAAT GACAACGGCA ATCTGGATCA ACTGGAAAAG
ACCTTGCGGG AAGTTCTAAA AAATGCAAGT GGAGGAAGCA AGGAAGGGAC GTTGTTTGTG
GACAGGGAAA ACAACGCCCT AATTATCCAA GCCACGAAAG AGGACACTCA GCGCATTCTT
CACGTTCTCA ATCATTTGGA ACGCCCCAGA AAACAAATCC ATATTGAAGC CAGTATCGTA
GAGGCCACCC AAAATACCGC TCGAGAATTA GGCATGCGCT GGAGAGGAAG GTATGTGACG
TCAGGAAGGG GAATTGAAGA TGTGGGCATC ATAGGCGATG CACAAGAACC CGAGGACTGG
GGATCTGCTA TCACCACCCT TCCAGGTAGC GGAACGGATA CACTCGGTGG TTTAAAATTA
GGGACAGTCG TCGGAGAAAT TGCCGGAAAC GTATTATTTT CTCAGCTTCA AGCTTTGGAA
AAAGAGGGAC AAGTCAATAT CTTGGCTAGT CCATCCCTGA CCACTATGGA TAATCAAAGC
GCCTCAACCC AGCACGGAGA GAGAGTGCCT TACGAAACCA CTGATGAAGA TGGTGATCGT
GTCGTTAAAT TTGAGGATGT GGCAATGGGC CTAAAAGTTC ACCCCAGAAT AATTGAAGGG
GATTTGATGG CTATGGACAT TGTTGTCACC AAAGACGAAG TGGATTTTTC CCAGAATGTC
CAAGGCAATC CCTTGATCCG AACCAAAGAG ACGGAAACCA ACCTCTTGGT CCGCAACGGC
GAAACCATCG TCATATCAGG CTTATCAAAG CAAACCGTCA GTGGCACTGA ACATGGAGTC
CCTGGGCTCA GAAAAGTGCC TGGTCTGAGT TGGCTATTCA AGGGTATAGA TAAAAGTGAA
GATATGGAGG AGTTCATGGT TTTCATCACT CCCACCATTT TGGATCAACC AGGATCATGA
 
Protein sequence
MLRSTTICIA IFTILLLSGC ATTDQEKAPE FIDKWQKLAE KSKGHSPVPQ EANTHEPDTF 
TSLERHQQEK HKLPREKVSL KFRDNKIQVI LRTLASAASQ NIVMSNNISG TMSLDVKDIP
WSQAFLSVIT TNGLTYSWQG DIIQVQSPKD MQMEKELQQI QKETQILQTT VVDIDYAHIV
DKGVKSGNTN DNGNLDQLEK TLREVLKNAS GGSKEGTLFV DRENNALIIQ ATKEDTQRIL
HVLNHLERPR KQIHIEASIV EATQNTAREL GMRWRGRYVT SGRGIEDVGI IGDAQEPEDW
GSAITTLPGS GTDTLGGLKL GTVVGEIAGN VLFSQLQALE KEGQVNILAS PSLTTMDNQS
ASTQHGERVP YETTDEDGDR VVKFEDVAMG LKVHPRIIEG DLMAMDIVVT KDEVDFSQNV
QGNPLIRTKE TETNLLVRNG ETIVISGLSK QTVSGTEHGV PGLRKVPGLS WLFKGIDKSE
DMEEFMVFIT PTILDQPGS