Gene Dret_1991 details


Gene Information

Locus tag: Dret_1991
Symbol:
ID: 8419836
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 2287245
End bp: 2289110
Gene Length: 1866 bp
Protein Length: 621 aa
Translation table: 11
GC content: 46%
IMG OID: 645038579
Product: Tfp pilus assembly protein ATPase PilM-like protein
Protein accession: YP_003198853
Protein GI: 258406111
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4972] Tfp pilus assembly protein, ATPase PilM
TIGRFAM ID:
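
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and do not come from the source database.

```python
# Values copied from the record above.
START_BP = 2287245
END_BP = 2289110
GENE_LENGTH_BP = 1866
PROTEIN_LENGTH_AA = 621


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the span covers 1866 bp, and 1866 / 3 = 622 codons, one of which is the stop codon, leaving 621 residues.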


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCCAATG AAAAAGATGT ATCTTCAACG GAAAAACTTC TACACGTTAT CCGCAACAAG 
CCTGAAGACG AAACTGTTGC TGTGTTCGAA TCACAAGCTG TCCACAATAA GCAAAAAAGA
ACATCGAAGA AGTCAAAAAG AAATCACTTC GTGCCTTTCA AGCCTAAAGA AACAATCGGT
GTTGAGATTC AAGACACGCA CTTAAATGTA GTTAGAATGG CGCAATCAGG CCATGGCTGG
GAGGCGATCC AGGCCCTCAC TATCCCTATG CGGGAAGAAA TGGCTTTAGA TAGCCCCGAA
TTCAGGTCCT TTCTCAAAAG CCAACTACAA AGCATAGAAG GAATAAAAAA AGCTCATGTT
TGGGCCATGC TTTCGGCTTC CCAAGGAGAA ATCTGGCATA CAAAAATTCC CCTTATGAAG
AAAGGTCTGG CTAACGCTGT CTTTTGGGCT GCGAAGAGAG ACAATGATTT CGAAGAAGAC
AAGGTGGTCT TTGACTATCG AATTCAAAAC CAGATTGCCG ACAAAGACGT CAAAAAATAT
TGGGCGATGG TTTCTACCTT CCCCAAGAAT ACAGTCAAAA ACCTGAAGGA CGTATTCTCT
AAGTCCGGAG TAGACTTAGA GGGCATCACC CTTCCGGCTT TCAGCCTGCA AAACTTGTTC
ACTCACAATT GGGTTGACTC CAAAGAAAAA CCCTTCGCTG TATTACATAT CTACCAGCAT
AACTCCTCGA TCAACATTTA CGACAGCGAC AACAAACTGC TTCTGAGTCG TACGACCAAA
ACCGGCCTGG AAAGTATGCT GGAATCTATG ATTCAGGAAC AGGAGGTATC AGCTCCTGAA
GTCCACGTCC TGGGAGATTC TGATAATTTC CCTCCTCAAT ACAACAGTCC AACGGACAAG
AAGCAAGCCT TAAAACTTAT TAACTGGCTT GAAACCGGCA GCAAATCACA CCCTGACGAT
CACCAAGATA GACAATACCA CCCTCAAAAG ATTCTGGACA TGATCGCTCC CGCCATGGAG
CGGCTAGCTC GACAAGTAGA GAGGACCATT GCTCATTCAA TAAACGTTTT AGGCAATCCA
GCCCCTGCTC GAATTTATCT TGGTGGACGC TTGATCCCGG CTGAGTCGGT CACTGCTTTC
TTGCAAGACC AATTAGGGCT AGAAGTCCAG GTCCTGGAAC CTCTCACCCC CACCAGAAAC
AACATCTCCT CCCCCATCAG CTCCCTGAAC AAGGAAGAGC GCATTTTTTT AACCAGCACT
ACTGGCTTGG CTTTGTCCAG CACAGAGCAG ACCCCAAATT TTTTAAACAC AGCCAAGAAT
CAGGAAAAGC AAAGGGTCGC AAAACGCAAT GCCACATTGG TCGCTTCAGC AGTGATCGCG
ATCTATTTGC TTATAGGGGG GTATTGGGTG CAACTTAATA ATGAGCTGGC CCAGGTCAGA
CAGAAAGTAT TCAGCTTGAA CCACAGGCTC GAAGAATTTT CACCCCTTAT TTCTCAAAAA
ATGATCAAAG ACATGATTGC TCAGGTTCAT AAGGATAGTT CGACACTTCA GGACCACAGC
CACGATCTCC TGCCAGTAGC GGCTATGAGA GACGTCATCT CTGCTACTCC AGAGCGTGTC
AGGCTGTTCA AAATTCGCAT GGAAACTGGC AAACCGAAAT CTGGCCAGGA CGTCCAAGTG
CTGCTTAAGG GCTATATAAT TGGCGAAGAA AAACAGCTCC AAACATATTT AGCAAGTTAC
CTCTATCGCC TGCGTCAATC TCCAGTATTC AACAAAACAA CACTCCAAGA GAGCTCAATC
CAAGAAGTGA GGACGCTAGG AAAAGTACTT GACTTCGTGA TCAAAGTGAA TCTGGAGCAA
ATATAA
 
Protein sequence
MANEKDVSST EKLLHVIRNK PEDETVAVFE SQAVHNKQKR TSKKSKRNHF VPFKPKETIG 
VEIQDTHLNV VRMAQSGHGW EAIQALTIPM REEMALDSPE FRSFLKSQLQ SIEGIKKAHV
WAMLSASQGE IWHTKIPLMK KGLANAVFWA AKRDNDFEED KVVFDYRIQN QIADKDVKKY
WAMVSTFPKN TVKNLKDVFS KSGVDLEGIT LPAFSLQNLF THNWVDSKEK PFAVLHIYQH
NSSINIYDSD NKLLLSRTTK TGLESMLESM IQEQEVSAPE VHVLGDSDNF PPQYNSPTDK
KQALKLINWL ETGSKSHPDD HQDRQYHPQK ILDMIAPAME RLARQVERTI AHSINVLGNP
APARIYLGGR LIPAESVTAF LQDQLGLEVQ VLEPLTPTRN NISSPISSLN KEERIFLTST
TGLALSSTEQ TPNFLNTAKN QEKQRVAKRN ATLVASAVIA IYLLIGGYWV QLNNELAQVR
QKVFSLNHRL EEFSPLISQK MIKDMIAQVH KDSSTLQDHS HDLLPVAAMR DVISATPERV
RLFKIRMETG KPKSGQDVQV LLKGYIIGEE KQLQTYLASY LYRLRQSPVF NKTTLQESSI
QEVRTLGKVL DFVIKVNLEQ I
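
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, where GTG can act as an alternative start codon that is read as methionine at the initiation position. The sketch below illustrates this on the first six codons only. The codon table is deliberately partial (just the codons needed here), the start-codon set is a subset of those defined for table 11, and the function name `translate_prefix` is hypothetical.

```python
# Partial codon table: only the codons that occur in the first 18 nt of the gene.
CODON_TABLE = {
    "GTG": "V",
    "GCC": "A",
    "AAT": "N",
    "GAA": "E",
    "AAA": "K",
    "GAT": "D",
}

# Subset of the start codons allowed by translation table 11 (assumption:
# only the most commonly cited ones are listed here).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(dna: str) -> str:
    """Translate a short CDS prefix, reading a start codon in first position as M."""
    seq = "".join(dna.split()).upper()  # drop the spacing used in the listing
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
        else:
            residues.append(CODON_TABLE[codon])
    return "".join(residues)


if __name__ == "__main__":
    # First 18 nucleotides of the gene sequence, with the listing's spacing kept.
    print(translate_prefix("GTGGCCAATG AAAAAGAT"))
```

For a full translation, a complete table 11 implementation from an established library would be the more robust choice; this sketch only covers the codons shown.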