Gene Dgeo_1725 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_1725 
Symbol 
ID4058345 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp1833429 
End bp1834607 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content65% 
IMG OID641230748 
Producttype IV pilus assembly protein PilM 
Protein accessionYP_605189 
Protein GI94985825 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4972] Tfp pilus assembly protein, ATPase PilM 
TIGRFAM ID[TIGR01174] cell division protein FtsA
[TIGR01175] type IV pilus assembly protein PilM 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0148781 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAGCT TGTTCCAACG CCTACTGAAT CCGCGCCCGA GCGCCATCGG TGTGGAAATC 
GGCACCAGCA CCATCAAGGT GGTGGCCCTG CGCCCCGGCA CGCCGCCGGT CCTCCAGCAT
GCGGTGATGG TCCCCACGCC CATCGGCAGC ATGCGCGACG GCCTGGTGAT CGAGCCGCAG
GCGGTCGCCA ACGAGCTGAA GAACCTGCTG GCCGAACACC GCATCACGGC CCGCCACGCC
GTCACCGCCG TGCCCAACCA GTCGGCGGTG ACCCGCAACA TCATGGTTCC GCGCATGGAG
CGCAAGGATC TCCAGGAAGC CATCAAGTGG GAGGCAGAGC GCTATCTCCC CTACCCCATC
GATGAGGTCA ATTTGGATTT TGACCTGCTC GACGATCCCG CCACCATCCC CGAAGACGGG
CAGATGGAGG CGGTGATTGC TGCCGCTCCC AGCGAAGCGG TGGCGCGCCA GGTGGAGGTG
CTGCGGCTGG CAGGCCTGGA ACCCATCATC GTCGACCTCA AGAGCTTCGC GGCACTGCGT
GCCCTGCGCG GCAACCTGCT GGGCGAACAC CTCAACAAGA CCACGCTGGC GGGCCTGAAC
TACACCGAGG CAGGCGAGGT GGCGCTGGTT CTGGAGATCG GCGCAAGCAG CAGCGTGATC
AGTCTGGTAC GTGGCGACCG CATCCTCATG GCACGCAACA TCGCCATTGC CGCCGACGAC
TTCACCACCG CACTGCAAAA AGCCTTTGAC CTGGACTTCA GCGCTGCGGA GGAGGTCAAG
CTGGGCTACG CGACCGCTAT CACGCCCACC GAGGACGAGG AGGCCTTGCT GGATTTCGAC
CGCGCCCGCG AGCAGTACAG CCCGGCGCGC GTGTTTGAGG TGATCCGTCC GGTGCTGGGC
GACCTGATCA CCGAGATTCG CCGCTCGCTG GAGTTCTACC GGGTGCAGTC GGGCGACGTG
GTGATTGATC GGACCTTTAT CGCGGGTGGC GGTGCCAAAC TGCGTGGCCT CGCCAATGCG
ATTAGCGATG CGCTGGGGTT CCGGGTGGAG GTCGGCAGTC CCTGGCTGAC GGTGCAGACC
GAACAGGCCA ATGCCGATAC CGGCTATCTC CAGGCCAACG CACCCGAGTT CACCGTGCCG
CTGGGGCTGG CGCTGCGGGG AGTCCAGGGT CATGGTTGA
 
Protein sequence
MSSLFQRLLN PRPSAIGVEI GTSTIKVVAL RPGTPPVLQH AVMVPTPIGS MRDGLVIEPQ 
AVANELKNLL AEHRITARHA VTAVPNQSAV TRNIMVPRME RKDLQEAIKW EAERYLPYPI
DEVNLDFDLL DDPATIPEDG QMEAVIAAAP SEAVARQVEV LRLAGLEPII VDLKSFAALR
ALRGNLLGEH LNKTTLAGLN YTEAGEVALV LEIGASSSVI SLVRGDRILM ARNIAIAADD
FTTALQKAFD LDFSAAEEVK LGYATAITPT EDEEALLDFD RAREQYSPAR VFEVIRPVLG
DLITEIRRSL EFYRVQSGDV VIDRTFIAGG GAKLRGLANA ISDALGFRVE VGSPWLTVQT
EQANADTGYL QANAPEFTVP LGLALRGVQG HG