Gene Dret_2203 details


Gene Information

Locus tag: Dret_2203
Symbol:
ID: 8420059
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 2505665
End bp: 2506642
Gene Length: 978 bp
Protein Length: 325 aa
Translation table: 11
GC content: 57%
IMG OID: 645038802
Product: type IV pilus assembly protein PilM
Protein accession: YP_003199065
Protein GI: 258406323
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4972] Tfp pilus assembly protein, ATPase PilM
TIGRFAM ID: [TIGR01175] type IV pilus assembly protein PilM
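
The coordinate and length fields in this record are linked by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end coordinates are inclusive (which is consistent with the reported gene length); the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2505665
end_bp = 2506642

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# One codon per 3 bp; the final codon is the stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")   # 978 bp
print(protein_length, "aa")  # 325 aa
```

Both values agree with the Gene Length and Protein Length fields listed for Dret_2203.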


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGCCCG GTAAGGCCGG CCCGCAGCTG GAACGACTCG GCCGGGTACC CTGGCCGCGA 
CAGGACAACG AGGCCACGGA CGCCAAGGCC GAACGCCTCC GCCAATTGTG GCAGGCGCTG
GAACTCAAAG ACAAAGTCGT CACTTCCTCA ATGGCCGGAC ACGCCGTGAT TGTGAAGCGC
GTGCGCTTCG CCAAGGACAG GATCCGACAC CTCGCCGCCG AAGTCCAGAA AGAGGCGAAG
CAATACATCC CCTTCGATAT CAACGACGTC TACCTCGACT TCCAGGATCT GGGACCTGAA
TCGGAACAGG CGGGGTTCCA TCAGGTTTTG CTGGTGGCCA GCAAGAAAAA GATGGTCCAC
GAGGTCCAAA ATGTGCTCTC GGCAGCCGGG CTGGGATTGT CGGTTCTGGA TGTCGATGCC
TTTGCGCTGA CCAATTGTTT TACCTTCAAT TATCCTGAGT GGAGCGACAA ACCGACCTAT
CTGCTCGATA TCGGCGCCCA GCAGTCCGTC TTTTGCGTTT GTGCTCAAGG GCGTCCTCTG
TTTTTACGCG AAATCGCATT TGGCGGACAT CAGATCACCG AACGGTTGGC GCGGACGTTG
GAGATTACCA AAACCGAGGC TGAAAAACTC AAAGTCAACG GTCCCAAGGA GGAGGACGCG
AGCAATATCG CCACCGTCCA GGATGTCTTG AATAAGGTGT TTGCCGATTG GGCCCAGGAA
ATCCAGCGCA TGCTCACTTT TTACCAATCC TCGGAAAGCG GCGGATTGAC GTCGACGCGG
ATGCTCCTAT CCGGCGGCGG AAGTCTTATT TCCGGTTTAC CTGAGCGGTT TGCCGAACGA
TTGGAGATGG AGGTCGGGCT TCTCGATCCT TTCCGGCGGA TCAATATCTC GCCGAATCTT
TTCGATCGAA ATTATCTGAC TCGCACCGGG CCGCAGTTTG CGGTGGGCAC GGGGCTTGCC
CTGCGACAAG CCGTATAG
 
Protein sequence
MVPGKAGPQL ERLGRVPWPR QDNEATDAKA ERLRQLWQAL ELKDKVVTSS MAGHAVIVKR 
VRFAKDRIRH LAAEVQKEAK QYIPFDINDV YLDFQDLGPE SEQAGFHQVL LVASKKKMVH
EVQNVLSAAG LGLSVLDVDA FALTNCFTFN YPEWSDKPTY LLDIGAQQSV FCVCAQGRPL
FLREIAFGGH QITERLARTL EITKTEAEKL KVNGPKEEDA SNIATVQDVL NKVFADWAQE
IQRMLTFYQS SESGGLTSTR MLLSGGGSLI SGLPERFAER LEMEVGLLDP FRRINISPNL
FDRNYLTRTG PQFAVGTGLA LRQAV
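
The protein sequence is the translation of the gene sequence under translation table 11, and the GC content field is the share of G and C bases in the gene sequence. The sketch below shows one way to recompute both from the text of this record. It is an illustration, not the database's own pipeline: the helper names `clean`, `gc_percent`, and `translate` are hypothetical. Table 11 uses the same amino acid assignments as the standard code and differs only in the start codons it permits, so the sketch does not special-case alternative start codons (this gene begins with ATG).

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, with codons
# enumerated in the order T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGGTGCCCG GTAAGGCCGG CCCGCAGCTG"))  # MVPGKAGPQL
print(translate("CTGCGACAAG CCGTATAG"))               # LRQAV
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison against the protein sequence and GC content fields of the record.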