Gene GM21_2669 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2669 
Symbol 
ID8138011 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3106162 
End bp3107214 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content59% 
IMG OID644870273 
Producttype IV pilus assembly protein PilM 
Protein accessionYP_003022463 
Protein GI253701274 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4972] Tfp pilus assembly protein, ATPase PilM 
TIGRFAM ID[TIGR01175] type IV pilus assembly protein PilM 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.0000583624 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTTTTCT CAAAGAAGAA GGACATAGTA GGGGTCGACA TCGGATCCAG CGCAGTGAAG 
CTCGTGCAGC TGCGCCCGGG TAAGGGTGGG TACCAGCTGG TGAAGATCGG CATCTCGCCG
CTTCCCGCCG AGGCCATCGT CGACAACACC CTCATGGACA GCTCATCCAT CGTGGAGACG
GTGAAACAGC TTGTTTCCGG CCTGGGAGTG AAGGCCAAGG AAGTGGCGTG TTCCATCTCC
GGCAACTCGG TGATCATCAG GAAGATCTCG CTCCCGGTGA TGCCGGTGGA AGAGCTGGAG
GACCAGATTC ACTGGGAGGC TGAGCAGTAC ATTCCATTCG ACATCAACGA TGTCAACGTG
GATTTCCAGA TACTGTCTCC CGACGAGCAG GACCCCTCGA AGATGAATGT TCTTCTGGTG
GCCAGCAAGA AGGACATCAT CAACGACTAC CTCTCGGTGT TCGCTGAGGC CGGGCTCAAG
CTGGTGGTCG TGGACGTCGA CTCCTTCGCC GTCCAAAACG CTTACGAGGC GAACTACCCG
GCGGACCCTG ACGAGGTTGT TGCCCTTGTC AACATCGGCG CCAGCATCTT CAACCTCAAC
ATCATCCGGG ACGGGGTTTC GCTGTTCACC CGCGACGTGC AGATGGGGGG GAACCTCTAT
ACCGAGGAGA TCCAGAAGCA GTTCGGGATC AACAGCGAGC AGGCGGAGCA AATGAAGCTT
TCCATCGCCG GTAACGAAGA CCAGAGGCTG GCCGAGACGC TGCAACGGGT GAACGAGACC
ATCGCCCTGG AGATGCGCCG TTCCCTGGAC TTCTACAATT CCACGGCGGG CGAGGGGAGG
ATCACCAAGG TGTATCTCTC CGGCGGCGCC GCGAAGACGG CATTCCTCAT GGAGGCCGTC
CAGCAACGGC TGGCGTTGCC GGTGGAGCTC CTCAACCCGC TCTTGAAGGT CGCTGTGAAC
GAGAAGGAAT TCGACCGCAA GCACCTTGAA GAGATCGCGC CGCTGATGAC GGTAGCGGTG
GGGCTTGCGA CGAGGAGGGT CGGGGACAAA TGA
 
Protein sequence
MLFSKKKDIV GVDIGSSAVK LVQLRPGKGG YQLVKIGISP LPAEAIVDNT LMDSSSIVET 
VKQLVSGLGV KAKEVACSIS GNSVIIRKIS LPVMPVEELE DQIHWEAEQY IPFDINDVNV
DFQILSPDEQ DPSKMNVLLV ASKKDIINDY LSVFAEAGLK LVVVDVDSFA VQNAYEANYP
ADPDEVVALV NIGASIFNLN IIRDGVSLFT RDVQMGGNLY TEEIQKQFGI NSEQAEQMKL
SIAGNEDQRL AETLQRVNET IALEMRRSLD FYNSTAGEGR ITKVYLSGGA AKTAFLMEAV
QQRLALPVEL LNPLLKVAVN EKEFDRKHLE EIAPLMTVAV GLATRRVGDK