Gene GM21_0067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0067 
Symbol 
ID8135366 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp84881 
End bp85951 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content63% 
IMG OID644867684 
Producttwitching motility protein 
Protein accessionYP_003019912 
Protein GI253698723 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2805] Tfp pilus assembly protein, pilus retraction ATPase PilT 
TIGRFAM ID[TIGR01420] pilus retraction protein PilT 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value2.78052e-25 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAAGGA TAGACGCACT GTTCAAGCTG TTGCACGAAG CCGGGGCCTC CGACCTGCAC 
CTTTCCGCCG GGTCCCAGCC GATCTTCCGG CTGCGCGGCG AGATGGAGCG GCAGAACTTC
AAGTCGCTTG GGCACGAGGA ACTGAAGGCG CTCCTTTACG AGATCCTGAC CCCGAAGCAG
CGCGAGACCT TCGAGGAGAA GCACGACCTC GACTTCGCCT ACTCGGTCCC GGGCCTGGCG
CGCTTCCGCG GCAACTACAT GATGCAGCAC CGGGGGATCG CGGCGGTGTT CCGCATCATC
CCGAGCAAGA TACTTTCCGC CGACGAGCTG GGGCTTCCGG AAGGGATCCG CAACCTGACC
AAGCTGAGGA AGGGGCTGGT GCTGGTCACA GGTCCCACGG GGAGCGGCAA GTCGACGACG
CTCGCCGCGA TGATCGACCT GATCAACTCT ACCCGTAGGG AGCACATCCT GACGCTCGAA
GACCCGCTGG AGTTCATCCA CGAAAACAAG ATGTCCCTCT TCAACCAGCG CCAGATCGGC
GAGCATTCCG ACAGCTTCGC CAGCGCGTTG AGGGCGGCCC TCAGGGAGGA CCCGGACGTG
ATCCTGGTGG GCGAGATGCG CGACCTTGAG ACCATCGCCC TCGCCATGAG CGCCGCGGAG
ACCGGGCACT TGGTGTTCGG CACCCTGCAC ACCAGTTCCG CCGCGAAGAC GGTGGACAGG
ATCATCGACG TCTTCCCCAA GGACGGCCAG GAGCAGGTGC GCGCCATCCT TTCGGAATCG
CTCCGGGGGG TGGTCTGCCA GCAGCTCCTG AAGACGGCCG ACGGCAAGGG GAGGGCGGCC
GCGCAGGAGA TCATGGTCTG GAACAACGCC ATCGGGAACC TGATCCGCGA AGGGAAGACC
TTCCAGATCC CCTCCATCAT GCAGACCGGC AAAAAGGACG GGATGCAGCT CATGGACCAG
CACATCCTCG ACCTCTTGAA GACCAGGAAA ATCACACCGG AGGAGGCGTA CCGCTGCTGT
CAGGACAAGA GGCAGTTCGA GCAGTACCTC CCGGCGCAGG CGGAGCATTA G
 
Protein sequence
MARIDALFKL LHEAGASDLH LSAGSQPIFR LRGEMERQNF KSLGHEELKA LLYEILTPKQ 
RETFEEKHDL DFAYSVPGLA RFRGNYMMQH RGIAAVFRII PSKILSADEL GLPEGIRNLT
KLRKGLVLVT GPTGSGKSTT LAAMIDLINS TRREHILTLE DPLEFIHENK MSLFNQRQIG
EHSDSFASAL RAALREDPDV ILVGEMRDLE TIALAMSAAE TGHLVFGTLH TSSAAKTVDR
IIDVFPKDGQ EQVRAILSES LRGVVCQQLL KTADGKGRAA AQEIMVWNNA IGNLIREGKT
FQIPSIMQTG KKDGMQLMDQ HILDLLKTRK ITPEEAYRCC QDKRQFEQYL PAQAEH