Gene GM21_3815 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3815 
Symbol 
ID8139189 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4386976 
End bp4388175 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content64% 
IMG OID644871434 
ProductTPR repeat-containing protein 
Protein accessionYP_003023592 
Protein GI253702403 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones97 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACGC AAAGAACAGC ATGGGACTAC ATGGACGACA TGTTCAGCAC CCTCGCCTCG 
CAGGAATCGC AGCGGGCGCA GCTGGCCAAC GGGGCCTTGA GTTCGGGCCT CGCCTTCTAC
CAGAAGAAGG ATTACGCCCG CGCCACGAGT GAGTTGAAAC GGGCCATCTC CATGGACCCG
ACCAACACCC AGGCCTACAA GTATCTGGCC GGCGCCTACC AGGCCCAGGG AAAGACCGAC
GAGGCGATCA AGACCTACAA GTACTCGTTG GCGCTCGACC CGACGCAAGC CTCCGTCCAC
ACCAGCCTGG GCAACGTCTA CCTGCAGCAG AAGAAGTACA ACCTGGCCGA AAGGGAGTTC
AAGGACGCCG GCAAGCTCGA CCCCACCGAC ACCCTCGCCC CCTACACGCT GGGGCAGCTC
TACGTGCAGA CCGAGCGCTA CGGCGAGGCG GAGGCGCAGT TCAAGAAGGT CTCCAGGATG
GCCCCCACCG ACCCCAACCC CTACTATTCC CTGGGCGCCG TCTACAACAA GGAAGGGAAG
TACGCCGACG CGGTGAAGCA GCTGACGCAG GCCGTGAAAC TGCGCCCCAA GATGGAGGCT
GCTCATTTCG AGCTAGGCGT CGCCTATGCC GCCTTGGGCG ACACCACCAA CGCGCAGAAA
GAGGTGGATA CCCTCACCAG GCTGAACGCG GCACAAGGGG CCCTGCTCCA GCTGACCATC
GCCCAGCCGA AGTTCGTCGC CGCCGGAGGG GGTGAGACCG ACACCTTCAC CGCCGCCCTG
GGCGCCGGCA CCGATCTCGG TTTGACGATG CTCGGCGCCG ACCCGGTGAC CACCCAGCCG
ATACAGTCCA AGCAATTCAG CCTCACCTTC TACTTCGACT CCGCCATGGA CGCCGCTTCG
GTGCAGGACA CGAGCAACTG GACCATCAGC AAGGCGACGG GGGGTGCGGC AGGCTACTAC
AACAACCTGC AGCCGGTGGT CCCGACCGAG GCCTACATCC CGCAAAACCC TACCAGCGTC
ACCTACGACC CCGAAAAAAG GAGTGCGACA GTCACCTTCC TTTTGAGCCA GAACGACACC
GGCGACGCCA CCATCGACCC ATCCCACATG GTCTTCAAGT TCTCCGGCAC CGATGCCAGG
GGGAAGGTGA TGGACCCCGC AGCCGACGAG TTCGACGGCG CGGCGGAAGC GCCTTTCTGA
 
Protein sequence
MATQRTAWDY MDDMFSTLAS QESQRAQLAN GALSSGLAFY QKKDYARATS ELKRAISMDP 
TNTQAYKYLA GAYQAQGKTD EAIKTYKYSL ALDPTQASVH TSLGNVYLQQ KKYNLAEREF
KDAGKLDPTD TLAPYTLGQL YVQTERYGEA EAQFKKVSRM APTDPNPYYS LGAVYNKEGK
YADAVKQLTQ AVKLRPKMEA AHFELGVAYA ALGDTTNAQK EVDTLTRLNA AQGALLQLTI
AQPKFVAAGG GETDTFTAAL GAGTDLGLTM LGADPVTTQP IQSKQFSLTF YFDSAMDAAS
VQDTSNWTIS KATGGAAGYY NNLQPVVPTE AYIPQNPTSV TYDPEKRSAT VTFLLSQNDT
GDATIDPSHM VFKFSGTDAR GKVMDPAADE FDGAAEAPF