Gene GM21_3005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3005 
Symbol 
ID8138348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3491964 
End bp3493169 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content62% 
IMG OID644870603 
Producttype II secretion system protein 
Protein accessionYP_003022792 
Protein GI253701603 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1459] Type II secretory pathway, component PulF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value0.00232151 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCCCTCT TTACCTGTAA AATAGGCGCC TCCGACGGCA AGGTCCTGGT CAAGGACCTC 
GACGCGGTCA ACGCCGGCTT GCTTCGGCAG AGCCTGGAAG AGCAGGGTTA CGTCGTCTTC
GGGGTGCGCA AGAAGCCGTT TCAGTTCCTG CTGGAATCGG GTATCGGGCG CAAGAAGATC
GGCAACAAGG AGCTCTTGTT ATTCAACCAG GAACTCCTGG TGCTTCTCAA GGCCGGTCTC
CCCATCCTGC AGGCGCTCGA CACCATCCTG GAGTCGGGAG GGGGCAAGCT CAACGAGATA
CTTTCGGCGA TCCGCGAGGA CGTGAAAGGG GGACTGGCGC TCTCCGCCGC CTTCGAGAAG
TTTCCGAGGG TGTTTCCCCA TCTCTACATC GCGTCGGTCC GGGCCGGCGA GAGGACCGGG
GACCTGCCCC AGACCATCCG CCGTTACATC GCCTTCCTCA AGAGAACCGA GGGTTTCCGC
GGCAAGATCA TCGGCGCGCT CATCTATCCC GTCATCCTGA TCGCGGTCGC GGCGGTGGCG
ATCTCTCTTT TGCTCATCTA CGTGGTGCCG ACCTTCAGCA CCATCTACGC GGATTCCGGC
GCCGCTTTGC CGATTCCGAC CCAGATACTG ATCAACTTCA CCGGGCTCTT GCGGCGCTAT
CTGCCGCTGC TTCTGCTGCT GGCGGCCGTG GCGACGACGC TCTTCAAGCG CTGGAGCCAG
ACCGAGTCCG GGCGCTATGC CGTTGACGGC TTCAAGATCA AGACCCCGCT TCTGGGCGCC
ATCACCAGCC GATACGCCCT GGCCGGCTTT ACCCGCACCC TGGCCACGGT GCTCGGCTCC
GGCATCCCGA TCGTCGAGGC GCTGCGGATG TCGGTGGGGA CGCTCAACAA CAAGGTGCTG
GAGCGCGGTC TGCTCCTGGC GGTACACCGC GTCGAGGAGG GGAGCAAGCT TTCCACCGCG
CTGGAAGGGA TGAAGCTGAT GCCCCCCCTG GCGCTGCGCA TGCTCACCGT AGGCGAGACC
ACCGGCTCCC TGGAGGAGAT GCTTTCCGAC ATCTCCGATT ACTTCGAAGA GGAGATCGAA
AGGGATCTCC ATGTACTGAC CACCTCCATC GAGCCGGCGA TCATGGTGGT CATGGGTGTG
GTCATCGGGG TCATCATCGT CACCATGTAC CTGCCGATCT TCAAGATCGC CAGCACCGTC
AGCTAG
 
Protein sequence
MPLFTCKIGA SDGKVLVKDL DAVNAGLLRQ SLEEQGYVVF GVRKKPFQFL LESGIGRKKI 
GNKELLLFNQ ELLVLLKAGL PILQALDTIL ESGGGKLNEI LSAIREDVKG GLALSAAFEK
FPRVFPHLYI ASVRAGERTG DLPQTIRRYI AFLKRTEGFR GKIIGALIYP VILIAVAAVA
ISLLLIYVVP TFSTIYADSG AALPIPTQIL INFTGLLRRY LPLLLLLAAV ATTLFKRWSQ
TESGRYAVDG FKIKTPLLGA ITSRYALAGF TRTLATVLGS GIPIVEALRM SVGTLNNKVL
ERGLLLAVHR VEEGSKLSTA LEGMKLMPPL ALRMLTVGET TGSLEEMLSD ISDYFEEEIE
RDLHVLTTSI EPAIMVVMGV VIGVIIVTMY LPIFKIASTV S