Gene GM21_3204 details


Gene Information

Locus tag: GM21_3204
Symbol:
ID: 8138556
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3715997
End bp: 3717352
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 62%
IMG OID: 644870809
Product: type II and III secretion system protein
Protein accession: YP_003022989
Protein GI: 253701800
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4964] Flp pilus assembly protein, secretin CpaC
TIGRFAM ID:
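
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the record above.
START_BP = 3715997
END_BP = 3717352
GENE_LENGTH_BP = 1356
PROTEIN_LENGTH_AA = 451


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: one per codon, minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the span from 3715997 to 3717352 covers 1356 bp, and 1356 bp corresponds to 452 codons, that is, 451 residues plus one stop codon.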


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 66
Fosmid unclonability p-value: 0.154583
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGATGCA TCCCCTCAAA ACCGTGCCTA GCCTTCCTGC TTTTCTGCCT CGGCCTTCTT 
ATCGAGGCGA CGAACGCGAC AGCCGGCGTC CCGACCACCG TGATCGTCAA CCGGGGAGTG
GTTCTCAACC TGAAGAACCC CGCCCGCTAC GTCAACATCA CCGACAAGGA CGTGATCGAC
GTCCCCGATC CCCTGCGCCG CAACCAGCTC CTCATCAACG GCAAGAAGAT CGGCTCCACC
AACCTGATTG TGTGGGAGGA GAACAATGAG AATCCGACCT TCTTCGACGT CCGGGTGGTG
GGGGACCGGG AGGCCATCGA GTCCCAGATC AGGGATTACG CCCCCAACGA CGACATCAGC
GTCCAGTATG CGCAGGACAC GGTGGTCCTC TCCGGCAAGG TCGCCAACGA GATGACCGGC
AAGAAGGCGG AGGAGATCGC CAAGGCCTAC TCAGCGAAGG TGCTGAACCA CATCACCGTC
GATGAGCCGC AGCAGGTGCT TTTGCAGGTC AAGGTGGCGC AGGTGGACCG GACCTCGCTG
AAGCGGCTCG GCATCAGCGC CATGGTGAAG GGAAGGACCG CCGAAGGGTT CATGAACCTG
GTAGGCGCCC CCAGCGGCAC CAGCAGCGTC ACCAACTCGA GCGCCACAAG GTTCACCTCG
ACCGAGGCTT CCGGCATCGC GGGTTCCATC CCGGGGCTTG GGAGCTTCGA CCCGCTGGAC
GCCTTCAACG TCGGGGTCTC CTACTTCCCC GCCGGCATCG GTGCCGTGCT CCAGGCCTTG
AGCAGCAAGG GGCTCGCCAA GATCCTCGCC GAGCCCAACC TGCTGGTGAA AAGCGGCGAA
GAGGGGAATT TCCTCGCCGG GAGCAGGATC CCCTACAGCG TGCTGATCTC GACCGGCGGG
GCGTCCACCT CGTCCATCAT CTTCGAGACC GTGGGGGTGA AGCTCAAGTT CAAGCCGCAG
GTGCTGCAAA ACGGCCTGAT CAACCTGAAG ATCGATCCCG CTGAGGTAAG CAGCATCGCC
GGGACCCTCG CGGTCAACGG CTACCCCATC ATCGACACCA GGGACGTCCG GACCGACGTG
GAACTGCGGG ACGGCGAGAG CCTGATTCTG GCCGGCCTGC TCCAGGAAGA GCAGATCAAG
ACCATGTCCA AGATCCCACT TTTGGGGGAC ATACCGATCC TCGGTGCGCT GTTTCGCTCC
TCGCAGAAGG ACATCCGGGA GAAGGATCTG GTCTTTTTCA TCACGCCGAA AATAGTTAAG
CCCACTCCCG CAGGGGTCGC GACCAAGCTC CCCACCGACG CCGTTACTCC CGCGGAGGAG
AAGGGATACG ACTGGATCCC GCTGGGACGA AAGTAG
 
Protein sequence
MRCIPSKPCL AFLLFCLGLL IEATNATAGV PTTVIVNRGV VLNLKNPARY VNITDKDVID 
VPDPLRRNQL LINGKKIGST NLIVWEENNE NPTFFDVRVV GDREAIESQI RDYAPNDDIS
VQYAQDTVVL SGKVANEMTG KKAEEIAKAY SAKVLNHITV DEPQQVLLQV KVAQVDRTSL
KRLGISAMVK GRTAEGFMNL VGAPSGTSSV TNSSATRFTS TEASGIAGSI PGLGSFDPLD
AFNVGVSYFP AGIGAVLQAL SSKGLAKILA EPNLLVKSGE EGNFLAGSRI PYSVLISTGG
ASTSSIIFET VGVKLKFKPQ VLQNGLINLK IDPAEVSSIA GTLAVNGYPI IDTRDVRTDV
ELRDGESLIL AGLLQEEQIK TMSKIPLLGD IPILGALFRS SQKDIREKDL VFFITPKIVK
PTPAGVATKL PTDAVTPAEE KGYDWIPLGR K
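
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the final line of the gene sequence is used as input to keep the example short. Splitting that line into codons assumes the preceding 22 lines hold 60 bases each, so the line starts in frame.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Final line of the gene sequence as displayed in the record.
last_line = "AAGGGATACG ACTGGATCCC GCTGGGACGA AAGTAG"
tail = clean_sequence(last_line)

# Split into codons (assumes the line begins on a codon boundary).
codons = [tail[i:i + 3] for i in range(0, len(tail), 3)]

if __name__ == "__main__":
    print(len(tail))             # number of bases on the final line
    print(codons[-1])            # last codon of the CDS
    print(gc_fraction(tail))     # GC fraction of this excerpt only
```

The last codon on that line is TAG, which is a stop codon under translation table 11. Applying `clean_sequence` and `gc_fraction` to the full gene sequence would give a value to compare against the GC content field in the record.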