Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3204 |
Symbol | |
ID | 8138556 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 3715997 |
End bp | 3717352 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644870809 |
Product | type II and III secretion system protein |
Protein accession | YP_003022989 |
Protein GI | 253701800 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4964] Flp pilus assembly protein, secretin CpaC |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 66 |
Fosmid unclonability p-value | 0.154583 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGATGCA TCCCCTCAAA ACCGTGCCTA GCCTTCCTGC TTTTCTGCCT CGGCCTTCTT ATCGAGGCGA CGAACGCGAC AGCCGGCGTC CCGACCACCG TGATCGTCAA CCGGGGAGTG GTTCTCAACC TGAAGAACCC CGCCCGCTAC GTCAACATCA CCGACAAGGA CGTGATCGAC GTCCCCGATC CCCTGCGCCG CAACCAGCTC CTCATCAACG GCAAGAAGAT CGGCTCCACC AACCTGATTG TGTGGGAGGA GAACAATGAG AATCCGACCT TCTTCGACGT CCGGGTGGTG GGGGACCGGG AGGCCATCGA GTCCCAGATC AGGGATTACG CCCCCAACGA CGACATCAGC GTCCAGTATG CGCAGGACAC GGTGGTCCTC TCCGGCAAGG TCGCCAACGA GATGACCGGC AAGAAGGCGG AGGAGATCGC CAAGGCCTAC TCAGCGAAGG TGCTGAACCA CATCACCGTC GATGAGCCGC AGCAGGTGCT TTTGCAGGTC AAGGTGGCGC AGGTGGACCG GACCTCGCTG AAGCGGCTCG GCATCAGCGC CATGGTGAAG GGAAGGACCG CCGAAGGGTT CATGAACCTG GTAGGCGCCC CCAGCGGCAC CAGCAGCGTC ACCAACTCGA GCGCCACAAG GTTCACCTCG ACCGAGGCTT CCGGCATCGC GGGTTCCATC CCGGGGCTTG GGAGCTTCGA CCCGCTGGAC GCCTTCAACG TCGGGGTCTC CTACTTCCCC GCCGGCATCG GTGCCGTGCT CCAGGCCTTG AGCAGCAAGG GGCTCGCCAA GATCCTCGCC GAGCCCAACC TGCTGGTGAA AAGCGGCGAA GAGGGGAATT TCCTCGCCGG GAGCAGGATC CCCTACAGCG TGCTGATCTC GACCGGCGGG GCGTCCACCT CGTCCATCAT CTTCGAGACC GTGGGGGTGA AGCTCAAGTT CAAGCCGCAG GTGCTGCAAA ACGGCCTGAT CAACCTGAAG ATCGATCCCG CTGAGGTAAG CAGCATCGCC GGGACCCTCG CGGTCAACGG CTACCCCATC ATCGACACCA GGGACGTCCG GACCGACGTG GAACTGCGGG ACGGCGAGAG CCTGATTCTG GCCGGCCTGC TCCAGGAAGA GCAGATCAAG ACCATGTCCA AGATCCCACT TTTGGGGGAC ATACCGATCC TCGGTGCGCT GTTTCGCTCC TCGCAGAAGG ACATCCGGGA GAAGGATCTG GTCTTTTTCA TCACGCCGAA AATAGTTAAG CCCACTCCCG CAGGGGTCGC GACCAAGCTC CCCACCGACG CCGTTACTCC CGCGGAGGAG AAGGGATACG ACTGGATCCC GCTGGGACGA AAGTAG
|
Protein sequence | MRCIPSKPCL AFLLFCLGLL IEATNATAGV PTTVIVNRGV VLNLKNPARY VNITDKDVID VPDPLRRNQL LINGKKIGST NLIVWEENNE NPTFFDVRVV GDREAIESQI RDYAPNDDIS VQYAQDTVVL SGKVANEMTG KKAEEIAKAY SAKVLNHITV DEPQQVLLQV KVAQVDRTSL KRLGISAMVK GRTAEGFMNL VGAPSGTSSV TNSSATRFTS TEASGIAGSI PGLGSFDPLD AFNVGVSYFP AGIGAVLQAL SSKGLAKILA EPNLLVKSGE EGNFLAGSRI PYSVLISTGG ASTSSIIFET VGVKLKFKPQ VLQNGLINLK IDPAEVSSIA GTLAVNGYPI IDTRDVRTDV ELRDGESLIL AGLLQEEQIK TMSKIPLLGD IPILGALFRS SQKDIREKDL VFFITPKIVK PTPAGVATKL PTDAVTPAEE KGYDWIPLGR K
|
| |