Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU2609 |
Symbol | |
ID | 2687704 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 2877043 |
End bp | 2878980 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637127299 |
Product | type IV pilus assembly protein, putative |
Protein accession | NP_953654 |
Protein GI | 39997703 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCAGA AAAAGGTTGG CGAGATACTT ATCGAGCACC GGCTGATCTC CGAGGATCAG CTCCGCGAGG CCCTGGAACT GCAGAAGGTC TTCCCCGACC AACCCGTGGG GCAACTGCTC TGCAAGCTCG GGTTCCTGAG CGAGAGTGAG CTGTCCTATA TCCTGGAGCA GACCGGCAAG CGCCAGAAGC TGGGCGATAT CCTCATCAGG GAGCGCCTGG TCGACGAGGA GCGCCTGAAC CAGGCCCGGG TCGCCGCCAA GCGGGACGGC TCCACGCTGG AGCGTGCCCT CCGCAAGCTG CGTCTCGTCG AAGAGGAGCC TCTGGCCAAG ACCATTGCCA CCCAGTACGA CCTCTCCTTT GTTCACATCA ACACCCTGGA GATCGAGCCG GACCTCGCCC GCTGCATCAA CCCCAACTAT GCCCAGCGCC AGCGGATCGT CCCCATCTCC CGCATCGGCA ACACCATCAC TCTGGCCATG GCCTATCCCA TCAAGCTTCA TGAGCTGAAG GAGCTTGAGC AGAGCATCAA GTCGCGGATC ATCCCGGTCA TTGCCATGGA GAGCGAGATC ATCCAGGCCC AGCAGCGCCT CTACAAGACC GCCGCCAGCG CCGCCCACGC CCTTACCCTC GATGAGGCCG ACCTGGAGAT CGCGCCGGGA AGCATCGTCG ACATCCTGAG CTCCGGCGCC GGGGAGGATG AGCCGGACAT CGATGACGAG GTGCGCACGA TCACCGAGCG CGACAGTGTC ATCGTCAAGC TCGTCAACAA GATCATCTTC GACGCCCACC AGAACCGCGC CTCGGATATC CATATCGAGC CCTATCCCGG CAAGAACGAC GTGATCGTCA GGATGCGGGT CGACGGCAGC TGCAAGGTGT ACCAGCGCAT CCCGTTCAAG TACAAGTACG CCATCCCCTC GCGCCTCAAG ATCATGGCCG AGCTGGACAT CGCCGAGAAG CGCAAGCCCC AGGACGGCAA GATCAATTTC AAGAAATTCG GCCCCCTGGA CCTGGAACTG CGGATCGCCA CCATGCCAAC GGCCGGTGGA CTCGAGGACG TGGTCATCCG GCTCCTGAAC ACGGGCCAGG CCTATTCATT CGACAGTCTC AGCCTCACCG ACCGCAATAT GCGCATCTTC GGGGAATCCA TCACGAAGCC CTACGGCCTT GTCCTGGTGG TGGGCCCCAC CGGAAGCGGC AAGACCACCA CCCTCCACGC GGCCATCGCC CGCATCAACC GACCCGAGGT GAAGATCTGG ACCGCCGAGG ACCCGGTGGA GATCACCCAG AAGGGGCTCC GGCAGGTTCA GGTCAACCAG CGCATCGGCC TCACCTTTGC CGCGGCACTG CGCTCGTTCC TGCGGCTCGA CCCGGACGTG ATCATGGTGG GCGAGATGCG GGACGAAGAG ACCGCCTCCA TCGCGGTGGA GGCGTCCCTC ACCGGGCACC TGGTCCTGTC GACCCTGCAC ACCAACTCGG CCCCGGAGAC GGTCACCCGC CTCCTGGAAA TGGGACTCGA CCCCTTCAGC TTCTCCGATT CGCTTCTCTG CGTCGTGGCC CAGCGCCTGG CCCGCCGCCT CTGCGAGGAT TGCCGCGAGC TTTACCGCCC TGACCGCAAG GAGCTCTCCG AGATCATCGA GGAGTACGGC GAGGAGCAGT TCGCCGCCAC GGGACTGCTG GGCAACGAGG TGGTCCTGGC CCGGCCGGTG GGGTGCACCA CCTGCAACCA GAGCGGCTAC CGGGGGCGCC TCGGCATTCA CGAGGTGCTC GAAGGCACCG ACACCATGAA GAGCCTCGTC AAGAAGAAGT CCGACACCGA GATCATCCGG CGGCAGGCCA TGGCCGACGG CATGACCACC CTGCGGCAGG ACGGCATCCT CAAGGTCTTC CAGGGGCTCA CCGACATCCA CGAAGTACGC AAGGTCTGCC TCAAGTAA
|
Protein sequence | MSQKKVGEIL IEHRLISEDQ LREALELQKV FPDQPVGQLL CKLGFLSESE LSYILEQTGK RQKLGDILIR ERLVDEERLN QARVAAKRDG STLERALRKL RLVEEEPLAK TIATQYDLSF VHINTLEIEP DLARCINPNY AQRQRIVPIS RIGNTITLAM AYPIKLHELK ELEQSIKSRI IPVIAMESEI IQAQQRLYKT AASAAHALTL DEADLEIAPG SIVDILSSGA GEDEPDIDDE VRTITERDSV IVKLVNKIIF DAHQNRASDI HIEPYPGKND VIVRMRVDGS CKVYQRIPFK YKYAIPSRLK IMAELDIAEK RKPQDGKINF KKFGPLDLEL RIATMPTAGG LEDVVIRLLN TGQAYSFDSL SLTDRNMRIF GESITKPYGL VLVVGPTGSG KTTTLHAAIA RINRPEVKIW TAEDPVEITQ KGLRQVQVNQ RIGLTFAAAL RSFLRLDPDV IMVGEMRDEE TASIAVEASL TGHLVLSTLH TNSAPETVTR LLEMGLDPFS FSDSLLCVVA QRLARRLCED CRELYRPDRK ELSEIIEEYG EEQFAATGLL GNEVVLARPV GCTTCNQSGY RGRLGIHEVL EGTDTMKSLV KKKSDTEIIR RQAMADGMTT LRQDGILKVF QGLTDIHEVR KVCLK
|
| |