Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU2028 |
Symbol | |
ID | 2688023 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 2219965 |
End bp | 2222649 |
Gene Length | 2685 bp |
Protein Length | 894 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637126719 |
Product | type IV pilus biogenesis protein PilQ |
Protein accession | NP_953077 |
Protein GI | 39997126 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4796] Type II secretory pathway, component HofQ |
TIGRFAM ID | [TIGR02515] type IV pilus secretin (or competence protein) PilQ |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATCC AGACGACATT CTCCCGGAAA CTCGTCCTGA TGTCACTGCT GGCTGTGTTC TGCGGATGCG CCGACGTTCA CTCCAGCGTG AAGGGCGACG CCATGGTCGA GCAGGCCAAG GGGGGCGTCC TGCGTGAGAT CAAGGTGGCG GAAACAGGTG AGGGTGCCCG TGTCGTACTG TCGGCCGACC GTCCCCTGGC CTACACGTTC TACAAAACGG CAGATCCGCC CAAGGCGGTC ATCGATCTGG CCCGCACCGA ACCGGGCGCC ATTGCGTCGC CGATGGACGT GAACTCCGAT ACCATCCGGC GTATCGAGAC CGTGCGGTAC GGTGAGGGTG CCACGGCCAT GACCCGTGTG GAGGTATATT TCACACGCGA CACCGAGGCG AATGCGACCA TCGACGCCAG TGACAAAGGA ATGCTTGTCC TTGCCGTGGC TCGTCCTGTC GCGCCCGCCG CCTCCGTTGC CGTTGAAGCC GCACCGGCTG CCGGCCAGGC CGGCGAGAGC TCTACGGGTG CCCCGGTAGC AGTCGCTGCA GCCCCGGCAG TAGCTCCGGC TCCCGTTGCC GTGGAGCCTC AGGCTCCGAT CGCTCCGGTT GTTGCTCCGC AGACCGGCCC CCCGGTCATC AAGGCCATCG AAGCGCAGAA TGGTTACCTG GTGATCGAAA CGAGCGGGGA GATCAAGGAC TTCAAGTTCT TCCGCCTTGC CAAGCCCGAC CGGCTGGTGG TGGATATCTC GGACGCCAGG CTCGGCATCA ATTCCAAGGC TATTCCCCTG AATGCCCTCG GTGTAGGGAT GGCCCGCATT GGTGCTTATC CCGATAAGGT CAGGATTGTT CTCGATGCTG CGGGGGGGAC CCTCTCTCCT CTCAATGTAA CTAAAGGTGC AGCTGGCCTG ATTGTTGCCC CGGCTGACAA GGTTGTTGAC GTACCTCGTG CTGCCGCCCC TCCGCAGCCG TCCGCCTCGG CATCCCAGGC ACGGCAGCCG GAAGCGGTGA AGTCCGCCGA GCCGGCAAAG GGCTTAACAC CCGCGGTTGA TGCCATTGAG TTCAAAGTAC TTGACGGGAC CTCACGTATC ACTGTTGCCG TCACCGGCGC CTGTGCCAAC GACAAACCGG TCAAATCCGC CGACGGCTTC AGCGTAACGC TTAGAAACTG CATCCTTCCC AAGAAGCTTC AGCGTCATCT CGATACGGGC GCTTTTGCCA GTGTCGTTCA GAAGGTTACC CCCTACCAGG TGAAGACGAA GGGGCGCAGT GACGTAAAGA TCCAAGTGCA ACTCCGCCAG CCCGCATCCT ACGATGTCAG GCGGGATGGG GACCTGCTGC AGGTGAGCGT CCGGAACCCA GAAGGTTTTG AGCCCCCGGT CGCGGATGTG CCTGCCTCCC CCACGCTTCA GGATGGCATG GATCAGGCCG CTGTCCGGCA GAAGGAACCC TCCAGGGAGA CCGACCCGCT TGCCGGTGTC GCCCAGTCGG GTGGAACGAA GAAGGCGTAT ACCGGGCGCC GCGTGACGCT TGAATTTTCC GATGCCGACA TCCGCAAGAT ATTCCAGCTG ATTGCGGAGG TCAGCAATCT CAACTTCCTC GTGGGCGACG ACGTTACCGG CACCATCAGC CTTAAACTGG TCAACGTTCC CTGGGACCAG GCCCTTGATG TCATTCTGGA GAACAAGGGG CTGGGGATGC AGCGGGACGG CAATATCGTA CAGATTCGTC CCAAATCCAA GATCCAGACC CTGGCTGACG AGGAGCAGGC GCTCAAGCGG GCCAAGGAAC GGGGCATGGA GCTGAAGACA GAGGTCTTTG ACATCAACTT TGCGGCGGTG GGGGATATCG TGTCCCAGTT CAACGCCGTC AAGAGCGAGC GGGGGACCAT CAGCCAGGAT GCGCGGACCA ACCGGGTAAT CGTGAAGGAT ATCGAGCCGG CCCTGGCTGA AATGCGCATC CTGCTCAAGA ACCTTGATCT GCCTGAAAAG CAGGTGCTCA TCGAAGCGCG CATTGTGGAA GCAACCTCCA CCTTCACTCG TGATCTAGGG GTCCAGTGGG GCATACACAG CAACGATTCC GGCGCTGATA TTATCAGAAG CGTTGATGCC GGATTCGGCG GGATCGTAAC GCCGCCGCCG GCCAGCGGCT TCCCTGCGGC AACATCATCC GGCGGTGCTG TTGGGATCAG CTTTGCCAAA ATGGGGAGCT TGCAGGTTGA TCTGCGGTTG TCGGCAGCAG CCGTTGCAGG CCTTGTTAAG ATCGTCTCGA CGCCCAAGGT TGTTACGCTT AACAACAAAG CGGCGAAAAT TTCGCAAGGT CAGTCGATAC CGTATCAAAC CACATCAGCC GAAGGAACCA AAACCGAGTT CGTGGAGGCG GCGCTGACAC TCGAAGTAAC GCCTCACATA ACAGCGGACG GCAGCGTTTC CATGAAGATT AAGGCCTCAA ACAACTCAGC TGGCACCGGG TCTCCGCCTC CGATTAACAA GAAGGAGGCA ACCACCGAAC TGTTGGTCAA GAACGGTGAA ACCACCGTAA TTGGCGGTAT CTATGTAGAC AGCGATACTG ATGAAGACAG AGGGGTACCA TTCCTGATGG ATATTCCGGT CCTTGGCTGG CTATTCAAGT CGAACACGAA GAACAAGACA AAAACGGAAT TACTCATTTT CATAACGCCA AAAATTGTGA GCTGA
|
Protein sequence | MRIQTTFSRK LVLMSLLAVF CGCADVHSSV KGDAMVEQAK GGVLREIKVA ETGEGARVVL SADRPLAYTF YKTADPPKAV IDLARTEPGA IASPMDVNSD TIRRIETVRY GEGATAMTRV EVYFTRDTEA NATIDASDKG MLVLAVARPV APAASVAVEA APAAGQAGES STGAPVAVAA APAVAPAPVA VEPQAPIAPV VAPQTGPPVI KAIEAQNGYL VIETSGEIKD FKFFRLAKPD RLVVDISDAR LGINSKAIPL NALGVGMARI GAYPDKVRIV LDAAGGTLSP LNVTKGAAGL IVAPADKVVD VPRAAAPPQP SASASQARQP EAVKSAEPAK GLTPAVDAIE FKVLDGTSRI TVAVTGACAN DKPVKSADGF SVTLRNCILP KKLQRHLDTG AFASVVQKVT PYQVKTKGRS DVKIQVQLRQ PASYDVRRDG DLLQVSVRNP EGFEPPVADV PASPTLQDGM DQAAVRQKEP SRETDPLAGV AQSGGTKKAY TGRRVTLEFS DADIRKIFQL IAEVSNLNFL VGDDVTGTIS LKLVNVPWDQ ALDVILENKG LGMQRDGNIV QIRPKSKIQT LADEEQALKR AKERGMELKT EVFDINFAAV GDIVSQFNAV KSERGTISQD ARTNRVIVKD IEPALAEMRI LLKNLDLPEK QVLIEARIVE ATSTFTRDLG VQWGIHSNDS GADIIRSVDA GFGGIVTPPP ASGFPAATSS GGAVGISFAK MGSLQVDLRL SAAAVAGLVK IVSTPKVVTL NNKAAKISQG QSIPYQTTSA EGTKTEFVEA ALTLEVTPHI TADGSVSMKI KASNNSAGTG SPPPINKKEA TTELLVKNGE TTVIGGIYVD SDTDEDRGVP FLMDIPVLGW LFKSNTKNKT KTELLIFITP KIVS
|
| |