Gene Information |
Locus tag | GM21_2665 |
Symbol | |
ID | 8138007 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3101662 |
End bp | 3104442 |
Gene Length | 2781 bp |
Protein Length | 926 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644870269 |
Product | type IV pilus secretin PilQ |
Protein accession | YP_003022459 |
Protein GI | 253701270 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4796] Type II secretory pathway, component HofQ |
TIGRFAM ID | [TIGR02515] type IV pilus secretin (or competence protein) PilQ |
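
The coordinate and length fields above appear mutually consistent, and a short sketch can confirm the arithmetic. It assumes that Start bp and End bp are inclusive, 1-based positions and that the reported gene length includes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for GM21_2665.
length = gene_length(3101662, 3104442)
print(length, protein_length(length))  # 2781 926
```

Under these assumptions the result matches the Gene Length (2781 bp) and Protein Length (926 aa) fields.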
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.000377004 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACAAGGT ACCAGTGCAG AATTCTATGT CATGCCGTGG CACTCATCGT TTTGTTGACC GTTTCAGCAG GATGCGTGAA ACGGATGACC GCTGCGAACG AGTCTGCCCA GGCAGAGGCC TCCGCCTTTG CGACCCTCAG GTCGGTCACG GTGTCGGCCG ACGCCACCAG CGTGGAGCTG GTCAGCGACA AGCCGATCAC CTATACCTCC TACAAGGGGG GGGATCCGAC CCAGATCATC GTCGACATCT CCCAAACCGA GCCGGGGGCT GTGACTTCTC CCATCGAGGT GAACCGCGGC AACATCAAGC GGATCGAGGT CGAGCGCCAA CCTTTGGGGG GGAGCGTGCT GACGCATATG ACGCTCGTTC TCACTAAGGA CGTCGACTTC GCGGTCGCCA CCGACCCCTC GGACAAGAGC AGGCTCAAGA TCTACCTTCC GGTAGTGGAG CCCGAGGCGA AGGCTGAGCC GACCGAAGTC AAGGAAGCCG CGCTTGCCGA GAGCAGGATC GACGAGAAGA CCCTGACGCC GGCTGCTGCT ACGCCCGCCG TTGAGGCAGG CTCCGCTCCC ATCAAGGCAG AAGCCGTGCC GGCACCGCCC GCCGCACCTG CCACGGAGTC TAAGGCAGCA ACCGCAGCTG TCGCAGAACC GAAAGCGGAT GCCGCACCTT GGAAGGCTGG CGGCGCAGAA GGGGGGAACG GTCTCAATGC GGTCATCCCA GGGGCGGACG GTGTGGATCT CTCCATCCAA GGGGGGGTAC AGACCTTCAA CTCCTTCAAG TTGACCAAAC CTGACCGCAT CGTCCTCGAC CTCTTCAAGG TCAAGAACTC GCTGCCCCGG AACGTGGTGC CTGTCAACGC CTTTGGCATC GCCAACGCCC GTGTGGGGTC GACTCCCGAC AAGGTGCGAG TGGTGTTGGA CGCAGCGGGG GAGAGCCTTC CCCCGTACGA AGTGGTCAAG AGCGACCTCG GGGTGAAGAT CAGGCTCAAG GGAAAGGCCA CGGCCATCGC CAAGGCTCCG GCTGTTGAGG CCCCGGCAGC GGCGCCCGCC CCGGTTCCTG CGCCTCCCGC AGCGAAACTG AAGACCGAGG TTCCCCACTC CCGCCAGACC AAGGGGGCGC TGGAAGGAAT CGAATTCAAG GTGGTCGGCG GCGTCTCCCG CGTTTCCATG AAACTCTTCG GGACCTGCGA ACCCGGGCAG CCGTTGCGCG GCCCCCAGGG GGTAACGCTC GGCATCGCCA ACTGCCAGGT ACCCAAGCAA CTGCAGCGCG CGCTCGACAC CACCAAATTC GGCACGCCGG TGCTCTCGGT AACGCCTTAC CAGGTGAAGG TCAAAGGGAG CACGGTTACC AAAGTGTTGG TGAAGTTACG CGGCAACCCG GAATTCAGCA CCAGCCGCAA GGGAGACCTG CTCCTTTGGG ATTTCGTAAA CCCGGGTCCT GTAGCTGCTC CCAAACTCCC TGCCGCACCG CCTGCTCCCA AGGCCAAGGC TCCTGTCGAG CCCAGGGTCG CCGAGGAACT AAGCCCGCCG CCTGCGCGCA GTAGCGACGA CATGGCGGTT CAGCTTCCCA GCGACCGCCC GGCCAAGAAG GTCTACACGG GGAGGAGGGT GACCCTCGAG TTCTCCGACG CCGATGTCAG GAAGATCTTC CAACTGATCG CCGAGGTGAG CAACCTAAAC TTCCTCATCG CCGATGACGT CACCGGGACC ATCAGCATAA AGCTGGTGAA CGTCCCCTGG GACCAGGCGC TCGACGTGAT TCTGGATGCC AAGGGGCTCG CCATGATACG CGAGGGGAAC 
ATCGTTCAGA TCAAGCCCAG GTCCAAGATG CAGAACCAGG CCGACGAGGA ACTCGCGGCT AAGAAAGCGG CTGAGCGCCT GATGGAATTG AGGACCACGG TGTTCGAAGT GAACTACGCC TCGGTGTCCG ACGTGGCGAC CCAGTTCGCC ATGCTAAAGA GCGATCGCGG CGTAATCACC AAGGACGAGC GCACCAGCCG CGTGATCGTG AAGGATATCC AGACCGCGCT GGACGACATG AAGGCGCTCC TGAAAACTCT GGACGCCCCC GAGAAGCAGG TGATGATCGA GGCCCGCATC GTCGAGGCGA CCTCGACCTT CACCCGCGAC CTCGGCGTGC AGTGGGGGCT CAGCTACCGC GACGGCTCCG CTTCCATGGC TGGCATCAAC TCAGTCGACA CTGGTTTCGG CGGCGTCGTT TCCGCGGCAG GGCCGGGTGC AACCGGCGCT GGTGGGCTCG GACTCGGCAT GTCCTTCGGC AAGCTGACCA GCAACATCAA GCTGGACATG AGGCTCGCGG CGGCCGCGAC CATCGGGCAG GTGAAGATCA TCTCCACCCC GAAAGTGGTC ACCCTGAACA ACAAGGCGGC CAAGATCTCA CAAGGGCAGT CCATACCGTA CCAGACCACT TCCGCCGAAG GGACCAAGAC CGAGTTCGTG GAAGCGGCGC TGACCCTTGA GGTGACCCCG CACATAACCG CAGACGGCTC GGTCAGCATG AAGATCAAGG CCAGCAACAA CTCGCCTGGC ACCGGGTCGC CCCCTCCTAT CAACAAGAAG GAGGCGACCA CCGAGCTCGT GGTCTCCAAC GGCGACACCA CCGTCATCGG CGGCATCTAC GTCGACAGCG AGACCGAATC CGATACCGGC GTCCCGTTCC TCTCGGACAT TCCCCTTTTG GGTTGGCTCT TCAAATCCAA CGCGAAGCAG AAGACGAAGA CGGAACTGCT CATTTTTATT ACGCCAAAAA TAATTTTATA G
|
Protein sequence | MTRYQCRILC HAVALIVLLT VSAGCVKRMT AANESAQAEA SAFATLRSVT VSADATSVEL VSDKPITYTS YKGGDPTQII VDISQTEPGA VTSPIEVNRG NIKRIEVERQ PLGGSVLTHM TLVLTKDVDF AVATDPSDKS RLKIYLPVVE PEAKAEPTEV KEAALAESRI DEKTLTPAAA TPAVEAGSAP IKAEAVPAPP AAPATESKAA TAAVAEPKAD AAPWKAGGAE GGNGLNAVIP GADGVDLSIQ GGVQTFNSFK LTKPDRIVLD LFKVKNSLPR NVVPVNAFGI ANARVGSTPD KVRVVLDAAG ESLPPYEVVK SDLGVKIRLK GKATAIAKAP AVEAPAAAPA PVPAPPAAKL KTEVPHSRQT KGALEGIEFK VVGGVSRVSM KLFGTCEPGQ PLRGPQGVTL GIANCQVPKQ LQRALDTTKF GTPVLSVTPY QVKVKGSTVT KVLVKLRGNP EFSTSRKGDL LLWDFVNPGP VAAPKLPAAP PAPKAKAPVE PRVAEELSPP PARSSDDMAV QLPSDRPAKK VYTGRRVTLE FSDADVRKIF QLIAEVSNLN FLIADDVTGT ISIKLVNVPW DQALDVILDA KGLAMIREGN IVQIKPRSKM QNQADEELAA KKAAERLMEL RTTVFEVNYA SVSDVATQFA MLKSDRGVIT KDERTSRVIV KDIQTALDDM KALLKTLDAP EKQVMIEARI VEATSTFTRD LGVQWGLSYR DGSASMAGIN SVDTGFGGVV SAAGPGATGA GGLGLGMSFG KLTSNIKLDM RLAAAATIGQ VKIISTPKVV TLNNKAAKIS QGQSIPYQTT SAEGTKTEFV EAALTLEVTP HITADGSVSM KIKASNNSPG TGSPPPINKK EATTELVVSN GDTTVIGGIY VDSETESDTG VPFLSDIPLL GWLFKSNAKQ KTKTELLIFI TPKIIL
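
The sequences above are displayed in space-separated blocks of 10 characters, so the spacing has to be removed before any analysis. The following is a minimal sketch of that cleanup, with a GC-fraction calculation and a stop-codon check. The helper names are hypothetical, the stop codon set is the one used by translation table 11, and the two example strings are short excerpts from the start and end of the gene sequence rather than the full record.

```python
# Stop codons of translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove display spacing and line breaks, and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def ends_with_stop(seq: str) -> bool:
    """True if the final three bases form a table 11 stop codon."""
    return clean(seq)[-3:] in STOP_CODONS


head = "ATGACAAGGT ACCAGTGCAG"    # first two blocks of the gene sequence
tail = "ACGCCAAAAA TAATTTTATA G"  # last blocks of the gene sequence

print(gc_fraction(head))     # 0.5
print(ends_with_stop(tail))  # True
```

Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field; the excerpt here only illustrates the calculation.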