Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2679 |
Symbol | |
ID | 8138021 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3113445 |
End bp | 3118328 |
Gene Length | 4884 bp |
Protein Length | 1627 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644870283 |
Product | Tfp pilus assembly protein tip-associated adhesin PilY1-like protein |
Protein accession | YP_003022473 |
Protein GI | 253701284 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3419] Tfp pilus assembly protein, tip-associated adhesin PilY1 |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.0000064285 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGACGCT TACTGGTCAT GCTTTTCACC GTTATCACCC TCGCCTGGTC GGCATCTGTC GCACAGGCAG ACCCCGCAGT AGTCACCACC AGCCCCGTCA ATGGTGCCGT CAACGTGGAT CCCACAAGCC TTGACACAAT ATGGGTCGGG TTCAACGACT ACGACATGGA CAAGAGCAGC TTCACGGGAA GCAACGCCGG GAGGGTTTCC GTCAACAACG GGGCCACCTA CACCGTGAGC AAGGTGGACG GCAGCTTGCC CTGGAACGGC GATTTCATCA AGATCTCGCT GAGCAAAGAC CTCAATTACA GCACCACCTA CACCGTCATC ATCGACTACC GAGTCAAAAA CAAGGAGGGT GAAAGGCTTG GTTCGAGCAG CGCCTCCAAC TACGTTTTCA GCTTCACAAC CAAAGCGAAA CCAAGCTCCG ACACGACTGC GCCGGTGGTA GATTCCACCT TTCCCGTCAG CAGTGCCGTC GACGTTCCCA TCACCGCGGC GATAACTGTG ACCTTCGATG AGGTGATGAA AGCCGATACC ATCAACAGCA CCAATTTCCT CATCAACAAC GGCGCGGTCG CCGGGGCAGT CACCCTCGAC GCCAGCGGTA AGGTCGCCAC CTTTACCCCG AGCGCCAACC TGAGCTCGTT CACCACCTAC ACCGCCACCG TCACCACAGG CGTACGCGAT GCGGCAGATA ACGCGCTTGC CAGCAACTAC TCCTGGAACT TCCGCACCAT GGCGCTCGAC AGTGTGCGCC CAACCGTCAC CGTCGTCAGC CCCATCGCCG GCGCCACCAG CGTCGATACC ACCACAGTGA TAACTGCGAC CTTCAGCGAG GCGATGAGAG ACACCACCAT CACCAGCGCC AACGTCTCGG TCAGCGGCGG GGTGACCGGC ACGGTGAGTT ACGATCCGGC GACCTGGACC CTCTCGTTCG CGCCGACCGC GGCTCTCGCC AATTCGACAA ACTACACGGT GACCATTTCC ACCGGCGTCA CCGACCTCGC CGGCAACTCC CTGTTAACGG CGAAGAGCTG GACCTTTACC ACCAGGGCCG TCACGACGCC GCCGCCTTTG AACGATTATT GCCAGGTCCC TCCCTACGTC ACAAGCACCA ACAACATGGT GAAGCCCAAC GTCCTGCTCG TGGTCGACAA CTCGGGGAGC ATGTACGAGT TCGCCTACAA GAACAGCGGC GCAGGCAACA ACTCGTACGA CACCAGCTAC ACCCCCGCGA AGAGTTACTA CGGGTATTTT GACGACAAAA AGATGTACCT CTACTCAGGG GGGGCATTCG TTCCCGACAC CGCCGCCTCG ACGGTCACAG ACACCACTAA ATTTTTATCC GGCAACTTCC TCAACTGGCT CACCATGAGG CGAGTCGACG TGGTCCGTAA GGTACTTGTG GGTGGCAAGA TGACCCCGCG CGTCGGCGCC GGCCGGTACC TCTATCCTGC CGGCTCCCCC GATCGCGACT TCTACAAGAG CTACAACAAC GTGAAGTACA CCGTCCAGGG GGGCGCCTCC ACCGAGGTGA TCAAGGACAC CACCAACAAC GTGACCTACA ACCTGAAAAT CGCGATTGGT GACGAGGGGA GCAATCAGGA CGAAGGGCTG GTGCCGAAGT ACGCCAACAT GATCAACTTC GGCATCATGT TCTACAACGA GGGCTACAAG TACGAAAACT CGGTCAACAA CGTGCGCGAC GGCGGGTACG TCGCCGCCGA CCTGGGCAGC ACCGGAAGCA ACCTGGTCAC GCAGATCGAG AGCACCGATC CCACCACCTG GACCCCCCTC GGCGAAACCC TTTTCGAGGC GACCAGGTAC TTCCAGGCAG GATCCAGTGC CTATAACGGC GGGACCTACT CCGGCAAAGA CCCCATCAAC TACGCCTGCC AGAAAAACTT CGTGCTGATC CTGACCGACG GGGAATCGAC CAAAGACGAG AACATCCCGG GCGGCTCGAC CAACTTTTCC GGAAAGGTAA CCGACAGCAG CTTCAACGTG AAGACCTGGA TGGACAGCAT CGCGACTCAG GAAGGGTATG CCAGCCAGTA CAGCTCCAGT GCCAATACCA GCGAAGGGAC CTACTACTTG GAGGGGGTAG CCTACTGGTC CCACGTGACC GACATGCGTT CGGCCTCCCT CGGCGACAGC GACATCCCCG GCAAGCAGAA CCTCACCATC TACACGGTCT TCGCCTTCGA CGACTCGCCC GTAGGCCGGG ATCTTCTGAA AAAAACCGCC AAGTACGGCG GATTCAACGA CTACGACTCG ACCGGAAAGC CGGACAAGGT CGCCAAGTGG GACCAGGACG GCAACGGGAT CCCTGACACC TATTACGAGG CCTCCGACGG CGCTGCACTC GCGGCCTCGC TGCAGAAGGC GTTCAACGAC ATCCTGGCGC GCGTCTCCTC GGGTACCGCC GCCTCCATTC TCAGTAACAG CGAGGGGAGC GGGGCGAACA TACTGCAGGC CGTGTTCCAC CCGAGGAAAT ACTTCGATGC CCAGACCTCC GCCGACTGGA TCGGGGAGAT GCACAACATG TGGTACTTCG TCGATCCCAA GATCAAAAAC AGCTCCATCC GCGAGGATAG CGACTACATC CCCGGGAGCC CTGCGCCGCC CCACTACCTG AACCTCAGTA AAGACAAGTT GATCAACTTC TACTTCAACA CCGACCAGGC CAAGACGATG GTCAAACGTT ACACCGACGT CAGGGGGGAC GGCCAACCTG ACCTCGACAC CAACGGCGAT CTCAAGGCGG ATTCCTACAC CCCGTACGAC GAGGTCGACT CCGACAGCGT AAAGAGCATC TGGAAGGCCG GAAAGCAGCT CTGGAGCCGC ACCGCCGCAA GGAATATCTA CACGAATCTC GCCGGGAGCC TCACCAGCTT CACCGGTCTG GACACCACCG ACGGCAATAT CCAGCAACTC CTGCAGGCGG CGAACAAGAC CGAAGCGGAC AAGATAATCT CCTACATCGC CGGCACCGAC CAAAGCGGGT ACCGCAACCG GACCGTCAAC ATCGGCGGCG TCACCGGCAC CTGGCGACTG GGGGATATCG TCTCCTCCAC GCCAAGGCTG CAGTCGTCGG TCAAGCAGAA CGTCTACAAC ATGCTTTCGC CGAAGGGTTA CGGCGACAGG TCTTACGGGG ACGACTACAC CAGGAAGGGG TACATCTACA CGTCTTCCTA CACCAACCGC GGCATGGTGT ACGTGGGCGC CAACGACGGG ATGCTGCACG CCTTCAAGCT GGGTCTTCTC GATGTCACTG CCTCCGGAGA CCGTAAAGGG AAGCTCTCCG GAGAAGATCT GGGTGAGGAG CAGTGGAGCT TCATTCCGAA GAACGCGCTG CCGTACCTGA GGTACAACGC CGACCGGGAC TACAACCATA TTTACTACGT GGACGGCACC ACCGTGATCA ACGACGTCAG CATGGGAACA CCGGCGGGCT GCACCGACAA CTACTGGGAC TGCACCAGGG ACGTTGAGAA CGGGAGCAAC TGGCGCACCG TCCTCGTCTC GAGCATGGGG CTCGGAGGTG CGTCGAAGAT CGCAGGCTCC GGCTGCAAGG GGACCACCTG CGTGGAAACC CCGATCACTG ACCCCGCCAA CGCCGGCGAG GGGGTGGGTT ACTCATCGTA CTTCGCCCTC GACATAACCA ATCCGGACAG CCCTTCGCTT TTATGGGAGT TCGGCAAGCC GAACCTCGGC TTCTCGACCA ACGGCGCCGC CTTCGTTAAG ATCAGCGCCA AGAAGGCGGA CGGGGTTACC CCGGACCTCA CCAAAAACGG GAAGTGGTTT GCGGTCCTCG CCTCCGGGCC TACCGGCCCC ATCGACACCG CGCTGCACCG GTTCAAGGCG AGTAGCAACC AGAACCTGAC CATCTACGTC CTGAACCTCG CGGACGGGTC TATCGCAGCC ACCATCGACA CGCTGGCCGA CGGGACCAAG CTAAGCAACG CCTTTGCCGG GACCATCACC AACGCGACGG TCGACACCGA CCGCTGGAAC AGGAACGCAC GTGGGCACTA TGAGGACGAC GCGCTCTACA TCGGCTACAG CCAGCTTTCC GGCACCGACT GGACCGGAGG CGTCTTGCGT CTGATGACCG GGGAGAACCT CGACCCGACA ACCTGGAAGC TCAGCAAGGT GATCGACGGG GTAGGGCCGG TGACCACCAA CTTGACCAGG CTGCAGGACC GTCAGAAGCA CAACCTCTGG CTCTACTTCG GCTCGGGCAG GTACTACTAC AGTCAGGACG ACAACGACGC CACCAGGCTC ATCGCTGCAG TGAAGGATCC TTGTTACGAC TCCGCCAATG ACAAGCTCAT TCCGACCTGC ACCACCAAGA AGAGTCTCTC CGACCTGACC AACCAGTCCA CCTCCCGCAG CGCTGTGACC AATGACGGAT GGTACATCAA CCTCGCCCCT GTCGACGCCA CCCATTCGTT CGGCGCGGAG AGGATGATCG CAGACCCGTC CGCGCTGGCC AACGGGGTGG TCTATTACAC CACCTTCGCT CCCACCACGG ACCCCTGCAG CTTCGGGGGA AAATCGTACA TGTACGGGCT GGATTACGAT AGCGGCAACG CTCCGGCGTG CAATCAGCTC GGAGAGGGCA TAGCGCTGGT GCAACTCTCC ACGGGCGCCT TCCAGGAGAT ACACCTGAAG GATGCTATAG GCTGCTACAA CAACAAGGGC GAGTTTGTTC CCCCGCTTCC CCCGACGGAT ACACAGGTGA ACCCGGCGGG GTACACGCCT CCCCCCGGGA AAAGCTACTC CATGGTCGGC AAACCGCCGG GCGACGCTTC GCCGATTGTT TCCGCCTCTG GCTTAGTACC GGCGAAACGG ATCTTGCATA TCCTGGAGCA TTGA
|
Protein sequence | MRRLLVMLFT VITLAWSASV AQADPAVVTT SPVNGAVNVD PTSLDTIWVG FNDYDMDKSS FTGSNAGRVS VNNGATYTVS KVDGSLPWNG DFIKISLSKD LNYSTTYTVI IDYRVKNKEG ERLGSSSASN YVFSFTTKAK PSSDTTAPVV DSTFPVSSAV DVPITAAITV TFDEVMKADT INSTNFLINN GAVAGAVTLD ASGKVATFTP SANLSSFTTY TATVTTGVRD AADNALASNY SWNFRTMALD SVRPTVTVVS PIAGATSVDT TTVITATFSE AMRDTTITSA NVSVSGGVTG TVSYDPATWT LSFAPTAALA NSTNYTVTIS TGVTDLAGNS LLTAKSWTFT TRAVTTPPPL NDYCQVPPYV TSTNNMVKPN VLLVVDNSGS MYEFAYKNSG AGNNSYDTSY TPAKSYYGYF DDKKMYLYSG GAFVPDTAAS TVTDTTKFLS GNFLNWLTMR RVDVVRKVLV GGKMTPRVGA GRYLYPAGSP DRDFYKSYNN VKYTVQGGAS TEVIKDTTNN VTYNLKIAIG DEGSNQDEGL VPKYANMINF GIMFYNEGYK YENSVNNVRD GGYVAADLGS TGSNLVTQIE STDPTTWTPL GETLFEATRY FQAGSSAYNG GTYSGKDPIN YACQKNFVLI LTDGESTKDE NIPGGSTNFS GKVTDSSFNV KTWMDSIATQ EGYASQYSSS ANTSEGTYYL EGVAYWSHVT DMRSASLGDS DIPGKQNLTI YTVFAFDDSP VGRDLLKKTA KYGGFNDYDS TGKPDKVAKW DQDGNGIPDT YYEASDGAAL AASLQKAFND ILARVSSGTA ASILSNSEGS GANILQAVFH PRKYFDAQTS ADWIGEMHNM WYFVDPKIKN SSIREDSDYI PGSPAPPHYL NLSKDKLINF YFNTDQAKTM VKRYTDVRGD GQPDLDTNGD LKADSYTPYD EVDSDSVKSI WKAGKQLWSR TAARNIYTNL AGSLTSFTGL DTTDGNIQQL LQAANKTEAD KIISYIAGTD QSGYRNRTVN IGGVTGTWRL GDIVSSTPRL QSSVKQNVYN MLSPKGYGDR SYGDDYTRKG YIYTSSYTNR GMVYVGANDG MLHAFKLGLL DVTASGDRKG KLSGEDLGEE QWSFIPKNAL PYLRYNADRD YNHIYYVDGT TVINDVSMGT PAGCTDNYWD CTRDVENGSN WRTVLVSSMG LGGASKIAGS GCKGTTCVET PITDPANAGE GVGYSSYFAL DITNPDSPSL LWEFGKPNLG FSTNGAAFVK ISAKKADGVT PDLTKNGKWF AVLASGPTGP IDTALHRFKA SSNQNLTIYV LNLADGSIAA TIDTLADGTK LSNAFAGTIT NATVDTDRWN RNARGHYEDD ALYIGYSQLS GTDWTGGVLR LMTGENLDPT TWKLSKVIDG VGPVTTNLTR LQDRQKHNLW LYFGSGRYYY SQDDNDATRL IAAVKDPCYD SANDKLIPTC TTKKSLSDLT NQSTSRSAVT NDGWYINLAP VDATHSFGAE RMIADPSALA NGVVYYTTFA PTTDPCSFGG KSYMYGLDYD SGNAPACNQL GEGIALVQLS TGAFQEIHLK DAIGCYNNKG EFVPPLPPTD TQVNPAGYTP PPGKSYSMVG KPPGDASPIV SASGLVPAKR ILHILEH
|
| |