Gene Information |
Locus tag | Gmet_0737 |
Symbol | |
ID | 3740543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter metallireducens GS-15 |
Kingdom | Bacteria |
Replicon accession | NC_007517 |
Strand | + |
Start bp | 809210 |
End bp | 812185 |
Gene Length | 2976 bp |
Protein Length | 991 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637778015 |
Product | type IV pilus assembly protein PilY1 |
Protein accession | YP_383704 |
Protein GI | 78221957 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3419] Tfp pilus assembly protein, tip-associated adhesin PilY1 |
TIGRFAM ID | |
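The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are inclusive, 1-based coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(809210, 812185)
print(length)                     # 2976
print(protein_length_aa(length))  # 991
```

Both results agree with the Gene Length (2976 bp) and Protein Length (991 aa) fields of this record.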
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.465349 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 80 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGAATCA GCAATTTTCA TGCACGCAAA ATCCTCTTGT TAGCCGCAAT TCTCTGCTCT GTCGGCATTC TCGCGGCCAT GTCCTTTGCG GCGATTTCCC AGTACCCTCT GTTCCTGACC GGCAGCGTGC AGCCGAACGT CATGATTCTC CTGGACAACT CCGGCAGCAT GAATACGATC ATGGAGCATC GGGACTACAG TCCCGGTACC GTGTATTCCG GCTCGTTCAG GGGGGGCGAA ATTTACTTTA ACAGAATTGA CTACAGTAAA AGTGGAAATC CCTATTACCT GGTGAGCAGG GACACGGGGC ACACCGTCGA TGGTAACAAT CAGAACCAGT ACACAGTCAA TGGCCGCTCG ATCACGCTTC CGTTTCCCTA TGTTGACACC CGCTGGAATG GCAACTACCT GAACTGGCTT TTTTTTCATG CGACTGCGGC CCAGTACGGC AGCCTAGCCA CCGATGCATC GATCCGGGTG ACAAGAATCC AGACGGCCAG ATCTGTCATC AGCGACGTGG TTAAGAATGT CTCGGGTGTC AGGTTTGGCC TGTTCAAGCT GAACAACAGC CAGGGGGGGA GCAAGGTAAA GGATTGCGGC GCCCTGACCC CGACAACCGT CGATGCGGCG GTGGACGGGA TCAATGCAGA AACCTGGACC CCCTTGGGGG AGACCCTCTC GGAAATCTGG CAATACTTCA GGGGGGGGGA TTCCCTCTAT AATTCGGGAA CCTATACGAC CCCCATTACC AACGGGTGCC AGAAGAATTA CACAATTATC GTGACTGACG GGGAGCCGAC CTACGACAAC TGTTATCAAG GGCCGTTTGC TTCCTACGGC TGTCCCAAGG ACCCCTATGA CAAAGAGAAC GAAAACGCTC CGAGCCATCT GGCCGATGTG GCGAAAGACA TGCATGACGG CAACGCATCG CCGTTCAGTG GCAGGGTGCA GAATGTCTCC ACCTACACCA TCGGCTTGAC GCTGGACAGC GCCCTCCTCG ATCAGACAGC GGTGAACGGA GGGGGAAAAT ACTTCACGAC GACGTCGGGA ATCAGTCTTG CCACGGCCGT CCAGAATGCC CTGGCCGATA TTGTGTCCAA AATGGCGGCA GGAAGCAGCG TCGCCGTGAA TACCCCGTTT CTGAACTCGA ACAGCACGCT CTATCGGGCA AGATTCAAGT CACCGGACTG GAACGGTTAT CTGGAGGCCT TCAAGCTTGA CGCCGCCACC GGTGCTATCA TCGGATATCC GAACTCTCCC GAATGGGAGG CGGGCTCTCG ACTCAATAGT CGCTCATCTG CCCGGGCAAT CTATACCGCC GGCGTGGATG GCGGCACGTA CAAGCGGCTT GAGTACAGTA CCGCCAACGG GACCAAACTG GCGGGTGCGG GATTCATGAA TTTTTCGTCC GCAAAGGCCA GCGATCTCAT CGGGTATATC CGCGGTGATC GCTACGGAAC AACAAACCCC GCTGGGTATC GCAACCGGAC GAGCAAGCTC GGGGACATCA TCTCATCATC ACCCGTCGTC TACGGCCCTC CCGACGGTGT CTACACCGAT GCAGCCTACA AAGCATTCAA GAGGGACTGG GCAACCCGAA CATCGCTCAT TCTGGCGGGG GCCAACGACG GGATGCTCCA CGCCTTTGAC GCCGCGACCG GTGACGAGGC ATGGGCTTTC ATCCCGAATA TTCTCTTAAG CAAGCTGAAG CTCCTGAGAA ATGATCCCTA TACGCACACA AGTTACGTCA ATGGCGCGAT CACGGTTGCC GACGCCTATA TCCAGACGAA AAATGCCGAC 
GGTACCACTG CCGGAACCGC GGGGTGGCGC TCCATAGCCG TCTGCGGCCT CCGCGACGGC GGCAAGGGGT ATTTCGCCCT CGATATCACT GATTCGGCCA ATCCGATACC GTTATGGGAG CTTACCGCCA CATCATCGGC AACACCGAAC GGTCTGGGGT ACTCGTTTGG AACCCCCCTG ATCCTCAAAC TGAGGGATGA AGCGGCAACC AGCGGATTCC GCTGGGTTGC GGCCCTTGCG AACGGCTATG AGGGACCCAC AAGCAGCAAG GCCGCTTCGC TCATCATTGC CGACCTCGCC ACGGGAGCGG TCATCAACGA GATCGTTGTC GACAAGTCGG CATTCAGTGG TGTCTCCCCT AATGGGCTTG CCTCTCCCGC TGCCATTGAC AGGGATATGG ACGGCTTCGC CGACACCCTC TATGCCGGGG ATCTCAAAGG GAACATGTGG CGGTTTGACG TGAGCAGCGC CAAGATGGCC CAGTGGAAGG CTGACTGCAT CTTTTCCGCC GGGGTCACCC AGCCGATAAC CGCCGCTCCT GACGTGGTGG TGCGCCTCGG GTATCAGTAT GTCTTCTTCG GCACCGGGAA ATACCTTGAC GAGGGTGACA AGACCACATC ATTCACTCAA AGCTTTTACG GGGTAAAGGA TGATAATTCG ACGAAAAATC TGACGCAAGC TGATCTGGTC GGTCAGACCA TAACGGAGGT GACGTATTCG GGAGCCGTGT ACCGGACCCT TTCAAGCTTT ACGGTCGGCA GCAAGGATGG GTGGTATCTT GATCTCCCCG GTAAAGGCGA ACGAGTTATC GCGGAACCTG AAGCTACCGG GACCAATGAT GCCGGCAAGG TCAAATTCAC AACTTTTATT CCATCCACCG ATCCCTGTGA GCCGGGAGGT AAAGGTTGGC GCATGCAGGT CAATATGGAG ACGGGTGGAG AGCCGAAGAA GTCGGTGTTT GTGGTGCCAG GAAGGCCGGA CGGCACTGTC CCTGTGGGTG ACAGCTCTCG GAGGCCCTCG GGGATGCTTC TCTCTTCTGG GACTGTGGCA CCACCAGGAG GAACGAATGA TTTGGATATC ATCCAAAAAA TCGACACAGG TCTTGAACCT AAGCAAGATG AGAGCGGGAC CCACCAGTTC GGTCTGCGGA GTTGGAGGCA GCTTCTGTCC ATTTAG
Protein sequence | MRISNFHARK ILLLAAILCS VGILAAMSFA AISQYPLFLT GSVQPNVMIL LDNSGSMNTI MEHRDYSPGT VYSGSFRGGE IYFNRIDYSK SGNPYYLVSR DTGHTVDGNN QNQYTVNGRS ITLPFPYVDT RWNGNYLNWL FFHATAAQYG SLATDASIRV TRIQTARSVI SDVVKNVSGV RFGLFKLNNS QGGSKVKDCG ALTPTTVDAA VDGINAETWT PLGETLSEIW QYFRGGDSLY NSGTYTTPIT NGCQKNYTII VTDGEPTYDN CYQGPFASYG CPKDPYDKEN ENAPSHLADV AKDMHDGNAS PFSGRVQNVS TYTIGLTLDS ALLDQTAVNG GGKYFTTTSG ISLATAVQNA LADIVSKMAA GSSVAVNTPF LNSNSTLYRA RFKSPDWNGY LEAFKLDAAT GAIIGYPNSP EWEAGSRLNS RSSARAIYTA GVDGGTYKRL EYSTANGTKL AGAGFMNFSS AKASDLIGYI RGDRYGTTNP AGYRNRTSKL GDIISSSPVV YGPPDGVYTD AAYKAFKRDW ATRTSLILAG ANDGMLHAFD AATGDEAWAF IPNILLSKLK LLRNDPYTHT SYVNGAITVA DAYIQTKNAD GTTAGTAGWR SIAVCGLRDG GKGYFALDIT DSANPIPLWE LTATSSATPN GLGYSFGTPL ILKLRDEAAT SGFRWVAALA NGYEGPTSSK AASLIIADLA TGAVINEIVV DKSAFSGVSP NGLASPAAID RDMDGFADTL YAGDLKGNMW RFDVSSAKMA QWKADCIFSA GVTQPITAAP DVVVRLGYQY VFFGTGKYLD EGDKTTSFTQ SFYGVKDDNS TKNLTQADLV GQTITEVTYS GAVYRTLSSF TVGSKDGWYL DLPGKGERVI AEPEATGTND AGKVKFTTFI PSTDPCEPGG KGWRMQVNME TGGEPKKSVF VVPGRPDGTV PVGDSSRRPS GMLLSSGTVA PPGGTNDLDI IQKIDTGLEP KQDESGTHQF GLRSWRQLLS I
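The sequences above are displayed in space-separated blocks of ten characters, so they need to be normalized before any downstream use. The following is a minimal sketch of that cleanup, plus a GC-fraction helper of the kind that could be used to recompute the GC content field. The helper names are hypothetical, and the example input is only the first three blocks of the gene sequence, not the full record.

```python
def normalize_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three 10-base blocks of the gene sequence, as displayed in the record.
prefix = normalize_sequence("ATGAGAATCA GCAATTTTCA TGCACGCAAA")
print(len(prefix))  # 30
print(f"{gc_fraction(prefix):.1%}")
```

Applying the same two helpers to the complete gene sequence would give a value to compare against the reported GC content; the short prefix here is not expected to match the whole-gene figure.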