Gene Gmet_0737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGmet_0737 
Symbol 
ID3740543 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter metallireducens GS-15 
KingdomBacteria 
Replicon accessionNC_007517 
Strand
Start bp809210 
End bp812185 
Gene Length2976 bp 
Protein Length991 aa 
Translation table11 
GC content57% 
IMG OID637778015 
Producttype IV pilus assembly protein PilY1 
Protein accessionYP_383704 
Protein GI78221957 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3419] Tfp pilus assembly protein, tip-associated adhesin PilY1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.465349 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones80 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAATCA GCAATTTTCA TGCACGCAAA ATCCTCTTGT TAGCCGCAAT TCTCTGCTCT 
GTCGGCATTC TCGCGGCCAT GTCCTTTGCG GCGATTTCCC AGTACCCTCT GTTCCTGACC
GGCAGCGTGC AGCCGAACGT CATGATTCTC CTGGACAACT CCGGCAGCAT GAATACGATC
ATGGAGCATC GGGACTACAG TCCCGGTACC GTGTATTCCG GCTCGTTCAG GGGGGGCGAA
ATTTACTTTA ACAGAATTGA CTACAGTAAA AGTGGAAATC CCTATTACCT GGTGAGCAGG
GACACGGGGC ACACCGTCGA TGGTAACAAT CAGAACCAGT ACACAGTCAA TGGCCGCTCG
ATCACGCTTC CGTTTCCCTA TGTTGACACC CGCTGGAATG GCAACTACCT GAACTGGCTT
TTTTTTCATG CGACTGCGGC CCAGTACGGC AGCCTAGCCA CCGATGCATC GATCCGGGTG
ACAAGAATCC AGACGGCCAG ATCTGTCATC AGCGACGTGG TTAAGAATGT CTCGGGTGTC
AGGTTTGGCC TGTTCAAGCT GAACAACAGC CAGGGGGGGA GCAAGGTAAA GGATTGCGGC
GCCCTGACCC CGACAACCGT CGATGCGGCG GTGGACGGGA TCAATGCAGA AACCTGGACC
CCCTTGGGGG AGACCCTCTC GGAAATCTGG CAATACTTCA GGGGGGGGGA TTCCCTCTAT
AATTCGGGAA CCTATACGAC CCCCATTACC AACGGGTGCC AGAAGAATTA CACAATTATC
GTGACTGACG GGGAGCCGAC CTACGACAAC TGTTATCAAG GGCCGTTTGC TTCCTACGGC
TGTCCCAAGG ACCCCTATGA CAAAGAGAAC GAAAACGCTC CGAGCCATCT GGCCGATGTG
GCGAAAGACA TGCATGACGG CAACGCATCG CCGTTCAGTG GCAGGGTGCA GAATGTCTCC
ACCTACACCA TCGGCTTGAC GCTGGACAGC GCCCTCCTCG ATCAGACAGC GGTGAACGGA
GGGGGAAAAT ACTTCACGAC GACGTCGGGA ATCAGTCTTG CCACGGCCGT CCAGAATGCC
CTGGCCGATA TTGTGTCCAA AATGGCGGCA GGAAGCAGCG TCGCCGTGAA TACCCCGTTT
CTGAACTCGA ACAGCACGCT CTATCGGGCA AGATTCAAGT CACCGGACTG GAACGGTTAT
CTGGAGGCCT TCAAGCTTGA CGCCGCCACC GGTGCTATCA TCGGATATCC GAACTCTCCC
GAATGGGAGG CGGGCTCTCG ACTCAATAGT CGCTCATCTG CCCGGGCAAT CTATACCGCC
GGCGTGGATG GCGGCACGTA CAAGCGGCTT GAGTACAGTA CCGCCAACGG GACCAAACTG
GCGGGTGCGG GATTCATGAA TTTTTCGTCC GCAAAGGCCA GCGATCTCAT CGGGTATATC
CGCGGTGATC GCTACGGAAC AACAAACCCC GCTGGGTATC GCAACCGGAC GAGCAAGCTC
GGGGACATCA TCTCATCATC ACCCGTCGTC TACGGCCCTC CCGACGGTGT CTACACCGAT
GCAGCCTACA AAGCATTCAA GAGGGACTGG GCAACCCGAA CATCGCTCAT TCTGGCGGGG
GCCAACGACG GGATGCTCCA CGCCTTTGAC GCCGCGACCG GTGACGAGGC ATGGGCTTTC
ATCCCGAATA TTCTCTTAAG CAAGCTGAAG CTCCTGAGAA ATGATCCCTA TACGCACACA
AGTTACGTCA ATGGCGCGAT CACGGTTGCC GACGCCTATA TCCAGACGAA AAATGCCGAC
GGTACCACTG CCGGAACCGC GGGGTGGCGC TCCATAGCCG TCTGCGGCCT CCGCGACGGC
GGCAAGGGGT ATTTCGCCCT CGATATCACT GATTCGGCCA ATCCGATACC GTTATGGGAG
CTTACCGCCA CATCATCGGC AACACCGAAC GGTCTGGGGT ACTCGTTTGG AACCCCCCTG
ATCCTCAAAC TGAGGGATGA AGCGGCAACC AGCGGATTCC GCTGGGTTGC GGCCCTTGCG
AACGGCTATG AGGGACCCAC AAGCAGCAAG GCCGCTTCGC TCATCATTGC CGACCTCGCC
ACGGGAGCGG TCATCAACGA GATCGTTGTC GACAAGTCGG CATTCAGTGG TGTCTCCCCT
AATGGGCTTG CCTCTCCCGC TGCCATTGAC AGGGATATGG ACGGCTTCGC CGACACCCTC
TATGCCGGGG ATCTCAAAGG GAACATGTGG CGGTTTGACG TGAGCAGCGC CAAGATGGCC
CAGTGGAAGG CTGACTGCAT CTTTTCCGCC GGGGTCACCC AGCCGATAAC CGCCGCTCCT
GACGTGGTGG TGCGCCTCGG GTATCAGTAT GTCTTCTTCG GCACCGGGAA ATACCTTGAC
GAGGGTGACA AGACCACATC ATTCACTCAA AGCTTTTACG GGGTAAAGGA TGATAATTCG
ACGAAAAATC TGACGCAAGC TGATCTGGTC GGTCAGACCA TAACGGAGGT GACGTATTCG
GGAGCCGTGT ACCGGACCCT TTCAAGCTTT ACGGTCGGCA GCAAGGATGG GTGGTATCTT
GATCTCCCCG GTAAAGGCGA ACGAGTTATC GCGGAACCTG AAGCTACCGG GACCAATGAT
GCCGGCAAGG TCAAATTCAC AACTTTTATT CCATCCACCG ATCCCTGTGA GCCGGGAGGT
AAAGGTTGGC GCATGCAGGT CAATATGGAG ACGGGTGGAG AGCCGAAGAA GTCGGTGTTT
GTGGTGCCAG GAAGGCCGGA CGGCACTGTC CCTGTGGGTG ACAGCTCTCG GAGGCCCTCG
GGGATGCTTC TCTCTTCTGG GACTGTGGCA CCACCAGGAG GAACGAATGA TTTGGATATC
ATCCAAAAAA TCGACACAGG TCTTGAACCT AAGCAAGATG AGAGCGGGAC CCACCAGTTC
GGTCTGCGGA GTTGGAGGCA GCTTCTGTCC ATTTAG
 
Protein sequence
MRISNFHARK ILLLAAILCS VGILAAMSFA AISQYPLFLT GSVQPNVMIL LDNSGSMNTI 
MEHRDYSPGT VYSGSFRGGE IYFNRIDYSK SGNPYYLVSR DTGHTVDGNN QNQYTVNGRS
ITLPFPYVDT RWNGNYLNWL FFHATAAQYG SLATDASIRV TRIQTARSVI SDVVKNVSGV
RFGLFKLNNS QGGSKVKDCG ALTPTTVDAA VDGINAETWT PLGETLSEIW QYFRGGDSLY
NSGTYTTPIT NGCQKNYTII VTDGEPTYDN CYQGPFASYG CPKDPYDKEN ENAPSHLADV
AKDMHDGNAS PFSGRVQNVS TYTIGLTLDS ALLDQTAVNG GGKYFTTTSG ISLATAVQNA
LADIVSKMAA GSSVAVNTPF LNSNSTLYRA RFKSPDWNGY LEAFKLDAAT GAIIGYPNSP
EWEAGSRLNS RSSARAIYTA GVDGGTYKRL EYSTANGTKL AGAGFMNFSS AKASDLIGYI
RGDRYGTTNP AGYRNRTSKL GDIISSSPVV YGPPDGVYTD AAYKAFKRDW ATRTSLILAG
ANDGMLHAFD AATGDEAWAF IPNILLSKLK LLRNDPYTHT SYVNGAITVA DAYIQTKNAD
GTTAGTAGWR SIAVCGLRDG GKGYFALDIT DSANPIPLWE LTATSSATPN GLGYSFGTPL
ILKLRDEAAT SGFRWVAALA NGYEGPTSSK AASLIIADLA TGAVINEIVV DKSAFSGVSP
NGLASPAAID RDMDGFADTL YAGDLKGNMW RFDVSSAKMA QWKADCIFSA GVTQPITAAP
DVVVRLGYQY VFFGTGKYLD EGDKTTSFTQ SFYGVKDDNS TKNLTQADLV GQTITEVTYS
GAVYRTLSSF TVGSKDGWYL DLPGKGERVI AEPEATGTND AGKVKFTTFI PSTDPCEPGG
KGWRMQVNME TGGEPKKSVF VVPGRPDGTV PVGDSSRRPS GMLLSSGTVA PPGGTNDLDI
IQKIDTGLEP KQDESGTHQF GLRSWRQLLS I