Gene GSU2038 details

Gene Information

Locus tag: GSU2038
Symbol:
ID: 2688005
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 2228907
End bp: 2234594
Gene Length: 5688 bp
Protein Length: 1895 aa
Translation table: 11
GC content: 59%
IMG OID: 637126729
Product: hypothetical protein
Protein accession: NP_953087
Protein GI: 39997136
COG category: [N] Cell motility
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3419] Tfp pilus assembly protein, tip-associated adhesin PilY1
TIGRFAM ID:
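
The coordinates and lengths in this record are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon not counted as a residue. Both are assumptions about the record's conventions rather than something it states, and the variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 2228907
end_bp = 2234594
gene_length_bp = 5688
protein_length_aa = 1895

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted as a residue.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(gene_length_bp % 3 == 0)                   # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the 5688 bp span corresponds to 1896 codons, that is, 1895 residues plus the stop codon.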


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAGACAGT CGTCTGTGCT CGGTTGCGCG AGGCCGGCCG GCCTGGCTGC GCTGTTGACT 
CTTCTGTTGG CTGCGGGAGG CGATGGCGCT GCAGCCGCAA CCATGAACGA TTACTGCATT
CAGCCCCCCT TTGTGAGCCA AAGCGTTCCC CCTCTCGTCA TGTTCGAGGT GGGCAGGGAG
CACAAGCTCT ATTACGAGGC CTACAACGAC GCCAATGACC TGGATGACGA CGGGCGCCTC
GACACCACCT ACAAGCATTC CATCGATTAC TACGGCTATT TCGATCCCTA CAAGTGCTAC
ACCCACTCGG GCGGGTCAGG CTCCAACGAC AAGTACACCC CCGTCTCCAC GACGGCCGAC
AAGTTCTGCT CCTCGGGCCA GTGGAGCGGC AACATCCTCA ACTGGCTGAC CATGTCCCGG
ATGGATGTGC TCAAAAAAGT GCTGTTCGGC GGCCAGCGCT CGGCTGACAG CAATACCGCC
ACCTATCTCG AGCGTGTCTA CGTTCCCCAG GATGCCCACA GCTGGGGCAA GGAAGTCACT
GGCAGGCTCT GCTCAAACGG CACCAACTAT ACGGATATGT GCCAGTTCGA TTCTGACTGC
GATACGGGCT ATACCTGCGT TGACAAGTCG GTCAACCTGA TCGGTATCAC CGCCTCAGAT
ACCGGTACGG CCTGTTCCTT TACGTCCAGC ATCAAATGGG ATACAACCGG TAAAATCCTC
GTTGCCAAAT ATACCCACTC CAACTTCTCG TGCGGGAGTG ATTCCACCGA TCTCATCAGT
TCATATGAGC CAGCCAACCT GGTTGCCGGC TTCCCGGTCT ATGTCGCCAC CTTTGGTGAT
GCCATTCTCA ACCCCGCTGC GGATCATGGC GATCAGTTCA ATTATCTGGC ACTGGCGGAA
TTCAGTGTTT CCAAGTCGGA CAAGGGGAAC TGGATGTTTG CCATTGATGG CGACGATGGC
GTCGAACTCG AAATCATCAA CCCTGCCGGC GACGCCAGCA CCATAGTGGC ATCCCGTTAC
GGCTGCAATT CGGCCTGCAA CTGTCAGACG AACTCAGGCA CCATCAATCT CAACACTACC
GGTTATTGGC GGCTTATCGC CCGCCACTCC GAGAAGTCTG GCCAGGACGG CGTCAAGGTG
TGGTACAAGA AGCCCAGTAA AACCCAATCG AGCGATCCGT GGGTGCTTTT CGGCAGCTCA
ACCCTCACGC TGCGGGCTCC GACCATTCCC GCCGGGGCCG AGTGCACGCT CAAGGACCGC
TCCTTCATTG AGACCGGCAA ACCAAAAGTC GGGACTACTC CGAAGCAGCA TCTTTTCTGC
AGCACGACTC TCTCCGACGG TGGTACCCCG ATCCTGCGCT TCCTTGGGAA TAAGGAGAAC
CGGATCTGGG AGTGGGTTTC CAAGGAACGC CCCGTTTGCG ATTCGTCCCT CGGTGCCCCC
ACCGACTACA CGGTCCAGGT GGAAGTCTGC AAGTCCGTCT CGCCCGACAA TCGGCCGACC
GGCAAGAAGG ATGACCTGGC AAGCGGCCGG GAGACGAACT GCAAGGACTA CGCGGGCACC
TTCAAGCCGG TGGGGCTTCT GCAGAAGTTC GGCGAGGGGG AGGGGGCCAA GGTCTGTTCC
CGTACACTGG CCAAAAGCTG TACCAGCGAC AGTGACTGCG GCGCGGGCGA GGGACTTTGC
ATCTATAAGT CCCCCATGTA TTTCGGCATG TTCACCGACT CCTATACCAA GAACCTGAGC
GGCGGCGTCC TGCGCAAGAA CATCGGCAGC ATCCTGGACG AGACCAACGC CAACAACGGC
ATCTTCCAGA CCTCAGAGAA TGTCCAGGGC AACATCATGA TCACCCTGGA CCGGCTCAAG
ACCATCGGCT TCCGGTACAC GGACCAGTCC TACCAGGACG CAAGCGGTGG TTCCTGCGGC
TGGATCACCG ACCGGCCCCT GAACGAGGGG GAATGCCGCA TGTGGGGCAA CCCCATTGCC
GAAATGATGT ACGAGAGCCT CCGCTACTTT GCCGGCAAAG GGGCGCCTAC AACCGAGTTC
ACCTATACGA CGCCTGCGGA TTCGGGGCTT TCGCTGTCCA AGCCCAGTTG GGGGTACAGC
AAGGGGAGCA CAACGTATCA GCTTTATGAC ATCTATCCCC CGTGCGCCAA GCCGTTCATG
CTGATTCTGA GCGACATCAA CCCCAGTTAC GACTCGGATC AGATTCCCGG CAGCTCCTTC
AAAAAGACTG ACGGTACATA CTTCAGCGAA GATGCCGCAT CGCCCCAACT CGGCCTCGGC
GTTGCGGGCG CCGATGGCGT GTCGCTGCTC AACAAGCTGG CCGACACCAT CGGCACCTCC
GAGGGAATCA TCGGCGATAG CTGGTTTGTC GGCGAAAACG GATCAACCAC GGATTTTGTC
TGCAGCTCCA AAAGCGTTAC CAAGCTGAGC CTGTTACGCG GCATGTGCCC CGAGGAGCCC
ACCAAGCGGG GATCGTTCTA CTCGGCAGCC CTGGCTTATT ACGGCCTGAC CTTGATGAAG
GAGAAGACCG GCAAGCCGGA CGTTTCAACT TTTGTCGTTG CCCTGTCATC CCCGGTGGCC
GATCTCAAGA TCAAAGCCGG CAACAGTCAC GTGTCGATCC TGCCGGTTGG TAAATCGGTC
AGCGGAAGCC ACGGTATCAA TGCCTCGTGC GCCCAGAAGT GCACCCTCAC GGCGGACGAA
GACGGCCTGC ACATTTCCAA TTGCTCGTCA ACGGCGTACT GCCCCTCCAA CCAGATTGTG
GATTTCTATA TCGACTCTCT CAAGTATGAC AACGACAAAA ACGTCATCTA CGCCAAGTAC
CGTATCAACT ACGAAGACGT GGAGCAGGGC GCGGACCACG ATATGGACGC CATCGTCACC
TATGAGGTCT GTACCCAATC CGCCATCGAC CAAGGGCTTG GCGCCTGCAG CGGCTCACTG
GGCTCCAACA TCCAGATCAA GCTCAACTCC GACTATGCGG CCGGCAGCAT CGACCAAGTG
ATGGGCTTCG TCATCTCCGG CACCACTGAA GACGGCGTCT ATCTCCCCGT TCGTGATCGC
GACGTATCGT CTGCGGACAG CGACACCCCC GCCACCGTTG CGGGTCTTCC CCTCAACTGG
TCGAAGACGT TCACCATTTC GGGCAACCCG ACCGGCACGC TCAAGAGTCC TCTCTGGTAT
GCCGCCAAAT GGGGCGGGTT CATCGATGCC AATAACAACA AGAAGCCCGA TCTCGCCAGT
GAATGGGACA AGGACGGCGA CGGCGAACCC GACAACTATT TCCTGGTGGT AAACCCCCTG
AAACTGGAGC AGCAGCTCCA GAAGGCCCTC ACTGATATTC TTAACCGCGT TTCATCCGGT
ACGGCCGCCT CCATCCTGAG CAACAACGAC AACAACGGCG CCACGCTGCT CCAGGCCATA
TTCTACCCCC GCAAGAACTT TGCGGAGACG GAATTGGCCT GGACCGGCGA ACTCCAGGCG
TTCTGGTACT ACATCGATCC GTTCCTCAAC ACCAATAGCA TTCGCGAGGA CACCGACCAG
GACCTCAGGC TGAAGCTCAA GACCGACTAT GTCCTCGATT TCCGGTTCGA TACCAATGAC
AACAAGACCA AGATCGACCG CAGCCTTGAC GTCGACGGCA ACGGCAGCGG CGACAGTTAT
GTGAACACCA TCGAACCCGA GCAGGTAAAT GCCCTCTGGA AAGCCGGCAG CCTGCTCTGG
TCACGCAATC TCTCCACGTC TCCCAGAACC ATTTACACCA GCTACCGGGA TGCCGCCAGC
AAGGACCAGC TCACGGTATT CACCACCGCC GGGAAGGATC TGTTCAAGGC AAATCTCCAG
GCCGCAGACG CCACTGAAGA GGATAAGATC ATCAACTACA TCCGTGGCAC CGAGCAGAGC
GGCTACCGCA ACCGGACCGT CACCATCGGC GGTTCCACCG GGGTCTGGCG CCTCGGCGAC
ATCATATCCT CCACTCCGCG CCTGCAGTCC AATGCCCGGC TCAACGGCTA TCACCTGCCC
CCGCCGGTTG GTTACAAAGA TTCAAGCTAT CAGCGCTACT TGGATTCCAA TGAGTACAAG
ACCCGTGGCA TGGGCTATGT GGGGGCAAAT GACGGCATGC TTCACGCCTT CAACCTGGGC
GTTCTGAAGG CCGGCACCAC GAAAGACGTA ACCTCGTTCA TCACCGGCAG TGACTTCGGC
AAGGAGATGT GGACCTATAT CCCGCGCAAT GCATTACCGT ACCTGAAATA CCTCGCAGAT
CCCGAGTACG ATCACCTCTA CTACGTGGAT GCCTCCCCGA GCCTTAATGA TGTATCGATC
GAGGTGACCG AGGGGACTGG CTGCACTGAC GCAGCATACT GGCTCTGCAC AAAGCAAACG
GTCTACCAGG CCGGCACTGA TAGTACCACC AAAGAACTGG ATCTGGACAA AACGAGTTGG
CGCACGGTTT TGCTCGGGGC CATGGGCTTG GGAGGAGCTT CGCGCAACAC CACCGATGCC
TGTTCCGCCA GCACCGACTG CGTGAAGACC CCCATCGCCA ATGTGGGCTA CTCATCCTAC
TTCGCCCTGG ACGTGACCAC GCCTACCTCG CCATCCCTCA TGTGGGAATT CGCGTCTGCC
GACCTCGGTT ACTCCACCGT GGGGCCGGCC ATCGTGCGGA TCGGAGGGGA AACTAACGGC
CGCTGGTTCG CGGTGCTGGC CTCCGGTCCC ACCGGTCCCA TCAATACCCA GACCCATCAG
TTCCTGGGGC GATCCACCCA GACCCTGAAG CTGTTCATTC TCGACCTCAA GACCGGTGCA
CTCCTCCGGA CCATCGATAC CGGCATCCAG AATGCCTTTG CCGGTTCCCT TTCCGGCGGG
ACACTGGATA CCGACCGGAG TGCAGGTACA ACAGGAAAAT ATAACGACGA TGCCGTCTAT
CTCGGTTACG TCCGCAAGGA TACCACGACC GGAACCTGGA CCAAGGGAGG CGTCCTGAGG
CTCTTCACCA AGGAAAATAT CGATCCGGCC CAGTGGTGGT GGGCAACTCT GGTCGATGAT
ATCGGCCCCG TGACGAGTGC CGTGGCCCAG TTGCAGGATA CCACTCACAA GAATCACTGG
CTCTTCTTCG GCTCCGGCCG CTACTACTAC AAGGCCGGCA GCGATCTGGA CGACGCTGCG
GGCCGACGGG CACTGTACGG CATCAAGGAC CCCTGCTATG ACCTGAACAA CAAGATGAAG
ACAACCTGCA ACACACCGAC CGTTTTGGCG ACCGACCTGG TGAACCAGAC CGACAGCATT
CAGGGCATGG GCACTGCCCC AGGCTGGTAC GTCCTGCTCG ATGAGGCGAG TGGCAGCGCC
GGAGCCGAAA GGGTCATCAC CGACCCCGTG GCGGCCCCCA ACGGCGCCAT CTTCTTCACC
TCGTTCAAGC CGGCGGCCGA TGTCTGCAAA TTCGGCGGCG ACCTGGCCCT CTGGGGGGTC
AACTACAGCA CCGGCGGTTA CCTGGCTCCA TCACAGCTCA TCGGTGAGGC GATCATCCAG
TCATCCACGG GAAGCTTTGA ACAGATCGAT CTCGGCAGCT CGTTCACCCA GAGGCTGAAC
CGGAAGACGG CCGAGCGGCA AGGGGTGCCG CCGCGCAACA AGCCGACCAT CGTCACCAAT
GCCAACATCA AGCCTCAGAA GCGTATCATC CACATCAGGG AGAAGTGA
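
The gene sequence is laid out in space-separated blocks of ten bases, so it needs cleaning before any analysis. As a minimal sketch, the helpers below (hypothetical names, not part of any database tooling) strip the whitespace and compute a GC percentage that can be compared with the GC content reported in the record. Only the first line of the sequence is used as a sample here; the full sequence would be passed in the same way.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used as a short sample.
sample = "ATGAGACAGT CGTCTGTGCT CGGTTGCGCG AGGCCGGCCG GCCTGGCTGC GCTGTTGACT"
seq = clean_sequence(sample)
print(len(seq), round(gc_percent(seq), 1))
```

A single 60-base line is not expected to match the whole-gene figure; the comparison is only meaningful on the complete sequence.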
 
Protein sequence
MRQSSVLGCA RPAGLAALLT LLLAAGGDGA AAATMNDYCI QPPFVSQSVP PLVMFEVGRE 
HKLYYEAYND ANDLDDDGRL DTTYKHSIDY YGYFDPYKCY THSGGSGSND KYTPVSTTAD
KFCSSGQWSG NILNWLTMSR MDVLKKVLFG GQRSADSNTA TYLERVYVPQ DAHSWGKEVT
GRLCSNGTNY TDMCQFDSDC DTGYTCVDKS VNLIGITASD TGTACSFTSS IKWDTTGKIL
VAKYTHSNFS CGSDSTDLIS SYEPANLVAG FPVYVATFGD AILNPAADHG DQFNYLALAE
FSVSKSDKGN WMFAIDGDDG VELEIINPAG DASTIVASRY GCNSACNCQT NSGTINLNTT
GYWRLIARHS EKSGQDGVKV WYKKPSKTQS SDPWVLFGSS TLTLRAPTIP AGAECTLKDR
SFIETGKPKV GTTPKQHLFC STTLSDGGTP ILRFLGNKEN RIWEWVSKER PVCDSSLGAP
TDYTVQVEVC KSVSPDNRPT GKKDDLASGR ETNCKDYAGT FKPVGLLQKF GEGEGAKVCS
RTLAKSCTSD SDCGAGEGLC IYKSPMYFGM FTDSYTKNLS GGVLRKNIGS ILDETNANNG
IFQTSENVQG NIMITLDRLK TIGFRYTDQS YQDASGGSCG WITDRPLNEG ECRMWGNPIA
EMMYESLRYF AGKGAPTTEF TYTTPADSGL SLSKPSWGYS KGSTTYQLYD IYPPCAKPFM
LILSDINPSY DSDQIPGSSF KKTDGTYFSE DAASPQLGLG VAGADGVSLL NKLADTIGTS
EGIIGDSWFV GENGSTTDFV CSSKSVTKLS LLRGMCPEEP TKRGSFYSAA LAYYGLTLMK
EKTGKPDVST FVVALSSPVA DLKIKAGNSH VSILPVGKSV SGSHGINASC AQKCTLTADE
DGLHISNCSS TAYCPSNQIV DFYIDSLKYD NDKNVIYAKY RINYEDVEQG ADHDMDAIVT
YEVCTQSAID QGLGACSGSL GSNIQIKLNS DYAAGSIDQV MGFVISGTTE DGVYLPVRDR
DVSSADSDTP ATVAGLPLNW SKTFTISGNP TGTLKSPLWY AAKWGGFIDA NNNKKPDLAS
EWDKDGDGEP DNYFLVVNPL KLEQQLQKAL TDILNRVSSG TAASILSNND NNGATLLQAI
FYPRKNFAET ELAWTGELQA FWYYIDPFLN TNSIREDTDQ DLRLKLKTDY VLDFRFDTND
NKTKIDRSLD VDGNGSGDSY VNTIEPEQVN ALWKAGSLLW SRNLSTSPRT IYTSYRDAAS
KDQLTVFTTA GKDLFKANLQ AADATEEDKI INYIRGTEQS GYRNRTVTIG GSTGVWRLGD
IISSTPRLQS NARLNGYHLP PPVGYKDSSY QRYLDSNEYK TRGMGYVGAN DGMLHAFNLG
VLKAGTTKDV TSFITGSDFG KEMWTYIPRN ALPYLKYLAD PEYDHLYYVD ASPSLNDVSI
EVTEGTGCTD AAYWLCTKQT VYQAGTDSTT KELDLDKTSW RTVLLGAMGL GGASRNTTDA
CSASTDCVKT PIANVGYSSY FALDVTTPTS PSLMWEFASA DLGYSTVGPA IVRIGGETNG
RWFAVLASGP TGPINTQTHQ FLGRSTQTLK LFILDLKTGA LLRTIDTGIQ NAFAGSLSGG
TLDTDRSAGT TGKYNDDAVY LGYVRKDTTT GTWTKGGVLR LFTKENIDPA QWWWATLVDD
IGPVTSAVAQ LQDTTHKNHW LFFGSGRYYY KAGSDLDDAA GRRALYGIKD PCYDLNNKMK
TTCNTPTVLA TDLVNQTDSI QGMGTAPGWY VLLDEASGSA GAERVITDPV AAPNGAIFFT
SFKPAADVCK FGGDLALWGV NYSTGGYLAP SQLIGEAIIQ SSTGSFEQID LGSSFTQRLN
RKTAERQGVP PRNKPTIVTN ANIKPQKRII HIREK
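
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below is a minimal, dependency-free translator. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which start codons are permitted, which this sketch ignores. The names `BASES`, `AMINO_ACIDS` and `translate` are illustrative, and ambiguous bases such as N are not handled.

```python
BASES = "TCAG"
# Amino acids for the 64 codons, ordered by first, second, then third base
# in T, C, A, G order. "*" marks a stop codon.
# Assumption: table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGAGACAGT CGTCTGTGCT CGGTTGCGCG AGGCCGGCCG GCCTGGCTGC GCTGTTGACT"
print(translate(first_line))  # MRQSSVLGCARPAGLAALLT
```

The result matches the first 20 residues of the protein sequence listed in this record. Passing the full gene sequence to `translate` would allow the same comparison for all 1895 residues.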