Gene GSU3023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU3023 
Symbol 
ID2686804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3320350 
End bp3327555 
Gene Length7206 bp 
Protein Length2401 aa 
Translation table11 
GC content60% 
IMG OID637127716 
Productglycosyl transferase, group 1/2 family protein 
Protein accessionNP_954065 
Protein GI39998114 
COG category[M] Cell wall/membrane/envelope biogenesis
[N] Cell motility
[R] General function prediction only
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0438] Glycosyltransferase
[COG1216] Predicted glycosyltransferases
[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCATAA CGCCAATGAA TACCACGCCC GTCATTCTTG TTTGCTACAA TAGGCCGCAC 
CACACGGCGG AGATGCTCAA GGCCCTTGAG GTCCACAATA TCCAGAACCT GATCATTTTC
GCCGATGCAC CCAAATCGGA CAAGGACGTG GAGGGAGTTC GCGCAACAAG GAAACTGCTC
GAAGGCATCC GGTGGACCCA CCCCGAAATC GTATTCCAGA CAGAAAACCA AGGGCTCGCA
AAATCAATCG TTTCGTCTGC GAATTATGCT TTTTCACTAC ATGACCGGCT GGTCCTGCTT
GAAGATGATT GCGTACCCCA ACGCCACTTC TTCGACTTCA TGTCAAACTG CCTGGACAGG
TACGAAGAAA ATGAGAAGAT TTTCGGCATC AGCGGATACA CGGTTCCGAT CCCGTCCAGA
CTGCAGGAGC AACACCCCTA TGACCTCTAT TTCTATCCTC GCATTGGCAG TTGGGGATGG
GGAACGTGGA AACGGGCATG GCAACATTAC GACACCGATC TCGTAAAGCT TTGCCGGAAA
GCTCTGGAAT CAAACATCGA CTTAACGCAA GGAGGTGTTG ATATCCCGGT CAACATTGAG
GGACTTCTTC GCGGGACATT GAAGGATGTG TGGACCCTGA ACTGGGTTCT CACCGTATAT
CTCAACAAGG GCTATTACAT CTACCCCACC AAGTCCCACA TCAACAACAT AGGATTCGAT
GGCACCGGTG TCCACTGCGG AAAATCCGAC CTTTTCCAAA CCATTCTTGC CGATTCACCC
GCAATTCGGT TCCCGTCGGA TGTGGTCCTG AACTATGACC TTATCAGTCA TTACAATATG
TATTTCGGAG GTCCTGTTGT AACTCCGCCC CCAAGGGAAC CTGCCGGCTT GAACGCCGTT
CCCGAGGCTG CATCCAGAAA GGCTCCGCTG AACGTTGCTC TCCTTTCCAC CATGGACTTC
GGTGGTGCCG GCAAAGCAAC CCATCGTCTC CTCAGGGGTC TTCAGGCGTA CGGCAGCGAC
GCACTGATGG CTGTGCTATG TAAAAGCACA GAAGATCCAT CCATAAAGCT GTTGAGCAAT
TCTGCCGGAG GTTTGACGAC TGTCTCAGCC GAAGGTGCAG GACGATGGGA AGAACTCTTC
CGGAAGTGGC GCGGACAGTT GGCAGGCTAT ACAAACCGGC CGGAAGGACT TGAAATTTTT
ACCGACTCCC GTTCCCGGTT CTCCCTGGAG GATATCCCCG AACTCCAGAG AGCCGACATA
CTCAACTTTC ACTGGATGGC CGGACTGCTC AACTATCCGA CGTCTTCCTC GGCCCTGAAG
GGAAAAAAAA TCGTCTGGAC CCTCCACGAC ATGAATCCCT TCACCGGCGG CTGCCACTAT
GCCGGCGAGT GCACCGGATA TCTCCGCTCA TGCGGTACGT GTCCCCAACT CGGTTCCAGT
GACAAAGAAG ATCTTTCACG AAAAATTTGG GAAGACAAAC GGGCCGCGTA CGCCGACCTG
GACCTGACTA TCGTAACCCC GAGTCGCTGG CTGGCTGAAT GCGCCCGCAA CAGCTCGCTC
CTTTCCCGCT TCCCGGTCCA CGTCATACCC AACGGACTCC CCACGGACAT TTTCCGCCCA
CATCCGAAGG ACGAGCTCCG CCGGTCGTTC AACATCCCCG AACACGCCCG GGTGATATTG
TTCGGTGCCG ATTATGATAC CCGTCGCAAA GGTTTCCACT ATCTTGTAGA TGCCCTGAGA
GCTCTTCCCG ACAAGCGCAA CCTGGTGCTC GCGTCATTCG GGCCTCTCCC GGAGACCAAG
TTCAGCAGCG AATTCCTCAC GATGAACTTC GGTTCGATCT CCAATGAGAC TCGCCTAGCC
CAAATCTACA GCCTTGCCGA TCTCTTTGTC CTACCGTCAA TGGAGGACAA CCTCCCGAAC
ACAGTCATTG AGTCGATGGC CTGCGGAATA CCGGTGGTCG GCTTCAAGAT CGGGGGCATG
CCCGACATGA TCGAACACAA GGTGAACGGC TACCTGGCTC AACCAGGGGA TGTCACAGGG
CTCACTGAGG GAATCCGTTG GTGCCTCGCA AATGCATCAG CACTCAAACT TGGCGAACGG
TGCAGGGAGA AGGTTGAGCT GGAGTATTCC CAGCGGGTTC AGGCTGAGAG CTATACCAAC
CTCTATGAGA ACATACTCCT AGGGAAAAGT GCCGTCAAAT CTTTGGCAGC ACCTGCCTGT
TCGGCGGACT CGATCCTCAT CGCCGCCAAC CTGGTCCCCT TCCGGGACGC GGGGCAGCGG
CAACGGCAGG ATACCGGCAT AGCTAGCATC ACCGCCCTTG TAGCGAAGGG CATCATTCCC
CTCAACATCT GCTACCCGGA CGAACTCCTA GAGCCGGCCG ACTGGCAGAC AGCAACAATG
CTCGAGCGCA GCGCGAACGT AGAACTGAAG ATCGACGGCA AGCGCAAGCC GTTCGTCATC
GACCTCTTCG ATATCGCCGC CCAGTGGGCA ACAGCCCACG GAATTACGTG GTTCGCCATC
ACCAACAGCG ATATCGTCCT GACAGACGCG CTCATCGCCG AGTTGCGGCG CCTTCAAGCC
GACGGCATCG AAACGGTTGC CATCTCACGC AACGAGGTGG AACGGGTGGA GGGGGACGGC
AGGCTCGTCC CTGGCTACCT GGAGGTGAAC GGCTACGATA TCTTCCTCTG CAGGAGCTCC
TGGTGGCAGT CGAACCGTCA CCGCTTCCAG CCATATATCT ACGGCGAACG GGCCTGGGAC
GACGCCTACG CGGCCATCAT GGCATGCCAC TCTCGCTTCG CCATGCTCTA CCAGGACGGC
CTCTGCTTCC ACTTCAAGCA CCCGACCAGC TGGATCTCCG GCCCATACTC CGACTACAAC
ATGGGGCTCT ATACCGGCAT CGACAAACCT TACAGCGACC GATACGAGGC GTTCATTAAG
GAGGTGCTTG CCCTGACCAA GGCACAGCTC ACCCCAGCGA AGACCGCCGA ACTGGTGGCG
AAGCATTTTT CACCGCCCCC CCCCGTGCCG GTGAACTCCT CTCAGGGCTT CGTCAACATC
GGCATGATCA CCTACAACCG CCTTGATTTC ACGAAACTCT GCCTGGAGGC GTTCGAGCGG
ACCGTCGACT ATCCTCACCG GCTTACCGTC ATCGACAACA ACAGCCAGGA CGGGACCGTG
GAGTTCCTGC GGAAGCTGCA AGCCCAGGGC GTCATCCACA ACCTGATCCT CTTGAACGAA
AACGTTGGGG TGGCCAAGGC GTCGAACCTC GCCTGGGCAA TGGAGCCCGA TGCCCCGTAC
TACATGAAGC TCGACAATGA CATCGTCTTC CAGAAAATGG GGTGGCTCTC CCGACTGGTG
GAGGTGATTG AAAGGGTGCC GCAGATCGGG GCGGCGGGTT ACAACTTCGA GCCCGTCAGC
TATCCCCTCT ACGAGCTGAA CGGCTGCCAG GTCAGGATCA AGGAACCGGG GAATCTGGGG
GGGGCATGCA TCCTGATCCC GAAGCGGACC GAACGGCTCC TCGGCAACTG GTGCGAGGAT
TATGGCCTTT ACGGTGAGGA GGACGCCGAT TATGGCTTCC GGATCCGCTG CGCCGGTCTT
CTCAACGCCT ACATGGAGGA CGAGGAGATT GGCTTCCACC TTCCTGCGGG CAAAGCGGCG
ACCATCGACA GTGCAACCCT GGTGGCCCTT GACGGGCAGG AGGAGGACCT CCACGCAGAC
TACCGCAAGT GGAAGGACGA ACTGCGTCGC AAAAACGTAC ACGGTCCTTT CAAGCGAAAC
TTGGAGCGTT ATGCTCACGA CCCGACTTCA CTCTTCCAGC AATCGCGCTT TGCCACAGAG
TGGTTGCGGA CTCACCGACC GGACATTGAC GTTTCGCCAC TGAAAACAAC GGGGGGCAAG
CTCACCATCA CCCTGCTTTC CCTCGACCTT CCCTCCCATG CCTGCATGCA GCTCAGGATC
ACCGGCCCCG CAAGCGCCTT CTCTGATGAG GTGGAGCTGC TTCAAGCCGT TACCAATGAC
GGGACAAAGT ATCTCATCAA CTCCGACTCC ATAGACCGGG CCGACCTGAT CATCGTCCAG
CGGTTCTTCC CTCGGCCAGA AACAGAGCGT CATCTGCAGA AGGCCCTGGC GTCGGGCAAA
CCGATCATCT ACGAGTTTGA CGACCTCCTG ACCGACCATT CTCCGGACAA TCCGCACCGG
GAATTGAGCA CCCTCTGTGC TCCTTTCGTT TCCGCACTTC TTGCCAAGGC AGACGGGGTA
ACGGTATCCA CCGACCTTCT CGCCAGTGCT CTTCTCCCAA GAAAGGGAAC AGTCCATGTT
CTGCCGAACC TCCTTGACGA GAAGCTCTGG GCCGCTCCGC CGGCGTCACG CCCGACCGGC
GCTCCGGTAA TTATTGGCTA TGCCGGTACA CCAGGGCATG AGGCGGACCT GGCGCCGATC
GAGGAGGCGC TGGAGCGCAT CGCCCGAATG TACGGACACC GGGTAGCGTT CCGCTTTTTC
GGCTGCGCCA CCGAGCGTAT CAGGAAACTT CCTGGCTATA CCTTCATACC CTTCACAGGC
AATTACTCTG AATACGCAGC CACCTTGCAA AATTCCGGCA TCGACATCGG CCTCGTCCCC
CTGGAGGACA ACCGCTTCAA CCGCTGCAAG AGCAACATCA AGTGGCTCGA ATACTCGGCC
TGCGGCATAG CCGGCATCTA CGCCGACCTC CCCCCCTACC GCTCGTGCGT GAAGGAAGGG
GAAACGGGGC TCCTGATAGC TGGCTACGAC GTGGACGCCT GGGTGGCGGC CATCGAAAGC
CTCATCGACA ACCCGGCCCG CCGCCATGCC ATGGCCCTGG CGGCCCGCAC CGAGGTCCTC
GCCAACTACA CCCTCAAGAG CCGCGGCCAC CTTTTCCTCG ACACTTGGCG CCGGATCGCC
GGCCGTGCCG ATACCACAGC CAAGGAGCAG CAGATGCCCA TCTCACCGCA ACCGTTCGCG
CCGGTCGCCG CAGCCACTGG CTCAGACGCC CCGAAGGTAT CCATCATCGT CCCCCTCTAC
AACAAGGCGG AGTACACCAA GCAGTGCCTG GAGGCCCTGG CCCTCAATAC GGAGCAGGCC
CTGAACTACG AGGTCATCCT CGTGGACAAC GCTTCGAGCG ACGGCACCGC CGAGTACCTG
CGCACCCTTT CGGGGGACGT GACCATCGTG ACCAACCTGA AGAACCTGGG CTTTGCCAAG
GCGTGCAACC AAGGGGGGCG GATCGCCCGG GGGCGGTACC TGGTTTTCCT GAACAACGAC
ACCATCCCCC ATCCGGGGTG GCTCGACGGG CTCATCAAGG GCGCGGAGCA GGACGGCGCC
GACATCGTGG GGGCCAGGCT CCTCTACCCC AACGGCCGGG TCCAGCACGC CGGGGTGGCC
TTCAACGAGC AGTCCATCGG CTACCACATC TTCAACGGCT TCCCGGCAGA CTCGCCGGCC
GTCAACCGCA AGCGGTTCAT GCAGTGCGTG ACCGCCGCCT GCATGCTGGT GAAACAGGAG
CTCTTCGCGG AGCTCGGCGG CTTTGACGAG GGGTACGTGA ACGGCTTCGA GGATGTGGAT
TTCTGCCTCC GGGCCGGGGA GCGGGGCCGC CGCATCCTCT ACACCCCCGA AAGCGTTTTG
ATCCACTTCG AGGAGACCAG CGAGGGTCGC AAGGACCACG ACACCCCCAA CATCCGCCGC
TTCCTGGCCC GCTGGGAAGG GAAGGTCCGC TGCGATCATC AGGATATCTA CCGTTCCGAG
GGGTACCGGG CCGAACGGCA GGCCGACGGC AGGCTGCGCA TCTACCAGGC AGACGTGGCG
CCCGTGTCGT CAGCTCCGAC GGCTCCGCAG CAGGTCACGC CGACACCGGG CACCGGGGCT
GCGGCGGCAA CGCCGTCCGT TTCGGGGCGG GAAAAGGCCC TTGCCCTGAA GGCGGAAGGA
CGGTACGTGG AGGCCATCGA GCATCTGGTC AAAATTGTGA CAGCGGGTGA CAACTCCGTG
CTCGTCGATC TCGGCGACTG CCTGGCGAGC CTGGAGAAAT ACGACGACGC CCTGGCCCTT
TACGAGGAAA GCCTTGCCCT GTGCCCCACC AACGGGCGGG CGCTGGTGGG GGTCGGTGTT
GTCAGATATA TGACACGACG GATCGCCGAG GCGGCCGACG CCTTCAGCCG GGCACTGGAA
ACCGACCCTG CCGACCCGAA GGCCCTTTGC GGCCTGGGCA TGGCCCGCTG CGCCCAGGGA
CGGAACGCGG AAGGGTTCGA GCTCTACGGC CGGGCGCTTG AGGCCGAGCC GGAGAACCTG
ACCGCAGTGC ACGAATCGGT GAGGCTTGCC TATGAGCTGG GACGCTTCAG CGAGGCGGCC
AAGCGACTTG AGTCATACCT GCGCCATCAT CCGGGCGACA TCGACATCCT CTTTGCCAGT
GCCGGACTCC TTCACATGGC CGGCAGGAAC GCCGAGGCCC GTGACGCCCT GGAGCGGCTG
CTGGTGTTCT CCCCCGATTA CAGCGGGGCC ATGGAGTTGC TGGCGAAGCT GGAGGAGCAG
GACCAAGAGC CGGGTGAAAG AGCCACGGAA GCTGAAGCCC GCAGGCTCAA GGAAGACGGG
AAGTACGAGG AGGCCCTGAC GGCCTTCTCC CGGGTCGCAG AGGCCGGCGA TTCATCGGCC
CTGGCCGACA TGGGGGACTG CCTTGCCCAG CTGGGACGGC TCGACGAGGC GGCCGCCCGT
TACCTGGAAG CCCTGGATGC CGACGGAGCA AACCTCAAAG CCCTGGTGGG GCTCGGAGTG
GTATCGCTGG TCCAGGGGAA ACAGGTGAAG GCGGTCACTT GGTTCAACAG GGCCCTCAAG
GCGGACCCCG CCAACGCAAA GGCTCTCTGC GGGCTCGGGA TGGTCCGGAA CATGCAAAAC
AAGCATGACG AGGCGTTCAG CCTCCTTGCC CGGGCCGTTG ATGCGGACCC CGAGGGCCTC
ACGGCCCTTC ACGAACTGAT CCGGCTCTCC TATGCCACCG GCCGGTTCGA TGAAGCGGGA
GAACGGCTCG ACCGGTACCT GATGCACCAC CCCGCAGACC TGGACATGGT CTTCGCCCAG
GCAGGCATCC GCTTCAAGGC GGGCCGCTAT GCCGAGGCCC TGTCGAGCAT CGAGACGGTG
CTCCTCTTTG CCTCCGACTA CGAAGGGGGG CTGGAATTGC GGGAAGCGAT CACCCAAGCC
ATGTAG
 
Protein sequence
MSITPMNTTP VILVCYNRPH HTAEMLKALE VHNIQNLIIF ADAPKSDKDV EGVRATRKLL 
EGIRWTHPEI VFQTENQGLA KSIVSSANYA FSLHDRLVLL EDDCVPQRHF FDFMSNCLDR
YEENEKIFGI SGYTVPIPSR LQEQHPYDLY FYPRIGSWGW GTWKRAWQHY DTDLVKLCRK
ALESNIDLTQ GGVDIPVNIE GLLRGTLKDV WTLNWVLTVY LNKGYYIYPT KSHINNIGFD
GTGVHCGKSD LFQTILADSP AIRFPSDVVL NYDLISHYNM YFGGPVVTPP PREPAGLNAV
PEAASRKAPL NVALLSTMDF GGAGKATHRL LRGLQAYGSD ALMAVLCKST EDPSIKLLSN
SAGGLTTVSA EGAGRWEELF RKWRGQLAGY TNRPEGLEIF TDSRSRFSLE DIPELQRADI
LNFHWMAGLL NYPTSSSALK GKKIVWTLHD MNPFTGGCHY AGECTGYLRS CGTCPQLGSS
DKEDLSRKIW EDKRAAYADL DLTIVTPSRW LAECARNSSL LSRFPVHVIP NGLPTDIFRP
HPKDELRRSF NIPEHARVIL FGADYDTRRK GFHYLVDALR ALPDKRNLVL ASFGPLPETK
FSSEFLTMNF GSISNETRLA QIYSLADLFV LPSMEDNLPN TVIESMACGI PVVGFKIGGM
PDMIEHKVNG YLAQPGDVTG LTEGIRWCLA NASALKLGER CREKVELEYS QRVQAESYTN
LYENILLGKS AVKSLAAPAC SADSILIAAN LVPFRDAGQR QRQDTGIASI TALVAKGIIP
LNICYPDELL EPADWQTATM LERSANVELK IDGKRKPFVI DLFDIAAQWA TAHGITWFAI
TNSDIVLTDA LIAELRRLQA DGIETVAISR NEVERVEGDG RLVPGYLEVN GYDIFLCRSS
WWQSNRHRFQ PYIYGERAWD DAYAAIMACH SRFAMLYQDG LCFHFKHPTS WISGPYSDYN
MGLYTGIDKP YSDRYEAFIK EVLALTKAQL TPAKTAELVA KHFSPPPPVP VNSSQGFVNI
GMITYNRLDF TKLCLEAFER TVDYPHRLTV IDNNSQDGTV EFLRKLQAQG VIHNLILLNE
NVGVAKASNL AWAMEPDAPY YMKLDNDIVF QKMGWLSRLV EVIERVPQIG AAGYNFEPVS
YPLYELNGCQ VRIKEPGNLG GACILIPKRT ERLLGNWCED YGLYGEEDAD YGFRIRCAGL
LNAYMEDEEI GFHLPAGKAA TIDSATLVAL DGQEEDLHAD YRKWKDELRR KNVHGPFKRN
LERYAHDPTS LFQQSRFATE WLRTHRPDID VSPLKTTGGK LTITLLSLDL PSHACMQLRI
TGPASAFSDE VELLQAVTND GTKYLINSDS IDRADLIIVQ RFFPRPETER HLQKALASGK
PIIYEFDDLL TDHSPDNPHR ELSTLCAPFV SALLAKADGV TVSTDLLASA LLPRKGTVHV
LPNLLDEKLW AAPPASRPTG APVIIGYAGT PGHEADLAPI EEALERIARM YGHRVAFRFF
GCATERIRKL PGYTFIPFTG NYSEYAATLQ NSGIDIGLVP LEDNRFNRCK SNIKWLEYSA
CGIAGIYADL PPYRSCVKEG ETGLLIAGYD VDAWVAAIES LIDNPARRHA MALAARTEVL
ANYTLKSRGH LFLDTWRRIA GRADTTAKEQ QMPISPQPFA PVAAATGSDA PKVSIIVPLY
NKAEYTKQCL EALALNTEQA LNYEVILVDN ASSDGTAEYL RTLSGDVTIV TNLKNLGFAK
ACNQGGRIAR GRYLVFLNND TIPHPGWLDG LIKGAEQDGA DIVGARLLYP NGRVQHAGVA
FNEQSIGYHI FNGFPADSPA VNRKRFMQCV TAACMLVKQE LFAELGGFDE GYVNGFEDVD
FCLRAGERGR RILYTPESVL IHFEETSEGR KDHDTPNIRR FLARWEGKVR CDHQDIYRSE
GYRAERQADG RLRIYQADVA PVSSAPTAPQ QVTPTPGTGA AAATPSVSGR EKALALKAEG
RYVEAIEHLV KIVTAGDNSV LVDLGDCLAS LEKYDDALAL YEESLALCPT NGRALVGVGV
VRYMTRRIAE AADAFSRALE TDPADPKALC GLGMARCAQG RNAEGFELYG RALEAEPENL
TAVHESVRLA YELGRFSEAA KRLESYLRHH PGDIDILFAS AGLLHMAGRN AEARDALERL
LVFSPDYSGA MELLAKLEEQ DQEPGERATE AEARRLKEDG KYEEALTAFS RVAEAGDSSA
LADMGDCLAQ LGRLDEAAAR YLEALDADGA NLKALVGLGV VSLVQGKQVK AVTWFNRALK
ADPANAKALC GLGMVRNMQN KHDEAFSLLA RAVDADPEGL TALHELIRLS YATGRFDEAG
ERLDRYLMHH PADLDMVFAQ AGIRFKAGRY AEALSSIETV LLFASDYEGG LELREAITQA
M