Gene GSU0987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0987 
Symbol 
ID2687508 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1060383 
End bp1063742 
Gene Length3360 bp 
Protein Length1119 aa 
Translation table11 
GC content69% 
IMG OID637125657 
Producthypothetical protein 
Protein accessionNP_952041 
Protein GI39996090 
COG category 
COG ID 
TIGRFAM ID[TIGR02243] conserved hypothetical protein, phage tail-like region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.365158 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGCCG ATCTCCCCTT GATAGACGGC CGTACGGCCG CGGATGTGGT CGAGAAGATT 
CGCGCCACCG CGCCATTCTA TGTGCCCGAG TGGAGCGGCG GTGTCGACGG CGATGCGGGA
AGTGCCCTGA CCGGCGTCTT CGGCGACATG GTGGTGGAGG TGCTCCAGCG GCTCAACCGT
GTGCCCGAGC GCCATCTGGC CGCGTTTCTG GAAGTGCTGG GGTTGCGGCT CCTGCCGCCC
CGGCCGGCCG AAACCGTGGT GGTCTTCACC CTGGCCAAGG ATGCGGACCG AAGCGCCCTC
GTCGCGTCCG GCACCCAGGT CATGGCGGAA AAGACTACGG ACCACCCGGA ACTCCTCTTC
GAAACCGACG AGAACGTGCT GGCCGTCCCG AGCAGGATCA CGGCCCTCTA CAGCACCATC
CCCGATACGG CGGGCGGGAG GGAGAAGGTC TTCGGCCACA CGGAGGCGTG GACCGCGGGA
AGCTCCTTTA CGATCTTCCA CGGCACCGAG AACCTCCAGG AGCATGCCCT CTATCTGCGG
CAGACCGGGC TCTTCACCGT GCGGAGCGGC GTGGAGATCC ACCTGTCCGG CGTGCCGCGG
GCGCTGGCCG ACCAGGTTGC CTGGAGCTAC ACCGATGCCG ACGGCCGGGA GACGCTGTTC
GGGGCGAGAT TCGATGGTGC CGGCGGCACG CTCATTCTCT CCACCGGGGC AGGTCGGCCG
GAACTGGGCC GGTGCGTGGT GAACGGCATC GACGGCATTT GGCTCAAGGG GGTGCCGAAG
GTAGGTGCCG ACGGCACGAC TGCCCTGGCG CGGGTCAAGG ACGCCGTGCT CCGCACCGTG
GTGGGGGTCG GCACCAGGTC GCTTCCCACC GCGGTGATTC ACCCTGACGT GGCTTTTGCC
GGCGATGTTC CCCAGGACCT CACCGTGACT GCCGGTGGCG ATTTCATCCG GAACCTCCTC
CCTTTCGGCG ACAAGCCCAT GCCACTGGCC GCCTTCCACC TGGCAAGCCG GGAGGTCTTC
TCCAAGAGGG GGGCCAGGAT TACCCTCCGC ATTACCTGCG CCACGGACCT TGACGTGCCC
ATCGAGCGGG TCCAGGGGAT CGGCGCCGTC TTCTCGGCCC GGCTCAGGGG AGCGGGCATT
GCCACGGCCG GTGAACTCCT GTCCCGGTCC GACGCCCAGG TTGCCGGGAT AATCCGGGCT
CCCGGCAGCG CTCGCCCGGC TTCCTCCTAC CTGCTCCGGG CCAGGAATAT CCGCGAAGCC
ACGGCCAAGG CCTATTACGA CAAGACCGGA GCGGTCAGCT ACCGGTCAGG CCGTGCTGCC
GCGCTGGGGC CGGGCCTGTC ATGGGAGTAC TGGAACGGGA CCGGCTGGTG TGCCATCAGC
GACGTGACCG ACAGCACCGG CGGCCTCCTG GCCACCGGCG AGGTGACCTT CACCTGCCCC
GCCGACATGG CGGAGGTGGA GGTGAGCGGC CAGCGGAACT GGTGGATCCG GGTCCGGCTC
TCTTCCGGCG ATTATGGCCG GGAGTTCGAG ATCGTAAACG GCCAGGCCGT GGCGGCCGGC
TTCACCCCGC CGCGCCTGGG GCGCATCACC CTCTCCTGGG ACGGCACCGA TCCCTCGGCC
ATGGGTGAGC CCGATTCTAT CCTGACGCGC AATAACCTGG AGTGGGACGA CGGTCTTGCC
GTGCTGCGCA CGGGCGCGGT CTTCCGGCCG TTCCGCCCAC TGGCCGACCT GCGCCCCGCC
TTCTACGCCG GCTTCGACCG CCCCCTGGCC AAGGGGCCCA TCGGCCTCTA CCTGGATCTG
GCCGAAATCG ATTATGCCCG CGACTTCAGG CCCAGGGTGC AGTGGGAAGT CTATGACGCC
GGAGCGTGCG AGTGGTTGAG GCTCGATGCG GAGGACGGCA CAGCCGGCCT CACCCGTGCC
GGCATCGTCC GTGTCGTGGT GCCCGAGGGG GGCGTTCCGG CACCGCTCTT CGGCACGCGG
CTTCACTGGC TCCGGGCGGT GCTCGTGGCA GGGGAGTACG CCCCCACGGC CCTCGGCGCC
GGTCTTCCCT ACCGGCTGCG CCGCGGCTAC CTGAACATCC CCCGCATCAT CGGCAGACCG
GCCCACCTGA AGGGGAGCAT CCTCAATCCG GCCCGGTGGC GGTCGTGGCG GCAGCTTCCC
TCCAACAGCC CCCCCATTGC CCGGGGCATC TACCTCAATG CCGCCCGGGC CCTGCAGCTC
ACCTCCGTGA GGAACGAGCG GCCGGGATCG GGGAACGGGC TGCCGGACCA GAGTTTCACC
CTTGCCCGGA AGCCGGTTTT CGACGACCAG GTCTGGGTCA ACGAGTTCGG GCTCATTTCC
GCCGCAGAGC TGGATCGCCT CGCCGCAGAC GCTCCCGGTC GGGTCAGCCG GGTGACTGAC
CGGGAGGGAC GGGTGAGCGA AGTCTGGGTG CGGTGGGAAG GGCGCGACGA CCTCCTGGCC
TCGGGGCCGC TAGACCGGCA CTACGGCATC GACCGGAGCA GTGGTCTCGT CTCCTTTGGC
AACGGGAAAA GGGGCCGGGT GCTGCCTGCC GGAACGAACA ACGTCCGGGT CAGCTACCGC
ACCGGCGGAG GTGCTGCCGG CAACCTCGGC TGGGGGGCGA TTGCCCGCAT GCGGACCGGC
ATCCCCTTGG TGGACAAGGT CGTCAATCCC GGCCCCTGCG GAGGCGGGGT CGAGGGGGAG
GATATCTCCG CCCTCTACCG GCGGGGCCCC CAGAGCCTGC GGCGCCGCGA CCGGGCCGTG
ACCGTTGAGG ACTACGAGGG CCTGATCAGG GAGCAGTTCC CCGGTATGGC CCTGGTCAAG
TGCCTCCCCG TGTGCGATGA CCGGGGCATG ACCCGCACGG GATGGCTGAC GGTGATCATC
GTGCCGCGCG CGGCCGATGA CCGGCCCATC CCCTCGGCGG CCCTGCGCCG GCGGGTGGAG
GAGTTTGTCG CCGGTCACGG CGCCAACGTA GTGACCGCCC CCTGCCATGC CGTGGTGACC
CGGCCGTCCT ACGTGCGGGT GTCGGTTGAT GCCGCCCTGG TGCCCCGCAG CCTCGACCAG
GCCCCCGCCG TGGAGACGGC GGCCCTTGCC GCTCTCGGGC GGTTCCTCCA TCCCCTGACC
GGCGGGTGGG ACGGCGGAGG GTGGCCCTTT GGCCGCCTGG TCTGCCATTC GGACCTCTAC
CGCTTGCTGG AGGAGATCGA AGGGGTTGAT CGGGTGGCAA GCATGGCGGT AACGGCCGTC
GATGAAATGG GACGTCGGAT GGAGCTTGGC GAGGCGGATG AGTTGACGCG GCCGGTGGAC
CCCTATCTGC TCGTTTCCAG CGGAGACCAT CGGGTCGCGG CACGGGCCGG CGACGTATAG
 
Protein sequence
MNADLPLIDG RTAADVVEKI RATAPFYVPE WSGGVDGDAG SALTGVFGDM VVEVLQRLNR 
VPERHLAAFL EVLGLRLLPP RPAETVVVFT LAKDADRSAL VASGTQVMAE KTTDHPELLF
ETDENVLAVP SRITALYSTI PDTAGGREKV FGHTEAWTAG SSFTIFHGTE NLQEHALYLR
QTGLFTVRSG VEIHLSGVPR ALADQVAWSY TDADGRETLF GARFDGAGGT LILSTGAGRP
ELGRCVVNGI DGIWLKGVPK VGADGTTALA RVKDAVLRTV VGVGTRSLPT AVIHPDVAFA
GDVPQDLTVT AGGDFIRNLL PFGDKPMPLA AFHLASREVF SKRGARITLR ITCATDLDVP
IERVQGIGAV FSARLRGAGI ATAGELLSRS DAQVAGIIRA PGSARPASSY LLRARNIREA
TAKAYYDKTG AVSYRSGRAA ALGPGLSWEY WNGTGWCAIS DVTDSTGGLL ATGEVTFTCP
ADMAEVEVSG QRNWWIRVRL SSGDYGREFE IVNGQAVAAG FTPPRLGRIT LSWDGTDPSA
MGEPDSILTR NNLEWDDGLA VLRTGAVFRP FRPLADLRPA FYAGFDRPLA KGPIGLYLDL
AEIDYARDFR PRVQWEVYDA GACEWLRLDA EDGTAGLTRA GIVRVVVPEG GVPAPLFGTR
LHWLRAVLVA GEYAPTALGA GLPYRLRRGY LNIPRIIGRP AHLKGSILNP ARWRSWRQLP
SNSPPIARGI YLNAARALQL TSVRNERPGS GNGLPDQSFT LARKPVFDDQ VWVNEFGLIS
AAELDRLAAD APGRVSRVTD REGRVSEVWV RWEGRDDLLA SGPLDRHYGI DRSSGLVSFG
NGKRGRVLPA GTNNVRVSYR TGGGAAGNLG WGAIARMRTG IPLVDKVVNP GPCGGGVEGE
DISALYRRGP QSLRRRDRAV TVEDYEGLIR EQFPGMALVK CLPVCDDRGM TRTGWLTVII
VPRAADDRPI PSAALRRRVE EFVAGHGANV VTAPCHAVVT RPSYVRVSVD AALVPRSLDQ
APAVETAALA ALGRFLHPLT GGWDGGGWPF GRLVCHSDLY RLLEEIEGVD RVASMAVTAV
DEMGRRMELG EADELTRPVD PYLLVSSGDH RVAARAGDV