Gene GSU0988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0988 
Symbol 
ID2685732 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1063766 
End bp1065799 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content68% 
IMG OID637125658 
Producthypothetical protein 
Protein accessionNP_952042 
Protein GI39996091 
COG category 
COG ID 
TIGRFAM ID[TIGR02243] conserved hypothetical protein, phage tail-like region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCATAG TACCGATACC GGATCTTGAC GACAAACGAT ACGCCGACCT GGTGCGGGAG 
GCAGTGGCGA GCCTTGCGGT GCGCGCACCG GCGTGGACCG ACCACAACGC ATCGGACCCG
GGGATCACAT TGGTGGAGCT CTTCGCCTGG CTGGCGGAGA TGCAGATCTA CGGGCTCAAC
TGCCTTACCC CCGACCACTA CCGGACCTTC CTCCGTCTCG TGGGTATCCG GCCGGTTCCG
GCAGTGCCCG CCACCGTGGC CCTGACCCTC TCATCCACCG CCGGACGGGA GTTGATGATC
CCGCGAGGTA CGCGCCTCAC GGCCGAGACG GGAGGGGAAC CCCTTCCCTT CGAGACGACG
GCCGACCTGT TCCTGCTGAA CAACCGGATC GCTGCCGTCA TCAGCGCCTG GGGCGGCGGC
TTCCGCAATG TCACCGATGC CAATGCCCGC GATGCCTTCT TTTTCCCTGC CTTCGGCGAG
CAGCCGGCAC CGGGGGGCAC GCTCCTCATC GCCTTTGAGC GGATGCTTCC GGCAGGGCGG
GAGGTGCGCC TCGCCATCGA TCTCTACGAA GCAGACCTGC CCCCCGTGGG TGCCCACCGG
GGCGAACCGG CCGCCGTCGT ACCCCCGGTG GATCTGGTCT GGGAGTATTC CACGGAGGCC
GGTTACCGTC GGCTCGATCT GCTGGAGGAC GGCACCACCG GCCTGACCCG CTCCGGACAG
ATTCTCTTTT CAGTGCCTGC CGACATGACG GCGACCACCT CGCCTTACCT GCCGGCAACG
GTGCCTGCGC CGCGCTTTCC CATGATCCGC TGTCGCCTGC CGGAAGAGAG CACCGCCGTG
CCGCTGCCTT CCGAGGGTCT GTCGGCATGC CGCCGCGCCG CCGCCCGGCA GGGGACGGGC
TTCTATGAGA TACCTCCGCG CATCGACTCC ATCCGCCTCA ATACGGTGAC GGCGCGCCAA
GCGGTATCGG TGGTGGACGA GAACCTTGTC TCTTCATCCG GCACCGACTG GGGCAATGGC
ATGCCGGGAC AGGTAGTTTC CCTTGCCCGG CGGCCGGCCA TGGAGGGGAG CCTCAGGGTC
AGGGTCCTGA CCGGCACCGA TTGGGAAGAG TGGGCCGAAC GCGACAACCT GGACGCCTCG
GGGCCCACGG ACCGGCATTT CGTACTCGAC CCGGCGGTCG GTACCATCCT TTTCGGTGAT
GGCCGCAACG GGCGGGTTTT GCCGGAGGGG GCTCGGGTGC GGGCCGACTA CCTGTCAGGC
GGGGGCGCGG CGGGAAACCT GAGGCCCCTG GCATCGTGGC GCTTCGATGA TCCCCTGCTG
GCGGACCTTG CCGCGCGCAA TGACGCGTCC GCCAGCGGCG GCAGAGACGC CGAGCCGCTC
GAGGAGGCCA TTGCCCGCGC CCCCCTGGAA CTGCGGGAGG TGGACCGGGC CATAACCTCC
GATGATTTCG AGTATCTGGC GCTGAATACC CCCGGTCTCC GGGTGGCCCG CGCCCGGGCG
CTTCCCCTCT GGGAGCCGGA AGCGCCGGAG GATCGTCCTG TGCCGGCCAC GGTGACCGTG
GTCGTGGTCC CCTGGTCCTT TACCCCGCGG CCGTATCCCG GCCCACGGTT CCTGCGCGCG
GTGTGCGACC ACCTGGATCG CCACCGGCTG GTGACGACCC GCGTCAGGGT GATCCCTCCC
CTCTACGCGC AGGTGACGGT CAGAACACGG GTCAGCGCCG GTGAAGGGGT ACGTCCCGAG
GAGTTGCGAG CGCGGGTGGC CGAACGGCTT CTGGAGTTTC TTCATCCCCT GAAGGGGGGT
GAGGACGGGA CGGGGTGGCC TTTTGGCAGA GGGGTCTACC GTTCGGAGAT CATTGCAGCC
ATCAGGGAGG TGAGCGGTGT GGAATGCGTG CTGGAGACGA CGCTTTCGGG CGATACGTGC
ACGCGGGTCG ACGGCGAGGG CAACCTGATG ATAGACCGGG ACGCCCTGGT CTATTCCGAA
CGGCACGAGG TGGACGTGAC TGCCCGCTCC GGCCGGTGCA CCGTCACGTA CTGA
 
Protein sequence
MPIVPIPDLD DKRYADLVRE AVASLAVRAP AWTDHNASDP GITLVELFAW LAEMQIYGLN 
CLTPDHYRTF LRLVGIRPVP AVPATVALTL SSTAGRELMI PRGTRLTAET GGEPLPFETT
ADLFLLNNRI AAVISAWGGG FRNVTDANAR DAFFFPAFGE QPAPGGTLLI AFERMLPAGR
EVRLAIDLYE ADLPPVGAHR GEPAAVVPPV DLVWEYSTEA GYRRLDLLED GTTGLTRSGQ
ILFSVPADMT ATTSPYLPAT VPAPRFPMIR CRLPEESTAV PLPSEGLSAC RRAAARQGTG
FYEIPPRIDS IRLNTVTARQ AVSVVDENLV SSSGTDWGNG MPGQVVSLAR RPAMEGSLRV
RVLTGTDWEE WAERDNLDAS GPTDRHFVLD PAVGTILFGD GRNGRVLPEG ARVRADYLSG
GGAAGNLRPL ASWRFDDPLL ADLAARNDAS ASGGRDAEPL EEAIARAPLE LREVDRAITS
DDFEYLALNT PGLRVARARA LPLWEPEAPE DRPVPATVTV VVVPWSFTPR PYPGPRFLRA
VCDHLDRHRL VTTRVRVIPP LYAQVTVRTR VSAGEGVRPE ELRARVAERL LEFLHPLKGG
EDGTGWPFGR GVYRSEIIAA IREVSGVECV LETTLSGDTC TRVDGEGNLM IDRDALVYSE
RHEVDVTARS GRCTVTY