Gene GSU0231 details


Gene Information

Locus tag: GSU0231
Symbol:
ID: 2687640
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 239002
End bp: 240777
Gene Length: 1776 bp
Protein Length: 591 aa
Translation table: 11
GC content: 64%
IMG OID: 637124897
Product: hypothetical protein
Protein accession: NP_951292
Protein GI: 39995341
COG category:
COG ID:
TIGRFAM ID:
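
The coordinate and length fields above agree with each other if the start and end positions are read as 1-based, inclusive coordinates and the gene length is taken to include the stop codon. Neither convention is stated in the record, so both are assumptions. A minimal Python sketch of that consistency check, using hypothetical helper names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(239002, 240777)
assert length == 1776                      # matches "Gene Length"
assert protein_length_aa(length) == 591    # matches "Protein Length"
```

If either assertion failed for another record, the likely causes would be a different coordinate convention or a partial or spliced gene.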


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.305258
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTTTCA CCGGTGACCT GGAACACCTG TCGATCGTCG ACGTCATCCA ACTGCTGCAC 
GCCACCCGCA AGTCAGGCAC CCTCACCGTG CGGGGACGCA AGGGAGAGTC CCAGCTCGTC
TTCAACGACG GCTACATCAT CAGCGCCAAT CACTTCGACA ACAGCGTCCG GATCGGCAAC
ATCCTCGTAG AGGCCGGCGT CATCAGCAAG GAGGTCCTGG AGCAGGCCCT GCAGGAGCAG
GAGGAGGCGG GAGCGGGACG GAAACCCCTG GTGGCAACAC TTCTCGAACG GGGCAGCGTC
CGCAAGGAGG ACGCCTACCG GGGGCTTGAG GCCCTCATCG AGCTGACGGT GGTGGAAATT
CTCACCTGGA GGCGGGGCAC CTTTGATCTG GACGTGAACC GGGTCAGCGT CTCCGACGAG
TACCGCTATT TCCCGGAAAA GCTCCATGAG GAAATCACGC TCCACACGGA AAATGTCCTC
ATGGACGCCC TCCGCATCTA CGACGAGAAA AAGCGCGATG GTCTGCTGGT GGAGGAGGAG
TTTGCGATCG AGGCCCCTAT CCCGGACCTC TCCGGCGATG AAGCCGCCGA TTTCAACATC
TCGGCCGACG ATCTGGGACT CGGGGACCTG GATCAGATCG AACGGAAAAT TCCCCAGGTC
TTTCTGGGGC TGGAGGATCG CAGCCCCTCC CTTCAGCGTA AGATTCAAGA GCTGGGCGCA
GACCTTTCCG ACAAAGAACA GGAGGAGCTC TTCGCCTTTC TGGGCCGGCT CGGGAACACC
GCACCAGCCG CCGGTGCGCC CACTCTTTCC GCCATCCTCT TCAGCCCGGA CGACCTTTTT
TCCTACTGCG TAACCACCGT CTGCCGTCAG GCGGGGATTT CCGTTTTCAC CACCAACGAC
GAGCAGGACC TGGCCCCTCT GGCGCAACAG TTCGCCTCCC GTGGCGGGCA GACGACCCTG
ATCCTTGACT CGCCGGCATC GCCCGGCTTT ACCCTGCCCG CAGAAGATGC GGCGCGGCTC
CTGCGACGAG TCAGGGAGCA CCACCCCTCC CTCGCCCTCA TCCAGCTCGC TTCGCCCCTT
GAGCCGGCCT TTGCCCTCCA GGCCCTGAAA GACGGAGCCG TGGCCGTCTT CCCCCGCCCC
GTGCGCGAGG TGAGCGGCGA CACCTTCCTG GAGGACACCC TCCGGCTTCT GGACGCCCTG
CCCCTCTATC TGAGGCGGCG GGGCTCAGAC GGCGGCGAAG CGGCCATGGC CCAACTCGGC
AAAACGTTAA TGGAGTTGCG GGCACTTCGC GAGCCTCCGG AAATCGCCCT GACCCTCCTG
AGTACGGTGG CAGGCACCTT CGAGCGGGCA CTCACTCTCA TCGTGCGCGA AGAGGAACTT
ATCGCCGAGC GGAGTATCGG CATTCGGAGC CCCCGTGGCG CCTGCGTCTC ACCCTCTTTC
GGGACGAGGA TCCCCCTGGA CCGCCCGTCG GTGCTCCGGG ACGCGATTGA AAAAAGAGCT
GCATTTTATG GCGAAACTGA CGATGAGATA CTGAAGGGGC ATCTTTTTCC CATCATCGGC
GCTCCGCTCC ACCCCACGGT CATCCTGCTC CCGCTGGTCT GCGGCGGCAA GGTCATCGCT
CTCATCTACG GGGATTTCGG CCACAAGGGG GCAGCGCCCG TGCGCACCGA GCTGCTTGAG
CTCGTGACGG GCGAGGCGGG GTTGGTTCTG GAAACGGCGC TCTATCGCAG GAAACGGGAG
CGGAAGGCTC CCGAGGGGAC GGCCTGTGAC CGCTGA
 
Protein sequence
MSFTGDLEHL SIVDVIQLLH ATRKSGTLTV RGRKGESQLV FNDGYIISAN HFDNSVRIGN 
ILVEAGVISK EVLEQALQEQ EEAGAGRKPL VATLLERGSV RKEDAYRGLE ALIELTVVEI
LTWRRGTFDL DVNRVSVSDE YRYFPEKLHE EITLHTENVL MDALRIYDEK KRDGLLVEEE
FAIEAPIPDL SGDEAADFNI SADDLGLGDL DQIERKIPQV FLGLEDRSPS LQRKIQELGA
DLSDKEQEEL FAFLGRLGNT APAAGAPTLS AILFSPDDLF SYCVTTVCRQ AGISVFTTND
EQDLAPLAQQ FASRGGQTTL ILDSPASPGF TLPAEDAARL LRRVREHHPS LALIQLASPL
EPAFALQALK DGAVAVFPRP VREVSGDTFL EDTLRLLDAL PLYLRRRGSD GGEAAMAQLG
KTLMELRALR EPPEIALTLL STVAGTFERA LTLIVREEEL IAERSIGIRS PRGACVSPSF
GTRIPLDRPS VLRDAIEKRA AFYGETDDEI LKGHLFPIIG APLHPTVILL PLVCGGKVIA
LIYGDFGHKG AAPVRTELLE LVTGEAGLVL ETALYRRKRE RKAPEGTACD R
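
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is illustrative only: `translate` is a hypothetical helper, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored so the 10-base blocks used in the record
    can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTCTTTCA CCGGTGACCT GGAACACCTG"))  # MSFTGDLEHL
```

The output matches the first ten residues of the protein sequence. Translating the final six bases, `CGCTGA`, gives `R` followed by a stop, which matches the last residue listed.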