Gene GSU1359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1359 
Symbol 
ID2686434 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1484238 
End bp1487363 
Gene Length3126 bp 
Protein Length1041 aa 
Translation table11 
GC content63% 
IMG OID637126034 
Producthelicase, putative 
Protein accessionNP_952412 
Protein GI39996461 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1061] DNA or RNA helicases of superfamily II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTTTAG CTCACGGCAT ATACGAAGCA CTTCTCGATG AAGCTTTGCA GGAGGCCCTA 
GCCCTGCGGC CGGAACTGCG ACGCGTTTTC AGCAAGATCG ATCAGGAGGA GCAGCCGGCC
CTCTATGCCG CCTTTGTCGC CCGGGTACTG GAGCAGGCGC TGCGAGAGGA GTCGGACCAG
GAAAAGCGCC TCGCACTCTG CAACCGGATT CTTGGCTCCG TGGCTGCTGA GCCGGGTAGG
GGGCATCTGG AAAAGCGCCG TCTGATCCCG GAACAGAAGC CGCTCCTCTT GGAAATCACC
CCGCCCAACT ACAACCAATC CGGGCTCCCC CGGCCCCATA CCTCTCTTGC ACAAAGCAGT
CTCTTTACCG GAGCAGATGG TCATCTCCAG CTTGTCCATG AGTTGCTGGC TGAACTACGC
TCCGCCGACG GGGTGGATAT CCTCGTATCC TTCATCAAGT GGTCCGGTCT GCGGCTCCTC
ATGCCGGCTT TTGAAGATCT GCGTGACCGA CAGGTCCCGG TGCGGCTGAT CACCACCTCC
TACATGGGGG CGTCCGATGC CCCGGCGGTG GAGTGGCTGG CTGGTCTCCC CAACGTGCAG
GTCCGAGTCT CCTACGACAC CGAGCGGACC CGGCTCCACG CCAAGGCCTA TCACTTCCGG
CGGCAGAGCG GCTTCTCCAC CGCCTACATC GGCTCAGCCA ACATGTCCCA GGCGGCGATT
ACCAGCGGGC TGGAGTGGAA CCTGAAAGTC ACCGCCCAGG ACATGCCTCA CGTGCTGGAG
AAGTTCAGCG TCGAGTTCGA GACTTACTGG AACAGCCGGG AATTCGTCCC CTTCGATCCG
GCCCGGCCGG AGCAGTTTCG CAGCGCCCTG GCCCGCGCCA AGAACAAGGA GATCAGCGGT
CCGGCGGTCT TTTTCGATCT TACCCCCCAC CCGTTTCAGG AGCGGATTCT GGAAGCGCTG
GAGCGGGAGC GCAGCGCCCA CGACCGCTGG CGCAATCTGG TCATCGCCGC CACCGGCACC
GGTAAGACCG TGGTAGCGGC CTTTGATTTC AAGCGCTTCT ATGAGCAGAA ACAGAAGCAG
GCAAAGCTCC TTTTCCTGGC CCACCGGCAG GAGATTCTCC AGCAGGCACT CGGCACCTTC
CGCAACATCC TGCGGGACCA GAATTTCGGC GAACTGCAGG TCGGGCCCTA TGACGCCACC
CGTCTGGAGC ACCTCTTCTG CTCCGTCGGT ATGCTCACCT CCCGCCGCCT CTGGGAGCAG
GTCGGCCCGG ACTTCTACGA CTACATCGTC ATCGACGAAG CCCACCACGG CACGGCCGGC
AGCTACCGTC CCATCTTCAA CCACTTCTCC CCGCAGATAC TGCTGGGACT CACCGCCACC
CCGGAACGGA TGGACGGCGA TAACGTGGCC GCCGATTTTG GCAACCGCTT TGCCGCCGAG
ATCCGCCTCC CTGAAGCGCT GGAGGAGAAG CTCCTCTGCC CCTTCCACTA CTTCGGCATT
GCCGACCCCA TCGCCATCAA CGGCGATCAG TTCTGGCGCA ATGGCAAGTA CGATGCCGCG
GCTCTGGAGA ACGTCTATGT CCTCGATACG GCCAACGCCC GTAAGCGGGT GGCGGCGATC
ATTGAGGCTC TGAAGCGCTA CGAGCCGGAC GTAGCGAGCC TCAAGGGGAT CGGTTTTTGC
GTCACCATCC GCCATGCCGA GTACATGGCC GAGCAGTTCA GCCAGCGCGG CATCCCGTCA
GCCCCCTTTG TCTCCGGCAT CGACTCCGAC CAGTGCGCCG ACCTCCTGGC CCGGCTGAGA
AACGGCCAGC TCACCTTCCT CTTCACCGTG GACAAGCTGA GTGAAGGGGT GGACGTGCCG
GAGATCAACA CCGTCCTCTT TCTGCGCCCC ACCGAGAGCC TGACGGTCTT TCTCCAGCAG
CTCGGCCGCG GCCTGCGCCA TGCCCCGGAG AAGGAGTGCC TCACCGTCCT CGACTTTGTC
GGTCAGGCCC ACCGCCGCTA CCGCATCGAC ACCAAGCTGA AAGCACTCCT CCCCCGGCAC
CGCTTCGCCA TCGACAAGGA AGTGGAGCAA GATTTCCCCC ATCTGCCGGC CGGCTGCGCC
ATCCAGCTTG ACCGCCTCTC CCGCCAGTAT GTGCTGGACA ATATCCGGGA GAACTTCGGC
CGTCTGGCGG TGCAGGTGCC GGACCGGCTC CAGACCTTTA CCAGCGAGAC CGGGCAGGAG
CTGACCTTTG GCAACTTCAT CCGCTACCAC GATTACGAGC CGGAGGTGCT GCTGGCCAAG
GAGTCGTGGA GCGAGTGGAA AGCCAAGGCC CAACTGGCGC CGATTCCGGT CGATCCCGAT
CTGGCGCGGC TGAAGAAAGC CCTGCTCCGG GCGGCCTTCA TCAATGGCCC ACGGGAAGCA
GAGCTGCTGC GGCAACTGCT GGCCATGCTT GCCGCCGGGC AGGTGATCGA GGCGCTGGCT
CTGGCCGGTT CATCGGCCAT GCTGCTCTAT TACCGCATCT GGGGGGATAG GGCGGAGAAG
GTGGGGATTG CCTCGCTGGC AGAGGCCTTC CAACGGCTAG CCGCCAACCC GAGCATCTGC
GCCGATTTGG ACGAGATTTT GGCCTGGTCG TTGGATACCA CCGAAGTTGC CGGGATTGCC
CCGGAGCTGC CGTATCGTGT GCCGCTGGAG TTGCACGCCC AGTATGGCAT TCGTGAGGTT
CAGGCCGCCT TTGGCCGTGC GACGCTGGAG AGCAGCGGCC AGACCGGGGT CGGGGTGATG
CACTTTGCCG AGCAGAAGAC CTATGCCCTG TTGGTGACCT TCCAGAAGAC GGAAAAGGAG
TTCTCCCCCA GCACCATGTA TGCGGATTAC CCCATCAGCC GGGAGCTGCT GCACTGGGAG
TCCCAGGCCA ACACCGCCCA GCATCACACC GACGGGCAGA ACCTGATTCA TCATCAGGAG
CGGGGCTACA CGGTTCTGGT ATTTGCCCGT GGGAAGAAAA AGCGGAACGG GGTTACAGTG
CCGTTCACTT ACCTGGGGCC GGTGGATATG GTGAGTTATG AGAGCGAGCG GCCGATTAAG
ATGGTCTGGC GGCTCAGGTA TGCGATGCCG GTGGAGATGT TTGAGGATAA TCGGCGGGGT
GGGTGA
 
Protein sequence
MRLAHGIYEA LLDEALQEAL ALRPELRRVF SKIDQEEQPA LYAAFVARVL EQALREESDQ 
EKRLALCNRI LGSVAAEPGR GHLEKRRLIP EQKPLLLEIT PPNYNQSGLP RPHTSLAQSS
LFTGADGHLQ LVHELLAELR SADGVDILVS FIKWSGLRLL MPAFEDLRDR QVPVRLITTS
YMGASDAPAV EWLAGLPNVQ VRVSYDTERT RLHAKAYHFR RQSGFSTAYI GSANMSQAAI
TSGLEWNLKV TAQDMPHVLE KFSVEFETYW NSREFVPFDP ARPEQFRSAL ARAKNKEISG
PAVFFDLTPH PFQERILEAL ERERSAHDRW RNLVIAATGT GKTVVAAFDF KRFYEQKQKQ
AKLLFLAHRQ EILQQALGTF RNILRDQNFG ELQVGPYDAT RLEHLFCSVG MLTSRRLWEQ
VGPDFYDYIV IDEAHHGTAG SYRPIFNHFS PQILLGLTAT PERMDGDNVA ADFGNRFAAE
IRLPEALEEK LLCPFHYFGI ADPIAINGDQ FWRNGKYDAA ALENVYVLDT ANARKRVAAI
IEALKRYEPD VASLKGIGFC VTIRHAEYMA EQFSQRGIPS APFVSGIDSD QCADLLARLR
NGQLTFLFTV DKLSEGVDVP EINTVLFLRP TESLTVFLQQ LGRGLRHAPE KECLTVLDFV
GQAHRRYRID TKLKALLPRH RFAIDKEVEQ DFPHLPAGCA IQLDRLSRQY VLDNIRENFG
RLAVQVPDRL QTFTSETGQE LTFGNFIRYH DYEPEVLLAK ESWSEWKAKA QLAPIPVDPD
LARLKKALLR AAFINGPREA ELLRQLLAML AAGQVIEALA LAGSSAMLLY YRIWGDRAEK
VGIASLAEAF QRLAANPSIC ADLDEILAWS LDTTEVAGIA PELPYRVPLE LHAQYGIREV
QAAFGRATLE SSGQTGVGVM HFAEQKTYAL LVTFQKTEKE FSPSTMYADY PISRELLHWE
SQANTAQHHT DGQNLIHHQE RGYTVLVFAR GKKKRNGVTV PFTYLGPVDM VSYESERPIK
MVWRLRYAMP VEMFEDNRRG G