Gene Information |
Locus tag | GSU0231 |
Symbol | |
ID | 2687640 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 239002 |
End bp | 240777 |
Gene Length | 1776 bp |
Protein Length | 591 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637124897 |
Product | hypothetical protein |
Protein accession | NP_951292 |
Protein GI | 39995341 |
COG category | |
COG ID | |
TIGRFAM ID | |
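
The coordinate and length fields in this record are related by simple arithmetic, which the short sketch below checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with one stop codon that is not counted in the protein length. The function names are illustrative only and do not come from any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp rows of this record.
length = gene_length(239002, 240777)
print(length, protein_length(length))  # prints: 1776 591
```

Under these assumptions the computed values agree with the Gene Length and Protein Length rows.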
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.305258 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTTCA CCGGTGACCT GGAACACCTG TCGATCGTCG ACGTCATCCA ACTGCTGCAC GCCACCCGCA AGTCAGGCAC CCTCACCGTG CGGGGACGCA AGGGAGAGTC CCAGCTCGTC TTCAACGACG GCTACATCAT CAGCGCCAAT CACTTCGACA ACAGCGTCCG GATCGGCAAC ATCCTCGTAG AGGCCGGCGT CATCAGCAAG GAGGTCCTGG AGCAGGCCCT GCAGGAGCAG GAGGAGGCGG GAGCGGGACG GAAACCCCTG GTGGCAACAC TTCTCGAACG GGGCAGCGTC CGCAAGGAGG ACGCCTACCG GGGGCTTGAG GCCCTCATCG AGCTGACGGT GGTGGAAATT CTCACCTGGA GGCGGGGCAC CTTTGATCTG GACGTGAACC GGGTCAGCGT CTCCGACGAG TACCGCTATT TCCCGGAAAA GCTCCATGAG GAAATCACGC TCCACACGGA AAATGTCCTC ATGGACGCCC TCCGCATCTA CGACGAGAAA AAGCGCGATG GTCTGCTGGT GGAGGAGGAG TTTGCGATCG AGGCCCCTAT CCCGGACCTC TCCGGCGATG AAGCCGCCGA TTTCAACATC TCGGCCGACG ATCTGGGACT CGGGGACCTG GATCAGATCG AACGGAAAAT TCCCCAGGTC TTTCTGGGGC TGGAGGATCG CAGCCCCTCC CTTCAGCGTA AGATTCAAGA GCTGGGCGCA GACCTTTCCG ACAAAGAACA GGAGGAGCTC TTCGCCTTTC TGGGCCGGCT CGGGAACACC GCACCAGCCG CCGGTGCGCC CACTCTTTCC GCCATCCTCT TCAGCCCGGA CGACCTTTTT TCCTACTGCG TAACCACCGT CTGCCGTCAG GCGGGGATTT CCGTTTTCAC CACCAACGAC GAGCAGGACC TGGCCCCTCT GGCGCAACAG TTCGCCTCCC GTGGCGGGCA GACGACCCTG ATCCTTGACT CGCCGGCATC GCCCGGCTTT ACCCTGCCCG CAGAAGATGC GGCGCGGCTC CTGCGACGAG TCAGGGAGCA CCACCCCTCC CTCGCCCTCA TCCAGCTCGC TTCGCCCCTT GAGCCGGCCT TTGCCCTCCA GGCCCTGAAA GACGGAGCCG TGGCCGTCTT CCCCCGCCCC GTGCGCGAGG TGAGCGGCGA CACCTTCCTG GAGGACACCC TCCGGCTTCT GGACGCCCTG CCCCTCTATC TGAGGCGGCG GGGCTCAGAC GGCGGCGAAG CGGCCATGGC CCAACTCGGC AAAACGTTAA TGGAGTTGCG GGCACTTCGC GAGCCTCCGG AAATCGCCCT GACCCTCCTG AGTACGGTGG CAGGCACCTT CGAGCGGGCA CTCACTCTCA TCGTGCGCGA AGAGGAACTT ATCGCCGAGC GGAGTATCGG CATTCGGAGC CCCCGTGGCG CCTGCGTCTC ACCCTCTTTC GGGACGAGGA TCCCCCTGGA CCGCCCGTCG GTGCTCCGGG ACGCGATTGA AAAAAGAGCT GCATTTTATG GCGAAACTGA CGATGAGATA CTGAAGGGGC ATCTTTTTCC CATCATCGGC GCTCCGCTCC ACCCCACGGT CATCCTGCTC CCGCTGGTCT GCGGCGGCAA GGTCATCGCT CTCATCTACG GGGATTTCGG CCACAAGGGG GCAGCGCCCG TGCGCACCGA GCTGCTTGAG CTCGTGACGG GCGAGGCGGG GTTGGTTCTG GAAACGGCGC TCTATCGCAG GAAACGGGAG CGGAAGGCTC CCGAGGGGAC GGCCTGTGAC CGCTGA
|
Protein sequence | MSFTGDLEHL SIVDVIQLLH ATRKSGTLTV RGRKGESQLV FNDGYIISAN HFDNSVRIGN ILVEAGVISK EVLEQALQEQ EEAGAGRKPL VATLLERGSV RKEDAYRGLE ALIELTVVEI LTWRRGTFDL DVNRVSVSDE YRYFPEKLHE EITLHTENVL MDALRIYDEK KRDGLLVEEE FAIEAPIPDL SGDEAADFNI SADDLGLGDL DQIERKIPQV FLGLEDRSPS LQRKIQELGA DLSDKEQEEL FAFLGRLGNT APAAGAPTLS AILFSPDDLF SYCVTTVCRQ AGISVFTTND EQDLAPLAQQ FASRGGQTTL ILDSPASPGF TLPAEDAARL LRRVREHHPS LALIQLASPL EPAFALQALK DGAVAVFPRP VREVSGDTFL EDTLRLLDAL PLYLRRRGSD GGEAAMAQLG KTLMELRALR EPPEIALTLL STVAGTFERA LTLIVREEEL IAERSIGIRS PRGACVSPSF GTRIPLDRPS VLRDAIEKRA AFYGETDDEI LKGHLFPIIG APLHPTVILL PLVCGGKVIA LIYGDFGHKG AAPVRTELLE LVTGEAGLVL ETALYRRKRE RKAPEGTACD R
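
The sequences in this record are printed in space-separated blocks of ten, so they need to be joined before any computation. The sketch below shows one way to do that, compute the GC fraction, and translate a coding sequence. It is a minimal illustration, not the pipeline that produced this record: the helper names are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not handle).

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order; same assignments as translation table 11.
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(cds: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
prefix = clean("ATGTCTTTCA CCGGTGACCT GGAACACCTG")
print(translate(prefix))  # prints: MSFTGDLEHL
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence through `clean` and `gc_fraction` would allow the GC content row to be checked in the same way.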
|
| |