Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_3140 |
Symbol | |
ID | 5163544 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | - |
Start bp | 3710608 |
End bp | 3712620 |
Gene Length | 2013 bp |
Protein Length | 670 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640550625 |
Product | HEAT repeat-containing PBS lyase |
Protein accession | YP_001231874 |
Protein GI | 148265168 |
COG category | [C] Energy production and conversion |
COG ID | [COG1413] FOG: HEAT repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAAAAAC TGCATACCCT TACTGAACAA CTCTGTTCGC CTGATGAGGA AAAACGGAGA CTGGCAGTGG TCGGCCTGGG AGGGTATCCC CTTGACCAGA TTGAAGAAAG TCTTTTTCGT GCTCTGGGCG ATATAAGCTG GCGTGTCCGC AAAGAGGCGG TAGATATACT TGTGGCCGCT GCTACCGGTA CTGGTGCTGT AATCCTGGAG GAAAAGCTGA TCGATATGCT CAGATCCCAG GAAAATGCAG GCCTGCGCAA TTCCGCCGTG GAATCCCTGG AAAAGCTCGG GCGGCAGGCT ACACATGCCC TTTGCCGTCA TGTGGGGGAT GACGACCATG ATGTGCGGAA GTTCATTGTC GACATCATGG GGAACATCTG CGACCCTGCT TTTGTGCCGC ATTTGATAAA GGCGCTCGAT GACTCCGATG CCAACGTTCG TGCTGCAGCT GCAGAAAATC TGGGAAAGAT TCGGGATGCG CAGGCGATAC CGGCTCTTTT GCAGGCCTTG GCAAAAAATG ACGTTTGGCT GAGGTTCACC ATTCTCGAGT CCATCGGCAA AATCGGTAAA CCGGTGCCTA TGGCCGCGAT TGTACCGCTT GCCAAGGAAA ATCTCCTGAA AAAGGCGATA TACGACTGTC TCGGTGCGAT TGGCGATGTG GAGGCGGTGC CGCTTCTTGT GGAGGGATTG AAGGACAGGG TGAAGAATGC CCGTGAGGCA GCGGCGATGG GGCTTGTTTC GCTCAGGGAC CGCCTCCCCT CCGAAATCGC TGAACGTGCA GTGGATGAGA GGCTCTGGAG GTTCAAGGGA ACTCCGTTTG TGGAAGGTCT GCTTGCTTCG TTTGAAACTT CCGAGAGCAG TTTGAGAGAG TCACTGGTCA AAATTCTCGG CATCATCGGG GATGATCGTG CGTCTGGCCA TCTGCTTCGC GGCTGCTGTG ACGACAGGCT CCGCAGGCAC TGCTTGCAGG CATTCAAGGC CATGGGGGAG GGCGGAGCAG CATCTCTGGT TGAAGCATAT CCTGCCGCGG ATGACGATGA ACGCTGTTTC ATCGTTTATG TATGCGGAGA ATTGCGCTAC AAGGGGTGCG TTTCCCTATT AAGCGAAGGG CTGCTCGACT CCAATGCCCG GCTGAGAAAA GCCTCGGTGC AGGCTGCGGG TAAGACCGGC CTTGTTGTGT TTATCAACGA AATCGCCCAT CTGCTCGAAG ACAGTGAGCC GGATGTCCGT GAAGGGGCAA TTGAAGCCCT TTATCGCCTT GTCGAAGCGG ACAGGGTAGC CGTAGCAAAA ATTACCGGCA AACTTGCCTC TTCCGAAGTT TCTGAAAAAA GGCGTAATGC GGCGATTCTT TGCACGGCAC TCTCCGACAC CGATAAACTT TCGCTCCTTA TCAAGGATGA AGACGCATCC GTACGCAAAA CCGCGGTCAG TTCGTTGGCC AAGCTCAAGT CGGCGGCGAG CGTCGGGCAC CTCGTCATGG CTTTGGTTGA TGAGGATCCT GATGTCCGTA TTGCTGCAGC AGGCGCTCTC GGGGATATCG GCGGGGATGA CGTCCTCGAT CCTTTACTCT TGGCGCTTAA GGATGACGAT CCCTGGGTTA AGTGCGCAGC ACTGAAGAGC CTTGGTAATC TGCGCAACGA TGCCGCCTTG CCGGCGATTG TCGAGTTGTT TGAGAGTGCT GAAGGGCTCG TTTTGATCTC CGTGCTGGAT ACTGTTGCCC AGATCGGCGG AGACAAGGGG ACCGCATTGG TTGAAAGGGC GCTTGAAAAC CACGATGAAG AAGTGGTCAA GGCTGCCATC AACATCCTTT CTCTGAATGG CGATGGGTGG ATTCATGCAT ACCGGAACAA ACTTCTGTCC CATCCGCACT GGGATGTAAG GAGCAGTTTT GTTAAGGCAA TGGCTGCCCG GATGGGTGAG AAGGCATTGC CATATCTGCG TTCGGCCCTG GAAACCGAGT CGGATGAATC GGTAAAGGGG CAGATCGTGG AATTAATGGA TAGGTTCTTC TAA
|
Protein sequence | MEKLHTLTEQ LCSPDEEKRR LAVVGLGGYP LDQIEESLFR ALGDISWRVR KEAVDILVAA ATGTGAVILE EKLIDMLRSQ ENAGLRNSAV ESLEKLGRQA THALCRHVGD DDHDVRKFIV DIMGNICDPA FVPHLIKALD DSDANVRAAA AENLGKIRDA QAIPALLQAL AKNDVWLRFT ILESIGKIGK PVPMAAIVPL AKENLLKKAI YDCLGAIGDV EAVPLLVEGL KDRVKNAREA AAMGLVSLRD RLPSEIAERA VDERLWRFKG TPFVEGLLAS FETSESSLRE SLVKILGIIG DDRASGHLLR GCCDDRLRRH CLQAFKAMGE GGAASLVEAY PAADDDERCF IVYVCGELRY KGCVSLLSEG LLDSNARLRK ASVQAAGKTG LVVFINEIAH LLEDSEPDVR EGAIEALYRL VEADRVAVAK ITGKLASSEV SEKRRNAAIL CTALSDTDKL SLLIKDEDAS VRKTAVSSLA KLKSAASVGH LVMALVDEDP DVRIAAAGAL GDIGGDDVLD PLLLALKDDD PWVKCAALKS LGNLRNDAAL PAIVELFESA EGLVLISVLD TVAQIGGDKG TALVERALEN HDEEVVKAAI NILSLNGDGW IHAYRNKLLS HPHWDVRSSF VKAMAARMGE KALPYLRSAL ETESDESVKG QIVELMDRFF
|
| |