Gene Information |
Locus tag | Gura_0967 |
Symbol | |
ID | 5166756 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | - |
Start bp | 1147769 |
End bp | 1148728 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640548463 |
Product | rhomboid family protein |
Protein accession | YP_001229746 |
Protein GI | 148263040 |
COG category | [R] General function prediction only |
COG ID | [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) |
TIGRFAM ID | |
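
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based and inclusive, and that the reported protein length excludes the stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1147769
end_bp = 1148728
reported_gene_length = 960      # "Gene Length", in bp
reported_protein_length = 319   # "Protein Length", in aa

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not add a residue.
protein_length = gene_length // 3 - 1

print("gene length matches:", gene_length == reported_gene_length)
print("protein length matches:", protein_length == reported_protein_length)
```

Under these assumptions both computed values agree with the reported Gene Length and Protein Length.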
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0078595 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCCGG AGAGAGATAA CATCGAGGCA TGCGAAGAGG AGTGGCTGGC AATCCCGCCG GAGTTGGGAG TCTGGAAAGA CAGCGGCACA CTTTCCGAGC GGCAGGTACG CCTCTGGACC CTGGTCCTGG ATGCACGCGG CGTGCCCTTC CGCACCGAGC GGAGCGCTAC GGGCTGGCAA CTGCTGGTGC CGGTGGGCTA CCTCAATGCG GCCCGGGACG AGTTGCGCCT CTTTGAAAAG GAAAACCGCA ACTGGCCCCC GCCCCTGCCT CCGGCAAGAA CCCTGACGGA GAACACCCTG GCAACCATGT CGGTCCTGAT TCTCCTGGCC ACCTTCCACA ACCTTACCCT GCTCGACATT TCTCTGCCCG GCCATCACCC GATCAACTGG ATCGCCCTCG GCAACGCACA CGCCGCCAAG ATACTGGCCG GCCAATGGTG GCGGCCGATC ACCGCCCTCA CCCTCCACTC CAACTGGCAG CACCTTCTCG GCAACCTGGC AATCGGCGGG GTCTTCATCA TCATCCTCTG CCGCGAGCTC GGCTCGGGGC TGGCCTGGAG CATGCTCCTC GGCGCCGGCA TCCTCGGCAA CCTGGCCAAC GCCTGCTTGC AGCTGCCGGA CCATAGCTCG ATCGGCGCCT CCACCCTCGT CTTCGGCGCC GTCGGCATAC TCGCCGCCCT CAACATGGTG CACTACCGGC ACCACCTGCA AAAGCGCCGG CTACTCCCCG TTGCTGCTGC CATGGCCCTG CTCGCATTGT TGGGCACAGA AGGTGAACAC ACAGATCTGG GTGCACACCT GTTCGGCTTT GTCTTCGGCA TAGGTCTTGG CCTGGTTACG GAATACCTGG CAGGGAAGTA CGGGCGGCCC GGGCGGCGGA TCAACGCCCT GCTGGCGCTG GCCGGAGCCG TTGTGGTGAT AGCGGCCTGG TGGGGGGCGC TGGGTCATTT TGCACTTTAG
|
Protein sequence | MDPERDNIEA CEEEWLAIPP ELGVWKDSGT LSERQVRLWT LVLDARGVPF RTERSATGWQ LLVPVGYLNA ARDELRLFEK ENRNWPPPLP PARTLTENTL ATMSVLILLA TFHNLTLLDI SLPGHHPINW IALGNAHAAK ILAGQWWRPI TALTLHSNWQ HLLGNLAIGG VFIIILCREL GSGLAWSMLL GAGILGNLAN ACLQLPDHSS IGASTLVFGA VGILAALNMV HYRHHLQKRR LLPVAAAMAL LALLGTEGEH TDLGAHLFGF VFGIGLGLVT EYLAGKYGRP GRRINALLAL AGAVVVIAAW WGALGHFAL
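
The relationship between the two sequences can be illustrated by translating the start and the end of the gene sequence. This is a minimal sketch, not the database's own method: it assumes the gene sequence is given 5' to 3' on the coding strand (it begins with ATG even though the gene lies on the minus strand), and it uses the standard amino acid assignments, which translation table 11 shares with table 1. The names CODON_TABLE and translate are hypothetical helpers.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon; '*' marks a stop."""
    dna = dna.replace(" ", "").upper()
    codons = (dna[i:i + 3] for i in range(0, len(dna) - len(dna) % 3, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


# First three 10-base blocks of the gene sequence, copied from the record.
gene_head = "ATGGATCCGG AGAGAGATAA CATCGAGGCA"
# Last nine bases of the gene sequence (the final three codons).
gene_tail = "GCACTTTAG"

print(translate(gene_head))  # compare with the first block of the protein sequence
print(translate(gene_tail))  # the final codon should be a stop
```

The first call yields MDPERDNIEA, the first ten residues of the protein sequence, and the second ends in a stop codon, consistent with a 960 bp gene encoding 319 residues.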