Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_3125 |
Symbol | |
ID | 5165368 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | - |
Start bp | 3693132 |
End bp | 3694202 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640550610 |
Product | peptidase M42 family protein |
Protein accession | YP_001231859 |
Protein GI | 148265153 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.612233 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGATG CATCCTTTGA ATTTTTACAG AAACTGCTGG AGGCTCCCAG TCCGTCAGGA TACGAGCAAC CGGCCCAGCG GATATTCCGC GCTTACATCG CCCCCTTTGC CCAAGTCAGC ACCGATGTCA TGGGTAATGT CTTCGGTGCC ATCGAGGGCA TTGGTGTGGA GCGGCCGCGG GTCATGCTGG TCGGTCACTC CGATGAAATA GGTTTTCAGG TCAAGTACTT GGACGACAAC GGCTTTCTCT ATTTCGCCGC CATCGGTGGG GTTGACCCTC ATATTACCCC CGGTCAGCGG GTGAATATCC ATGGCTGCAA GGGGGTTGTT CCGGGGGTGA TCGGCAAGAA GCCGATCCAT CTGATGGAGA CCAAGGAGCG GGAGACCGTG GTGAAGCTCG ATGCCCAGTA CATAGATATC GGCGCTGCTA ACAGGAAAGA GGCGGAAACG CTGGTCCGGG TCGGCGACCC GGTGACCTTT GCCGTCGGCC TGGAAAAGCT TCATGGCGAC CGGGTCACTT CCCGCGGTTT CGACGACAAG GCGGGGAGTT TTGTCGTTGC GGAGGTGTTG CGGCAGGTGG CGGCGTTAAG CGCAAAACTG CCGGTCGATC TCTTCGGCGT GTCATCGGTG CAGGAGGAAG TGGGGCTGCG AGGAGGAACA ACGAGCAGCT ATACGGTGAA TCCTGATATC GGTATCTGTG TCGAAGTCGA TTTTTCCACT GACCAGCCGG ACGTTGATAA AAAGCACAAC GGTGAAGTGG GGATCGGCAA GGGACCGATT CTGCCGCGGG GGGCGAACAT AAACCCTGTC CTGTTCGAGC TTCTTGCCGA TACGGCGGCG CGCGAGAACA TACCGGTGCA GTTTACCGGC ATCCCAAGGG CGACCGGCAC CGATGCCAAC GTCATGCAGA TATCCCGCGG CGGTGTTGCC ACGGCCCTGG TGAAAATACC CCTCCGCTAC ATGCATACTC CTGTGGAGGT ACTTTCTTTG GCCGATCTGG AAAACGCCGT CAGGCTGATT GTCGCGACTC TGGCGCGAAT AACGGATAAA CAGACATTTA TCCCGAGCTG A
|
Protein sequence | MRDASFEFLQ KLLEAPSPSG YEQPAQRIFR AYIAPFAQVS TDVMGNVFGA IEGIGVERPR VMLVGHSDEI GFQVKYLDDN GFLYFAAIGG VDPHITPGQR VNIHGCKGVV PGVIGKKPIH LMETKERETV VKLDAQYIDI GAANRKEAET LVRVGDPVTF AVGLEKLHGD RVTSRGFDDK AGSFVVAEVL RQVAALSAKL PVDLFGVSSV QEEVGLRGGT TSSYTVNPDI GICVEVDFST DQPDVDKKHN GEVGIGKGPI LPRGANINPV LFELLADTAA RENIPVQFTG IPRATGTDAN VMQISRGGVA TALVKIPLRY MHTPVEVLSL ADLENAVRLI VATLARITDK QTFIPS
|
| |