Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_3693 |
Symbol | |
ID | 5162963 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | + |
Start bp | 4320453 |
End bp | 4321667 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640551178 |
Product | peptidase U32 |
Protein accession | YP_001232419 |
Protein GI | 148265713 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00154214 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGC CCGAACTCCT TGCCCCAGCC GGCAACATGG AAAAACTCCG TATCGCCGTC CATTACGGCG CTGATGCCGT CTACCTGGGG GGAAAAAGCT TCGGTTTGCG GAACCTGGCC GATAATTTCT CCACCGCCGA ACTGGCGGAA GCCGTGGTCT ATGCCCATGA ACGGGGGGTT AAGGTCTACC TCACGGTCAA CGCCTACCCC GACAACGATG ATATAGGCGA ATTGCTGCAT TATCTGGAAG AAGTGCGCCC TATCCCCTTC GACGCCTACA TTGCCGCAGA TCCCGGTGTC ATTGAAACCA TCAGGGAAAT CTCGCCGGAA CGCGACATCC ACCTCTCAAC TCAGGCCAAC ACCACAAACT GGAAAAGCGC TCTTTTCTGG CAGAAACAGG GGATACGGCG TATTAACCTT GCCCGCGAGA TGTCCCTTGA AGGGATGCGC CAAGTCAGGG AGAGGACCGA CATCGAACTG GAGGCCTTTG TTCACGGAGC CATGTGCATC TCCTATTCGG GGCGCTGCCT CCTGTCCAGC GTCATGAGCG GCAGAAACGC CAACAAGGGT GAATGCACCC AGCCCTGCCG CTGGAACTAC GCCATCGTCG AAGAAACGAG ACCCGGCGAG TATTTCCCGG TCATGGAGGA TGAAAACGGC ACTTTTATCT TCAACTCCAA AGACCTCTGC CTACTTACCT ACCTGCCGGA ACTGGCGGGC GCTGGGGTGG ATTCCCTGAA AATCGAAGGA AGAATGAAGG GGATCTATTA CGTTGCCTCT GTCGTGAGAA TTTACCGCCA GGCCCTGGAC CGTTACTTCG CAGAGCCGGA AACCTACCGC TGTGATCCCG ACTGGCTGGA GGAACTCTGC AAGATCAGCC ACCGCGGCTA CACAACGGGC TTTTTCCTCG GCCCGCCAAA AGATATTGAC CACCAGTACC ACTCCAGCTA TATTAGAAAC CATGAATTTG TCGGCATAGT AGAAGAGCCA CTGCCGGACG GCGCCATTAT ATTGGAAGTC AGGAACAGGA TAAAAACCGG AGACACCCTG GAATTCATCG GTCCCGCCAT GTCGTCTTCC TTCCACGAAA TGAAAGAGAT CATCACCGAC CGGGGAGAAA GGGTTGAAGC TGCCAACCCT AACCAACGCA TCATTGTCAG GACCGCTTTC GCAGCAGAGA AATATGACCT GGTCCGACGG GAAAAATCCT TGTAG
|
Protein sequence | MKKPELLAPA GNMEKLRIAV HYGADAVYLG GKSFGLRNLA DNFSTAELAE AVVYAHERGV KVYLTVNAYP DNDDIGELLH YLEEVRPIPF DAYIAADPGV IETIREISPE RDIHLSTQAN TTNWKSALFW QKQGIRRINL AREMSLEGMR QVRERTDIEL EAFVHGAMCI SYSGRCLLSS VMSGRNANKG ECTQPCRWNY AIVEETRPGE YFPVMEDENG TFIFNSKDLC LLTYLPELAG AGVDSLKIEG RMKGIYYVAS VVRIYRQALD RYFAEPETYR CDPDWLEELC KISHRGYTTG FFLGPPKDID HQYHSSYIRN HEFVGIVEEP LPDGAIILEV RNRIKTGDTL EFIGPAMSSS FHEMKEIITD RGERVEAANP NQRIIVRTAF AAEKYDLVRR EKSL
|
| |