Gene Information |
Locus tag | Gura_2049 |
Symbol | |
ID | 5163088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | + |
Start bp | 2397893 |
End bp | 2399515 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640549544 |
Product | extracellular solute-binding protein |
Protein accession | YP_001230812 |
Protein GI | 148264106 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
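The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2397893, 2399515)
print(length)                     # 1623
print(protein_length_aa(length))  # 540
```

Under these assumptions the results agree with the Gene Length (1623 bp) and Protein Length (540 aa) fields of this record.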
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGGATCG GCCACGCAGC CTGTCTCATC TTACTTTCCC TGGTCCTGCT CATTGCAGCC TGTACTGACA AGCAGCCGGT CCGACAGACG GTGCGTGGTC CCGGCAAGCC TGCTTACGGT GACGCCATCG TTGTCGGCTC CATCGGCGAT GCCTCCAACC TGATTCCACT CCTTGCCTCC GACACTCCTT CCTTTGAAAT CGCCGGTTAT GTTTATAACG GCCTGGTGAA ATACGACAAG GACCTGAACC TGGTCGGCGA CCTGGCCGAG TCGTGGGACA TCTCGAAGGA CGGTCTGACC ATCACTTTCC ACCTGCGCAA GGGAGTCAAA TGGCACGACG GCGTGGAATT CACCTCCGCC GACGTCCTTT ACACCTACCG CGTCACCATT GATCCGAAGA CCCCCACGGC CTATTCCGAA GATTTCAAGC AGGTGAAGAG AGCTGAAGCG CCGGACCGCT ACACGTTCAG CGTTACCTAC GGCAAGCCGT TCGCCCCGGC CCTGGCATCC TGGGGAATGA ACATTCTACC CGCCCACCTC CTGGAAGGAA AAGACATCAC CAAGAGCGAG CTGGCCCGCA GGCCGGTCGG CACCGGCCCG TACCGCTTCA AGGAGTGGGT TGCGGGGCAG AAGATCGTAC TTGATGCGTA CAAGGACTAC TTCGAGGGGG AGCCGTACAT CGGTCGTTAC GTGTACCGCA TCATCCCCGA TAACTCCACC ATGTATATGG AACTGAAGGC GGGCGGGCTG GACATGATGG GGCTGACCCC GGTCCAGTAT CAGCGCCAGA CCGGCACGCC GGATTTCAAG GCGCGTTTCA ACAAATACCG CTACCCCGCT TCTGCCTATA CCTATCTCGG CTACAACCTG CGAAACCCCA TGTTTGCCGA CAAGCGGGTG CGCCAGGCCA TTACATCGGC TATCAACAAG GACGAGATCA TCCACGGCGT GCTGTTCGGT ATGGGACAGG TCGCTCACGG CCCTTATAAG CCGGGGACAT GGGCCTGCAA CCCACATATC AAGGATTTCG ACTATAACCC CGCCCTGGCG AAACAGCTCC TGACCGAAGC GGGCTGGAGG CAGGTGAACA GCGACGGAGT GCTGGTAAAG GACGGCAAGC CTTTCACCTT TACCATTCTC ACCAACCAGG GGAACGACCA GCGTCTGAAA ACCGCCCAGA TAATCCAGCG CCGCCTGAAA AATGTCGGTA TCGATGTGAA GATCCGGGTC CTCGAATTTG CGTCCCTCTT GACCAATTTT ATCGACAAGG GGAACTTCGA CGCGCTGATC ATGGGGTGGA CCATCACCCA GGACCCAGAC ATCTTCGACG TCTGGCATTC CAGCAAGACC GGACCGAAAG AGCTGAATTT CATCGGCTTC AAGAACAAAG AGGTGGACCG GCTGATAGAG GAGGGGCGCA GCACCTTCGA TGTGGAAAAG CGGAAACGTT GCTACTACCG CATCCAGGAG ATCCTTGCGG ACGAGCAGCC CTACACCTTC TTGTATGTGC CGGATTCGCT GCCGGTGGTG AGCTCCCGCT TCCGCGGCAT TGAGCCGGCC CCGGCAGGGA TCACGTACAA CTTCATCAAG TGGTATGTCC CCAAAGAGGA ACAGGTGTAC TAA
Protein sequence | MRIGHAACLI LLSLVLLIAA CTDKQPVRQT VRGPGKPAYG DAIVVGSIGD ASNLIPLLAS DTPSFEIAGY VYNGLVKYDK DLNLVGDLAE SWDISKDGLT ITFHLRKGVK WHDGVEFTSA DVLYTYRVTI DPKTPTAYSE DFKQVKRAEA PDRYTFSVTY GKPFAPALAS WGMNILPAHL LEGKDITKSE LARRPVGTGP YRFKEWVAGQ KIVLDAYKDY FEGEPYIGRY VYRIIPDNST MYMELKAGGL DMMGLTPVQY QRQTGTPDFK ARFNKYRYPA SAYTYLGYNL RNPMFADKRV RQAITSAINK DEIIHGVLFG MGQVAHGPYK PGTWACNPHI KDFDYNPALA KQLLTEAGWR QVNSDGVLVK DGKPFTFTIL TNQGNDQRLK TAQIIQRRLK NVGIDVKIRV LEFASLLTNF IDKGNFDALI MGWTITQDPD IFDVWHSSKT GPKELNFIGF KNKEVDRLIE EGRSTFDVEK RKRCYYRIQE ILADEQPYTF LYVPDSLPVV SSRFRGIEPA PAGITYNFIK WYVPKEEQVY
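The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a short sketch. It assumes the sequences are pasted as shown here, in space-separated blocks of ten characters, and it uses the standard codon assignments, which translation table 11 shares for elongation codons (table 11 differs from the standard code in its permitted start codons). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS up to the first stop codon.

    Assumption: the CDS begins with ATG. Alternative start codons allowed
    by table 11 (for example GTG or TTG) are not special-cased here.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence in this record.
prefix = "ATGAGGATCG GCCACGCAGC CTGTCTCATC"
print(translate(prefix))  # MRIGHAACLI
```

The translated prefix matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field.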