Gene Gura_2049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_2049 
Symbol 
ID5163088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp2397893 
End bp2399515 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content58% 
IMG OID640549544 
Productextracellular solute-binding protein 
Protein accessionYP_001230812 
Protein GI148264106 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGATCG GCCACGCAGC CTGTCTCATC TTACTTTCCC TGGTCCTGCT CATTGCAGCC 
TGTACTGACA AGCAGCCGGT CCGACAGACG GTGCGTGGTC CCGGCAAGCC TGCTTACGGT
GACGCCATCG TTGTCGGCTC CATCGGCGAT GCCTCCAACC TGATTCCACT CCTTGCCTCC
GACACTCCTT CCTTTGAAAT CGCCGGTTAT GTTTATAACG GCCTGGTGAA ATACGACAAG
GACCTGAACC TGGTCGGCGA CCTGGCCGAG TCGTGGGACA TCTCGAAGGA CGGTCTGACC
ATCACTTTCC ACCTGCGCAA GGGAGTCAAA TGGCACGACG GCGTGGAATT CACCTCCGCC
GACGTCCTTT ACACCTACCG CGTCACCATT GATCCGAAGA CCCCCACGGC CTATTCCGAA
GATTTCAAGC AGGTGAAGAG AGCTGAAGCG CCGGACCGCT ACACGTTCAG CGTTACCTAC
GGCAAGCCGT TCGCCCCGGC CCTGGCATCC TGGGGAATGA ACATTCTACC CGCCCACCTC
CTGGAAGGAA AAGACATCAC CAAGAGCGAG CTGGCCCGCA GGCCGGTCGG CACCGGCCCG
TACCGCTTCA AGGAGTGGGT TGCGGGGCAG AAGATCGTAC TTGATGCGTA CAAGGACTAC
TTCGAGGGGG AGCCGTACAT CGGTCGTTAC GTGTACCGCA TCATCCCCGA TAACTCCACC
ATGTATATGG AACTGAAGGC GGGCGGGCTG GACATGATGG GGCTGACCCC GGTCCAGTAT
CAGCGCCAGA CCGGCACGCC GGATTTCAAG GCGCGTTTCA ACAAATACCG CTACCCCGCT
TCTGCCTATA CCTATCTCGG CTACAACCTG CGAAACCCCA TGTTTGCCGA CAAGCGGGTG
CGCCAGGCCA TTACATCGGC TATCAACAAG GACGAGATCA TCCACGGCGT GCTGTTCGGT
ATGGGACAGG TCGCTCACGG CCCTTATAAG CCGGGGACAT GGGCCTGCAA CCCACATATC
AAGGATTTCG ACTATAACCC CGCCCTGGCG AAACAGCTCC TGACCGAAGC GGGCTGGAGG
CAGGTGAACA GCGACGGAGT GCTGGTAAAG GACGGCAAGC CTTTCACCTT TACCATTCTC
ACCAACCAGG GGAACGACCA GCGTCTGAAA ACCGCCCAGA TAATCCAGCG CCGCCTGAAA
AATGTCGGTA TCGATGTGAA GATCCGGGTC CTCGAATTTG CGTCCCTCTT GACCAATTTT
ATCGACAAGG GGAACTTCGA CGCGCTGATC ATGGGGTGGA CCATCACCCA GGACCCAGAC
ATCTTCGACG TCTGGCATTC CAGCAAGACC GGACCGAAAG AGCTGAATTT CATCGGCTTC
AAGAACAAAG AGGTGGACCG GCTGATAGAG GAGGGGCGCA GCACCTTCGA TGTGGAAAAG
CGGAAACGTT GCTACTACCG CATCCAGGAG ATCCTTGCGG ACGAGCAGCC CTACACCTTC
TTGTATGTGC CGGATTCGCT GCCGGTGGTG AGCTCCCGCT TCCGCGGCAT TGAGCCGGCC
CCGGCAGGGA TCACGTACAA CTTCATCAAG TGGTATGTCC CCAAAGAGGA ACAGGTGTAC
TAA
 
Protein sequence
MRIGHAACLI LLSLVLLIAA CTDKQPVRQT VRGPGKPAYG DAIVVGSIGD ASNLIPLLAS 
DTPSFEIAGY VYNGLVKYDK DLNLVGDLAE SWDISKDGLT ITFHLRKGVK WHDGVEFTSA
DVLYTYRVTI DPKTPTAYSE DFKQVKRAEA PDRYTFSVTY GKPFAPALAS WGMNILPAHL
LEGKDITKSE LARRPVGTGP YRFKEWVAGQ KIVLDAYKDY FEGEPYIGRY VYRIIPDNST
MYMELKAGGL DMMGLTPVQY QRQTGTPDFK ARFNKYRYPA SAYTYLGYNL RNPMFADKRV
RQAITSAINK DEIIHGVLFG MGQVAHGPYK PGTWACNPHI KDFDYNPALA KQLLTEAGWR
QVNSDGVLVK DGKPFTFTIL TNQGNDQRLK TAQIIQRRLK NVGIDVKIRV LEFASLLTNF
IDKGNFDALI MGWTITQDPD IFDVWHSSKT GPKELNFIGF KNKEVDRLIE EGRSTFDVEK
RKRCYYRIQE ILADEQPYTF LYVPDSLPVV SSRFRGIEPA PAGITYNFIK WYVPKEEQVY