Gene Gura_2055 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_2055 
Symbol 
ID5165698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp2405194 
End bp2406969 
Gene Length1776 bp 
Protein Length591 aa 
Translation table11 
GC content57% 
IMG OID640549550 
ProductpepF/M3 family oligoendopeptidase 
Protein accessionYP_001230818 
Protein GI148264112 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID[TIGR02290] oligoendopeptidase, pepF/M3 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00862679 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGATACG ATATAAAAGA TCTGTTGTGG AACACCGCCC CCCTCTATAC CGGACCTGAG 
TCACCGGACC TGGAAGGTGA TTTTGAAGCA GCAGCAACAG GAGCGAATGG GTTCAGGGAA
CGCTACCGGG GGCGTGTCGC GGGCTTGGAT GTCGTTGAAC TGCAAAAGGC GCTGGTCGAA
TACGAAGAGC TCGAAGAACT AATCGTCAAA CCGCAGCTCT ATGCCCATCT CCTCTTTGCC
GCTGACTCGG AAAACGACGT CAACAAGCGC CTCTCTCAAA AGGCAGCGGA ATTCGGCAAC
CTGATGAGCA GGGAACTCCT GTTCTTCGAC CTGGAGATCA TCCAGATGGA GGACAAAGCC
TTTGCGCAAT TGATCGGCGA CGAACGGCTC GCTAACTACC GTCACTACAT GGAAAGCCTG
CGTAAATTCC ACCCCCACAC CCTGACTGAG CGGGAAGAAA GCCTGTTAAA ACAGAAGAGT
CTGACCGGTA CAGAGGCGTT CTCCCGCCTG TTCGACGAGG TATCAGCATC ATTCCGCTAC
ACCATGACTC TCGATGGGGA AGAACGGGAG TTTACCGGCG AGGAGCTGTT GGGACTGCTC
CATCATACCG ACGCCATGGT CAGGGAACAG GCATTCGCTA CTTTCCTCAA GCGCCACGAG
GAACAGGGGA TCATCTTTTC TTCCGTTTTC AATACCGTTG CCCTCGACCA TGGGCAGGAC
CTGGAACTGC GCAACTACAA AAGCCCCATG GAGCCGACCA ACCTGGGTAA CGAGATCCCT
GCCGAGGTAG TAGAGCGGAT GATGTCCGTT TCCGAGGCCA ATTACCCGCT GGCCCAGGAG
TACTTCCGCC TCAAGGCGAA ACTGCTGAAT CTGGATAAGC TGAAAAACAC CGACGTCTAC
GCGCCGGTTG GGGAAATAGA GCAACACTAT ACCTTTGCCG AGGCCCGCGA CCTGGTGATT
GCCGCCTATG ACCGGTTCTC ACCGGAATTT CGGGATATAG CCGCCGCCTT TTTCAAGGAC
GGCAGGATTG ACGCCCTTCC CCGCATCGGC AAGAGCGGCG GCGCCTTCTG CATGGGAATG
ACCCCGCGAC TCGCGCCATA CGTGCTTCTC AACTTTACCG GCAACCTGCG CGACGTGGCC
ACCGTAGCGC ACGAACTGGG GCACGGCATC CACTTCACCC TCGCCCAACG CCAGACCATG
GTCAACTACC ATGCACCGCT CCCCCTGGCG GAAACGGCAT CGGTCTTCGG CGAAATGCTC
CTCACCCGAC ACATGCTGGA GGGTGAAACG GACAAGCAGG TGAAGATCGC CCTTCTTTGC
GCCAAGATCG AGGACATCAT CGCCACCACC TTTCGTCAGA ACGTCCTGAC CCGTTTTGAA
GAGCGGATGC ACCTGGAGCG GAAGAAGGGG CTACTGACCG CGACGCAGCT CTGCGACCTG
TGGTGGGAAG AAAACGCCAG GCTTTACGGC GATTCAGTGG AGATGATCGA AGCATACCGC
TGGGGATGGA GTTACATCTC TCATTTCATT CACACCCGGT TCTACTGCTA TTCTTACACC
TTTGCCGAAC TCCTCGTCCT CTCCCTCTAC CAGAGATACC TCAAGGAAGG AGACGCATTC
ATCCCCACCT ACCGGGAGAT CCTTGCCGGA GGCGGCTCCA AGTCACCGGC CGACACGGTC
AGACCGGCCG GCATCGACCT TGCCGACCCG GACTTCTGGC AGAATGGCTA TGACGTCCTG
ACCGGCCTGC TTGAAGAACT GAAACAGCTG GTCTGA
 
Protein sequence
MGYDIKDLLW NTAPLYTGPE SPDLEGDFEA AATGANGFRE RYRGRVAGLD VVELQKALVE 
YEELEELIVK PQLYAHLLFA ADSENDVNKR LSQKAAEFGN LMSRELLFFD LEIIQMEDKA
FAQLIGDERL ANYRHYMESL RKFHPHTLTE REESLLKQKS LTGTEAFSRL FDEVSASFRY
TMTLDGEERE FTGEELLGLL HHTDAMVREQ AFATFLKRHE EQGIIFSSVF NTVALDHGQD
LELRNYKSPM EPTNLGNEIP AEVVERMMSV SEANYPLAQE YFRLKAKLLN LDKLKNTDVY
APVGEIEQHY TFAEARDLVI AAYDRFSPEF RDIAAAFFKD GRIDALPRIG KSGGAFCMGM
TPRLAPYVLL NFTGNLRDVA TVAHELGHGI HFTLAQRQTM VNYHAPLPLA ETASVFGEML
LTRHMLEGET DKQVKIALLC AKIEDIIATT FRQNVLTRFE ERMHLERKKG LLTATQLCDL
WWEENARLYG DSVEMIEAYR WGWSYISHFI HTRFYCYSYT FAELLVLSLY QRYLKEGDAF
IPTYREILAG GGSKSPADTV RPAGIDLADP DFWQNGYDVL TGLLEELKQL V