Gene GSU1873 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1873 
SymbolpepF 
ID2686250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2050853 
End bp2052643 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content62% 
IMG OID637126564 
Productoligoendopeptidase F 
Protein accessionNP_952922 
Protein GI39996971 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID[TIGR02290] oligoendopeptidase, pepF/M3 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.201769 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAACAG ATTTGAATTC CCTGCTCTGG GATACGTCGC CCCTCTACGC ATCCGCAACC 
TGTGAAGACA TAGACCGGGA CCTGTCCCGC GGCGCTGCGG AGGCCCAGGC GTTTCGTGAA
CGCTACGCCG GAAAGGTGGC AACCCTTGAT GCGGCTGAGG TGGCAGATGC GGTGCGTCGC
TACGAGGACC TCGGCGAGCT TCTGGCCAAA CCCCAGCTGT ACGCCCACCT CCTCTTTGCC
GCCGACTCGG AGGCCGACGA TCACAAGCGG CTCTCCCAGC GGACTGCCGA GTTCGGCAAT
CTCATGAGCC GCGAGCTTCT CTTCTTCGAC CTGGAGATCA TGGAAATAGC CGATGATCGC
TTCGCCGGGC TGCTGTCCGA CGAGCTCCTC GCCCCCTACC GGCACTACCT GGAAAGCGTG
CGGCGGTTTC GGCCCTACAC CCTCAAGGAA CGGGAGGAAC AGCTCCTGAA AATGAAGAGC
CTCACGGGTA CCGATGCCTT TTCACGGCTC TTCGACGAGT TATCGGCCTC TCTTCGCTAT
CGGATGGAAC TTGAGGGTGA AGAGCGGGAC TTCACCGGCG AGGAACTCCT CGGACTCCTC
CACCACCCTG AGGCCGGGGT GCGTGAACGC GCCTTTGCCA CGTTCCTCAA TCGCCACGAA
GAAAACGGCA TCGTTTTTTC GAGCGTGTTC AACAATGTTG CCCTGGATCA CTCCCAGGAA
CTGGAGTTGA GGGGGTACCG CCACCCCATG GAACCGACGA ACCTGGGCAA TGATATCCCC
GAAGAGGTGG TCAACCGGCT GATGGACGTG TCCGAGGCCA ACTATGGGCT GGCGCGGGAC
TACTTCCGCC TGAAGGCCCG GCTCCTGGGA CTGCCGAAAC TGAAGAACAC CGACGTCTAT
GCCCCCGTTG GGGACAACGA CCGGACGTAC TCCTTTGACG AGGCAAGGGA ACTGGTGCTT
GAAGCCTACG GCCGCTTTCA CCCCCGTTTC GGGGAGATGG CCGCGGCGTT TTTCGACGAG
CGGCGCATCG ATGTCCTCCC CCGCCCCGGC AAGAGTGGCG GAGCATTCTG CATGGGGATG
ACCCCGCACC TCTCCCCCTA CCTGCTCCTG AACTACACCG GCAACCTGCG CGACGTGGCC
ACCCTGGCCC ACGAGCTGGG GCACGGCCTC CACTTCGAGC TGGCCCGGAA ACAGACCATG
CTCAACTACC ATGCCCCGCT GCCTCTGGCG GAAACGGCGT CGGTCTTCGG CGAGATGCTC
CTGACTCGCT TCCTCCTCCA GCGGGAGAGC GACCCGGCCA TGAAAATATC GCTGCTGTGC
GCCAAGATCG AGGACATCAT CGCCACGACC TTCCGCCAGA ACGTGCTCAC CCGCTTCGAG
GAGCGGATGC ACCGGGAGCG GCAGGAGGGC CTCCTCACCT CATCACGGCT ATGCGATCTC
TGGTGGGAGG AAAACGGCAG GCTTTACGGC GATGCCGTGG ACATGATCCC CCCCTACCGC
TGGGGCTGGA GCTATATTTC TCACTTCATC CACGCGCGCT TCTACTGCTA CTCTTACACC
TTTGCGGAGT TGCTGGTTCT TTCCCTCTAC CGCAACTACC TGGAACAGGG AGAACGGTTC
ATCCCGACGT ACCTGTCCAT CCTGGAAAGC GGCGGCTCCC TCTCGCCCGC CGATACGGTC
AGGCCCGCCG GCATCGACCT GGCTGACCCG CATTTCTGGC AGAAGGGATA CGATTTTCTT
GCGGAACTGA TCGAAGAACT GAAGGGGCTG CTGGAACAGC GTCAGCATTG A
 
Protein sequence
MTTDLNSLLW DTSPLYASAT CEDIDRDLSR GAAEAQAFRE RYAGKVATLD AAEVADAVRR 
YEDLGELLAK PQLYAHLLFA ADSEADDHKR LSQRTAEFGN LMSRELLFFD LEIMEIADDR
FAGLLSDELL APYRHYLESV RRFRPYTLKE REEQLLKMKS LTGTDAFSRL FDELSASLRY
RMELEGEERD FTGEELLGLL HHPEAGVRER AFATFLNRHE ENGIVFSSVF NNVALDHSQE
LELRGYRHPM EPTNLGNDIP EEVVNRLMDV SEANYGLARD YFRLKARLLG LPKLKNTDVY
APVGDNDRTY SFDEARELVL EAYGRFHPRF GEMAAAFFDE RRIDVLPRPG KSGGAFCMGM
TPHLSPYLLL NYTGNLRDVA TLAHELGHGL HFELARKQTM LNYHAPLPLA ETASVFGEML
LTRFLLQRES DPAMKISLLC AKIEDIIATT FRQNVLTRFE ERMHRERQEG LLTSSRLCDL
WWEENGRLYG DAVDMIPPYR WGWSYISHFI HARFYCYSYT FAELLVLSLY RNYLEQGERF
IPTYLSILES GGSLSPADTV RPAGIDLADP HFWQKGYDFL AELIEELKGL LEQRQH