Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU1873 |
Symbol | pepF |
ID | 2686250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 2050853 |
End bp | 2052643 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637126564 |
Product | oligoendopeptidase F |
Protein accession | NP_952922 |
Protein GI | 39996971 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1164] Oligoendopeptidase F |
TIGRFAM ID | [TIGR02290] oligoendopeptidase, pepF/M3 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.201769 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACAG ATTTGAATTC CCTGCTCTGG GATACGTCGC CCCTCTACGC ATCCGCAACC TGTGAAGACA TAGACCGGGA CCTGTCCCGC GGCGCTGCGG AGGCCCAGGC GTTTCGTGAA CGCTACGCCG GAAAGGTGGC AACCCTTGAT GCGGCTGAGG TGGCAGATGC GGTGCGTCGC TACGAGGACC TCGGCGAGCT TCTGGCCAAA CCCCAGCTGT ACGCCCACCT CCTCTTTGCC GCCGACTCGG AGGCCGACGA TCACAAGCGG CTCTCCCAGC GGACTGCCGA GTTCGGCAAT CTCATGAGCC GCGAGCTTCT CTTCTTCGAC CTGGAGATCA TGGAAATAGC CGATGATCGC TTCGCCGGGC TGCTGTCCGA CGAGCTCCTC GCCCCCTACC GGCACTACCT GGAAAGCGTG CGGCGGTTTC GGCCCTACAC CCTCAAGGAA CGGGAGGAAC AGCTCCTGAA AATGAAGAGC CTCACGGGTA CCGATGCCTT TTCACGGCTC TTCGACGAGT TATCGGCCTC TCTTCGCTAT CGGATGGAAC TTGAGGGTGA AGAGCGGGAC TTCACCGGCG AGGAACTCCT CGGACTCCTC CACCACCCTG AGGCCGGGGT GCGTGAACGC GCCTTTGCCA CGTTCCTCAA TCGCCACGAA GAAAACGGCA TCGTTTTTTC GAGCGTGTTC AACAATGTTG CCCTGGATCA CTCCCAGGAA CTGGAGTTGA GGGGGTACCG CCACCCCATG GAACCGACGA ACCTGGGCAA TGATATCCCC GAAGAGGTGG TCAACCGGCT GATGGACGTG TCCGAGGCCA ACTATGGGCT GGCGCGGGAC TACTTCCGCC TGAAGGCCCG GCTCCTGGGA CTGCCGAAAC TGAAGAACAC CGACGTCTAT GCCCCCGTTG GGGACAACGA CCGGACGTAC TCCTTTGACG AGGCAAGGGA ACTGGTGCTT GAAGCCTACG GCCGCTTTCA CCCCCGTTTC GGGGAGATGG CCGCGGCGTT TTTCGACGAG CGGCGCATCG ATGTCCTCCC CCGCCCCGGC AAGAGTGGCG GAGCATTCTG CATGGGGATG ACCCCGCACC TCTCCCCCTA CCTGCTCCTG AACTACACCG GCAACCTGCG CGACGTGGCC ACCCTGGCCC ACGAGCTGGG GCACGGCCTC CACTTCGAGC TGGCCCGGAA ACAGACCATG CTCAACTACC ATGCCCCGCT GCCTCTGGCG GAAACGGCGT CGGTCTTCGG CGAGATGCTC CTGACTCGCT TCCTCCTCCA GCGGGAGAGC GACCCGGCCA TGAAAATATC GCTGCTGTGC GCCAAGATCG AGGACATCAT CGCCACGACC TTCCGCCAGA ACGTGCTCAC CCGCTTCGAG GAGCGGATGC ACCGGGAGCG GCAGGAGGGC CTCCTCACCT CATCACGGCT ATGCGATCTC TGGTGGGAGG AAAACGGCAG GCTTTACGGC GATGCCGTGG ACATGATCCC CCCCTACCGC TGGGGCTGGA GCTATATTTC TCACTTCATC CACGCGCGCT TCTACTGCTA CTCTTACACC TTTGCGGAGT TGCTGGTTCT TTCCCTCTAC CGCAACTACC TGGAACAGGG AGAACGGTTC ATCCCGACGT ACCTGTCCAT CCTGGAAAGC GGCGGCTCCC TCTCGCCCGC CGATACGGTC AGGCCCGCCG GCATCGACCT GGCTGACCCG CATTTCTGGC AGAAGGGATA CGATTTTCTT GCGGAACTGA TCGAAGAACT GAAGGGGCTG CTGGAACAGC GTCAGCATTG A
|
Protein sequence | MTTDLNSLLW DTSPLYASAT CEDIDRDLSR GAAEAQAFRE RYAGKVATLD AAEVADAVRR YEDLGELLAK PQLYAHLLFA ADSEADDHKR LSQRTAEFGN LMSRELLFFD LEIMEIADDR FAGLLSDELL APYRHYLESV RRFRPYTLKE REEQLLKMKS LTGTDAFSRL FDELSASLRY RMELEGEERD FTGEELLGLL HHPEAGVRER AFATFLNRHE ENGIVFSSVF NNVALDHSQE LELRGYRHPM EPTNLGNDIP EEVVNRLMDV SEANYGLARD YFRLKARLLG LPKLKNTDVY APVGDNDRTY SFDEARELVL EAYGRFHPRF GEMAAAFFDE RRIDVLPRPG KSGGAFCMGM TPHLSPYLLL NYTGNLRDVA TLAHELGHGL HFELARKQTM LNYHAPLPLA ETASVFGEML LTRFLLQRES DPAMKISLLC AKIEDIIATT FRQNVLTRFE ERMHRERQEG LLTSSRLCDL WWEENGRLYG DAVDMIPPYR WGWSYISHFI HARFYCYSYT FAELLVLSLY RNYLEQGERF IPTYLSILES GGSLSPADTV RPAGIDLADP HFWQKGYDFL AELIEELKGL LEQRQH
|
| |