Gene Information |
Locus tag | Gura_0966 |
Symbol | |
ID | 5166755 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | - |
Start bp | 1144550 |
End bp | 1147480 |
Gene Length | 2931 bp |
Protein Length | 976 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640548462 |
Product | peptidase S9B dipeptidylpeptidase IV subunit |
Protein accession | YP_001229745 |
Protein GI | 148263039 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0823] Periplasmic component of the Tol biopolymer transport system |
TIGRFAM ID | |
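The coordinate and length fields above agree with each other, and a short check makes the arithmetic explicit. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the convention under which 1144550 to 1147480 gives 2931 bp) and that the last codon of the CDS is a stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS that ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1144550, 1147480)
print(length, protein_length(length))
```

With the values from this record, the sketch reproduces the listed gene length of 2931 bp and protein length of 976 aa.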
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000198927 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAA TCCTCCTGCT TCTCGTCATC CTCCTTGCCG CATTTGTCTC CCCTGCAGCC GCCGCAAGAC TGGACACCTC GTTCTCCTTT TCGACCATTG AAACCGACCA CTTTTCCATC CATTTTCACC AGGGACTGGA AGAGATTGCC CAACGGGCCG CTGTGCTTGC CGAGGACGCC CACGGCAAGC TGACCGCCCA GTTCGACTGG AAACCGCGGG AAAAGACCCA GCTGGTGCTC ATCGACAATA CCGACTTTAC CAACGGCTTC GCCACCCCAC TCCCCTATAA TACCGTCTTC ATCCAGGTGG TCCCCCCCTC CATCGATTCG ACGCTGGGGG AATACGATGA CTGGCTGAAG GAGATCATCT TCCACGAATA CGCCCACATC GTCACCTCTG ATACGGCCCG GGGCTATTCC CGCATCACCC GCTCCATCTT CGGCAAGCCC ATTATTCCAG GTGACATTAT CAGCCTCGCC ATTTTCATTT TCACCGCCCC CCCAAATGTG TTCATGCCCC ATTGGTGGCA CGAGGGGATG GCGACCTGGA GCGAGACCGA GTTCACCGGC GTGGGACGCG GCCGGAGCGC CACCTACGAG ATGATCCTCC GCTCGGCAGT GGCCGCGAAC AGCCTCCCCT CCATCGACAA GGTGAACGGC GAGGTTCCCT ACTGGCCCAA CGGCCATCTC CCTTACATCT TCGGCCTGCG CCTGTCCAAG TTCATCGCCG ACAAGTACGG CAAGGAAACC CTGGGAAAGC TGAGCCTGGC CCAGGCCGGC CGGGTTCCCT ACGCCATCGG CACGCCTCCG GAAGAATTCT GTAACGGCAA GGATTACGCC GAACTCTACC GGGACATGCT TGACGACCTG AAAAAGGAAG AGCAGAGCCG CATCGCCACA CTGGAAAAAG CCCCGTTCAC CCCGCTTAAG GTCTTAAGCA CCGTCGGCGA GAACCTCACC AATCCGCGCT TTTCCCCGGA CGGAGGCAGG CTCGCCTTCA ACGTCAACGA CCCCCACGGC CACAGCAAGA CCGTCATCAC CAGCCGGGAC GGGGCAACCA TCCAGGCGGA ATTCCGCCGC CAGTTCTCGG ACCAGGGACT CTCCTGGTCC CCCGACGGCA ACCGGCTCTA CTTTGCCCAA GCCGAGGTGG TGCGGGGCTT CGACATTTAC CAGGACCTCT ACGCCTACGA CCTCCGCCGC GACCGGGTGC AGCGGCTGAC CAGCGGCCTG CGCATCAAGG ACCCGGAGAT CTCCCCCGAC GGAAAATACT TTGCGCTGGC GGTAAACGAC CGGGGAAGTC AGAACCTGGC CCTGCTGGAT GCCCGTGAGA CACTCGACGG CCGCAACAAC CTGAAGCCAC GCCTGGTTAC CGCTTACCGG CAGGAAAGGG TTGCCACCCC CCGATGGGCC CCCGATTCCA GGACCATCGC CTATGCGGTG ACCGACAACC GGGGCAAGAC CTCACTCCGC CTCTATGATG TCAAGAGCGG CCAGGACAGA CCGCTCTTGA CGGTCAGCTA CAACGCCGCC TACCCCAGCT GGTCCCGGGA CGGGAGGATC ATCTACTACG TGTCCGACGA AACCGGGGTC TACAACCTGT TTGCCTACGA CCTGAAGGAG GAGAAAAGTT ACCAGGTCAG CCACCTTCTC GGCGGAGCCA TGCAGCCCGA CGCAGCCCCG GACGACGACA CCCTGGTCTT CAGCAGTTAT ACCGCGAGGG GGTTCAACAT CGCGTCCCTC ACCCTCGACC GGAACAGCTG GACATTACAG CGGGGACCGT CCATCACCCC ATACTGGCAG 
GATGCCGGGC CGGCAGCAGC AGAACGTAGC CTGCCCGGCG ACAATCCGGC CATCGGCAAA CCGGCCCCTT ACTCTCCCTG GGAGACTCTT TACCCACGTT TCTGGCTGCC GCGGATCTAC AGCGAGGACC AGGACCATAA CGCTTTCGGT GCTTTTACCG CTGGCCAGGA CGTGCTCGGC TACAACAGCT ATCTCGCAGA ATTAACCTAC GGCACCGGGC ATCACAAGGT CTATTACAAC CTCGCCTACC GGAACGATTA CCTCTACCCG TCGTTTCTCC TTCAGAGCTA CGCCCAGCCG GTCTTTTATT CCGACCTGCT GCAGCGGGGT GACTATTACG AGCAGAACCG GAGCCTGATC CTCGAAACGA GCGTGCCGCT CAATTTCCTC GAATCATCCT ACCGCCTCTT CTTCGGCTAC CACCTCCAGG ACCAGTCGGC CTTAAGCAGG CTGCAGAACG ACCGTTTCAA CGGTCTGCCG GTCTTCCAGG GTCGCCGCGA CAACATCTTC GCCGGCATCG AGTTCGCCGA CAACCTGAAA TACCCTTACT CAATCAGCCA CGAGGAAGGG CGCACCATCT CCTTCACCTA CCGGAACTAC TCCCGGCAGC GGGGCTCAGA CCTGGACGGG GAGGAGTACC TGGCCGCCTA CACCGAGTAC CTCCATCTCC CCTCACAGCC TCTGCGGCAT CATGTGCTCA CCTACAGCCT GAACGGCGGC GTGGCCACCG GCGAACGCAC CGTGCAGCAG GCTTTCCAGC TGGGAGGCGA GCCGGGCAAC CTCGTCCAGT TCCCCCTGCG CGGTTACCCG GCCCGCTTCG AGACCGGCAA GTATGTCGCC ACCGGCACCG TCGAGTACCG GGCTCCGCTC TGGTATCTGC TGCGTGGTTT CGGTACCAAG CCGTTCTTTT TCGACAGGCT GCACGGAGCC GTCTTTACCG ATGTGGGGGA AGTCTGGGAC GACCAGCGTT CGTTCAAGCT GGACCGGCTC AAGGTAGGGG CAGGAGTGGA GGGACGCTTC GACATGACCC TCGGTTACTG GCTGAAGATC ACCCCGGCGG TCGGCTATGC CCACGGATTT AACCAGGGGG GAGAGGACCG GATTTACTTC ACAATCTACG CCAATCTTTA G
Protein sequence | MNKILLLLVI LLAAFVSPAA AARLDTSFSF STIETDHFSI HFHQGLEEIA QRAAVLAEDA HGKLTAQFDW KPREKTQLVL IDNTDFTNGF ATPLPYNTVF IQVVPPSIDS TLGEYDDWLK EIIFHEYAHI VTSDTARGYS RITRSIFGKP IIPGDIISLA IFIFTAPPNV FMPHWWHEGM ATWSETEFTG VGRGRSATYE MILRSAVAAN SLPSIDKVNG EVPYWPNGHL PYIFGLRLSK FIADKYGKET LGKLSLAQAG RVPYAIGTPP EEFCNGKDYA ELYRDMLDDL KKEEQSRIAT LEKAPFTPLK VLSTVGENLT NPRFSPDGGR LAFNVNDPHG HSKTVITSRD GATIQAEFRR QFSDQGLSWS PDGNRLYFAQ AEVVRGFDIY QDLYAYDLRR DRVQRLTSGL RIKDPEISPD GKYFALAVND RGSQNLALLD ARETLDGRNN LKPRLVTAYR QERVATPRWA PDSRTIAYAV TDNRGKTSLR LYDVKSGQDR PLLTVSYNAA YPSWSRDGRI IYYVSDETGV YNLFAYDLKE EKSYQVSHLL GGAMQPDAAP DDDTLVFSSY TARGFNIASL TLDRNSWTLQ RGPSITPYWQ DAGPAAAERS LPGDNPAIGK PAPYSPWETL YPRFWLPRIY SEDQDHNAFG AFTAGQDVLG YNSYLAELTY GTGHHKVYYN LAYRNDYLYP SFLLQSYAQP VFYSDLLQRG DYYEQNRSLI LETSVPLNFL ESSYRLFFGY HLQDQSALSR LQNDRFNGLP VFQGRRDNIF AGIEFADNLK YPYSISHEEG RTISFTYRNY SRQRGSDLDG EEYLAAYTEY LHLPSQPLRH HVLTYSLNGG VATGERTVQQ AFQLGGEPGN LVQFPLRGYP ARFETGKYVA TGTVEYRAPL WYLLRGFGTK PFFFDRLHGA VFTDVGEVWD DQRSFKLDRL KVGAGVEGRF DMTLGYWLKI TPAVGYAHGF NQGGEDRIYF TIYANL
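The protein sequence can be derived from the gene sequence by codon-by-codon translation. This is a minimal sketch, assuming the gene sequence is already given in coding orientation (it begins with ATG even though the gene is on the minus strand) and that whitespace between the 10-base groups should be ignored. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, so a standard codon table is used here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Amino acid assignments are those shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAATAAAA TCCTCCTGCT TCTCGTCATC"))  # MNKILLLLVI
```

Applied to the first 30 bases of the gene sequence, the sketch yields MNKILLLLVI, which matches the first ten residues of the listed protein sequence.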