Gene Gura_0966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_0966 
Symbol 
ID5166755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp1144550 
End bp1147480 
Gene Length2931 bp 
Protein Length976 aa 
Translation table11 
GC content61% 
IMG OID640548462 
Productpeptidase S9B dipeptidylpeptidase IV subunit 
Protein accessionYP_001229745 
Protein GI148263039 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0823] Periplasmic component of the Tol biopolymer transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000198927 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAA TCCTCCTGCT TCTCGTCATC CTCCTTGCCG CATTTGTCTC CCCTGCAGCC 
GCCGCAAGAC TGGACACCTC GTTCTCCTTT TCGACCATTG AAACCGACCA CTTTTCCATC
CATTTTCACC AGGGACTGGA AGAGATTGCC CAACGGGCCG CTGTGCTTGC CGAGGACGCC
CACGGCAAGC TGACCGCCCA GTTCGACTGG AAACCGCGGG AAAAGACCCA GCTGGTGCTC
ATCGACAATA CCGACTTTAC CAACGGCTTC GCCACCCCAC TCCCCTATAA TACCGTCTTC
ATCCAGGTGG TCCCCCCCTC CATCGATTCG ACGCTGGGGG AATACGATGA CTGGCTGAAG
GAGATCATCT TCCACGAATA CGCCCACATC GTCACCTCTG ATACGGCCCG GGGCTATTCC
CGCATCACCC GCTCCATCTT CGGCAAGCCC ATTATTCCAG GTGACATTAT CAGCCTCGCC
ATTTTCATTT TCACCGCCCC CCCAAATGTG TTCATGCCCC ATTGGTGGCA CGAGGGGATG
GCGACCTGGA GCGAGACCGA GTTCACCGGC GTGGGACGCG GCCGGAGCGC CACCTACGAG
ATGATCCTCC GCTCGGCAGT GGCCGCGAAC AGCCTCCCCT CCATCGACAA GGTGAACGGC
GAGGTTCCCT ACTGGCCCAA CGGCCATCTC CCTTACATCT TCGGCCTGCG CCTGTCCAAG
TTCATCGCCG ACAAGTACGG CAAGGAAACC CTGGGAAAGC TGAGCCTGGC CCAGGCCGGC
CGGGTTCCCT ACGCCATCGG CACGCCTCCG GAAGAATTCT GTAACGGCAA GGATTACGCC
GAACTCTACC GGGACATGCT TGACGACCTG AAAAAGGAAG AGCAGAGCCG CATCGCCACA
CTGGAAAAAG CCCCGTTCAC CCCGCTTAAG GTCTTAAGCA CCGTCGGCGA GAACCTCACC
AATCCGCGCT TTTCCCCGGA CGGAGGCAGG CTCGCCTTCA ACGTCAACGA CCCCCACGGC
CACAGCAAGA CCGTCATCAC CAGCCGGGAC GGGGCAACCA TCCAGGCGGA ATTCCGCCGC
CAGTTCTCGG ACCAGGGACT CTCCTGGTCC CCCGACGGCA ACCGGCTCTA CTTTGCCCAA
GCCGAGGTGG TGCGGGGCTT CGACATTTAC CAGGACCTCT ACGCCTACGA CCTCCGCCGC
GACCGGGTGC AGCGGCTGAC CAGCGGCCTG CGCATCAAGG ACCCGGAGAT CTCCCCCGAC
GGAAAATACT TTGCGCTGGC GGTAAACGAC CGGGGAAGTC AGAACCTGGC CCTGCTGGAT
GCCCGTGAGA CACTCGACGG CCGCAACAAC CTGAAGCCAC GCCTGGTTAC CGCTTACCGG
CAGGAAAGGG TTGCCACCCC CCGATGGGCC CCCGATTCCA GGACCATCGC CTATGCGGTG
ACCGACAACC GGGGCAAGAC CTCACTCCGC CTCTATGATG TCAAGAGCGG CCAGGACAGA
CCGCTCTTGA CGGTCAGCTA CAACGCCGCC TACCCCAGCT GGTCCCGGGA CGGGAGGATC
ATCTACTACG TGTCCGACGA AACCGGGGTC TACAACCTGT TTGCCTACGA CCTGAAGGAG
GAGAAAAGTT ACCAGGTCAG CCACCTTCTC GGCGGAGCCA TGCAGCCCGA CGCAGCCCCG
GACGACGACA CCCTGGTCTT CAGCAGTTAT ACCGCGAGGG GGTTCAACAT CGCGTCCCTC
ACCCTCGACC GGAACAGCTG GACATTACAG CGGGGACCGT CCATCACCCC ATACTGGCAG
GATGCCGGGC CGGCAGCAGC AGAACGTAGC CTGCCCGGCG ACAATCCGGC CATCGGCAAA
CCGGCCCCTT ACTCTCCCTG GGAGACTCTT TACCCACGTT TCTGGCTGCC GCGGATCTAC
AGCGAGGACC AGGACCATAA CGCTTTCGGT GCTTTTACCG CTGGCCAGGA CGTGCTCGGC
TACAACAGCT ATCTCGCAGA ATTAACCTAC GGCACCGGGC ATCACAAGGT CTATTACAAC
CTCGCCTACC GGAACGATTA CCTCTACCCG TCGTTTCTCC TTCAGAGCTA CGCCCAGCCG
GTCTTTTATT CCGACCTGCT GCAGCGGGGT GACTATTACG AGCAGAACCG GAGCCTGATC
CTCGAAACGA GCGTGCCGCT CAATTTCCTC GAATCATCCT ACCGCCTCTT CTTCGGCTAC
CACCTCCAGG ACCAGTCGGC CTTAAGCAGG CTGCAGAACG ACCGTTTCAA CGGTCTGCCG
GTCTTCCAGG GTCGCCGCGA CAACATCTTC GCCGGCATCG AGTTCGCCGA CAACCTGAAA
TACCCTTACT CAATCAGCCA CGAGGAAGGG CGCACCATCT CCTTCACCTA CCGGAACTAC
TCCCGGCAGC GGGGCTCAGA CCTGGACGGG GAGGAGTACC TGGCCGCCTA CACCGAGTAC
CTCCATCTCC CCTCACAGCC TCTGCGGCAT CATGTGCTCA CCTACAGCCT GAACGGCGGC
GTGGCCACCG GCGAACGCAC CGTGCAGCAG GCTTTCCAGC TGGGAGGCGA GCCGGGCAAC
CTCGTCCAGT TCCCCCTGCG CGGTTACCCG GCCCGCTTCG AGACCGGCAA GTATGTCGCC
ACCGGCACCG TCGAGTACCG GGCTCCGCTC TGGTATCTGC TGCGTGGTTT CGGTACCAAG
CCGTTCTTTT TCGACAGGCT GCACGGAGCC GTCTTTACCG ATGTGGGGGA AGTCTGGGAC
GACCAGCGTT CGTTCAAGCT GGACCGGCTC AAGGTAGGGG CAGGAGTGGA GGGACGCTTC
GACATGACCC TCGGTTACTG GCTGAAGATC ACCCCGGCGG TCGGCTATGC CCACGGATTT
AACCAGGGGG GAGAGGACCG GATTTACTTC ACAATCTACG CCAATCTTTA G
 
Protein sequence
MNKILLLLVI LLAAFVSPAA AARLDTSFSF STIETDHFSI HFHQGLEEIA QRAAVLAEDA 
HGKLTAQFDW KPREKTQLVL IDNTDFTNGF ATPLPYNTVF IQVVPPSIDS TLGEYDDWLK
EIIFHEYAHI VTSDTARGYS RITRSIFGKP IIPGDIISLA IFIFTAPPNV FMPHWWHEGM
ATWSETEFTG VGRGRSATYE MILRSAVAAN SLPSIDKVNG EVPYWPNGHL PYIFGLRLSK
FIADKYGKET LGKLSLAQAG RVPYAIGTPP EEFCNGKDYA ELYRDMLDDL KKEEQSRIAT
LEKAPFTPLK VLSTVGENLT NPRFSPDGGR LAFNVNDPHG HSKTVITSRD GATIQAEFRR
QFSDQGLSWS PDGNRLYFAQ AEVVRGFDIY QDLYAYDLRR DRVQRLTSGL RIKDPEISPD
GKYFALAVND RGSQNLALLD ARETLDGRNN LKPRLVTAYR QERVATPRWA PDSRTIAYAV
TDNRGKTSLR LYDVKSGQDR PLLTVSYNAA YPSWSRDGRI IYYVSDETGV YNLFAYDLKE
EKSYQVSHLL GGAMQPDAAP DDDTLVFSSY TARGFNIASL TLDRNSWTLQ RGPSITPYWQ
DAGPAAAERS LPGDNPAIGK PAPYSPWETL YPRFWLPRIY SEDQDHNAFG AFTAGQDVLG
YNSYLAELTY GTGHHKVYYN LAYRNDYLYP SFLLQSYAQP VFYSDLLQRG DYYEQNRSLI
LETSVPLNFL ESSYRLFFGY HLQDQSALSR LQNDRFNGLP VFQGRRDNIF AGIEFADNLK
YPYSISHEEG RTISFTYRNY SRQRGSDLDG EEYLAAYTEY LHLPSQPLRH HVLTYSLNGG
VATGERTVQQ AFQLGGEPGN LVQFPLRGYP ARFETGKYVA TGTVEYRAPL WYLLRGFGTK
PFFFDRLHGA VFTDVGEVWD DQRSFKLDRL KVGAGVEGRF DMTLGYWLKI TPAVGYAHGF
NQGGEDRIYF TIYANL