Gene Gura_0879 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_0879 
Symbol 
ID5162652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp1043702 
End bp1045132 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content55% 
IMG OID640548375 
Productprotease Do 
Protein accessionYP_001229658 
Protein GI148262952 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value7.81376e-08 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATCC GTATTATTAT TTTTTTGTTG ATGCTGGCCA CCCTTCTTTC GGCTTGCAAA 
AAGAAGGAAG AAGCCTATTT CTTTGAATCA AACCGCAAAG GGACTGCCGA GGCCCCGGTA
AAAGAAGTAC CCAAGGACAT CCTTGCCACC CAGCAGGCTT TCGCCAATGT GGTAAAAGCG
GTTAACCCGG CAGTGGTCAA TATTTCCACG GTGAGCAAGA AAAAGCTCGT GCAGCCGTTC
TTCGAGATGT CCCCCCTTTT CGATGATTTT TTCGGCGGCA GGGGTGGGAC GCCCCAATAC
CGCCGTGAAA ACAGCCTGGG TTCCGGCTTT ATCATCAACC GGGACGGATA CATCATCACC
AACGATCATG TGGTGCGGGA TGCGGAGAGC ATCCAGGTAA AACTTTCCAA CGAAAATGTT
TATAGCGGCA AAGTGGTCGG CAGCGATCCC AAAACCGACA TCGCGGTGAT CAAGATCAAC
GCCAAGGAAC AGCTCCCGGT GGCAGTGCTG GGCGATTCCG ACAAACTCCA GGTAGGACAG
TGGGCCATTG CTATCGGCAA CCCGTTCGGT CTCGACCGGA CCGTTACCGT CGGGGTGGTG
TCGGCGACCG GCCGTTCCAA TATGGGGATC GAAACCTACG AGAACTTCAT CCAGACGGAC
GCTTCCATCA ACCCGGGCAA TTCCGGCGGA CCGCTCTTGA ATGTCTATGG CGAGGTAATC
GGCATCAATA CTGCCATTGT AGCGGCAGGC CAGGGCATCG GCTTCGCCAT TCCCATAAAT
ATGGCGAAAC GGGCAGTGCC GCAGTTGATA AAAAAGGGGA ATGTCAGCCG CGGTTGGCTG
GGTGTTTCCA TTCAGCCGGT GACGGAAGAG ATTGCCCAGT CCTTTGGCTT GAAACGGGCA
CAAGGTGCCT TGGTGAGCGA TATAATGGCG GGGAGCCCTG CTGCCAAGGC CGGCCTCAGG
CAGGGTGACA TCATAACCGG GATTGCCGGC AAAGAGATAA AGTCCGTCCA GCAACTCCAG
TTGCTGGTGG CTGATATGCC GGTCGGCTCT CCGGTGGAGA TAGAGGTTTT CCGCGAAGGC
CGGGCAAAAA AACTTTCAAT CATCCCTGCC TCCGCTGACA GTGCCGCGGG TGCGAAGCCT
AAATCGGTCG AGACGGAAAC GGCATGGGTC GGATTGTCGG TAGAGGAACT CCCCCGTGAT
ATACGACTGA AGGGTCTGCA GGGTGTTGTC GTGACGAGCG TTGAGCCGGG CAGCCTTGCT
GCCGACAGCG GCATCCAGCA GGGCGATGTC GTTGTTTCAG TCAACCAGAG AAAGATCGCC
GGTGTGAACG ACTATGCAAA GGCCATGAAA GATGCTGAAA AGAAAGGGTC CGTAGCCCTG
CTGGTAAGGC GGGGGGATGC CAGCATCTAT TTTGCGATCA GAATAAAGTA G
 
Protein sequence
MKIRIIIFLL MLATLLSACK KKEEAYFFES NRKGTAEAPV KEVPKDILAT QQAFANVVKA 
VNPAVVNIST VSKKKLVQPF FEMSPLFDDF FGGRGGTPQY RRENSLGSGF IINRDGYIIT
NDHVVRDAES IQVKLSNENV YSGKVVGSDP KTDIAVIKIN AKEQLPVAVL GDSDKLQVGQ
WAIAIGNPFG LDRTVTVGVV SATGRSNMGI ETYENFIQTD ASINPGNSGG PLLNVYGEVI
GINTAIVAAG QGIGFAIPIN MAKRAVPQLI KKGNVSRGWL GVSIQPVTEE IAQSFGLKRA
QGALVSDIMA GSPAAKAGLR QGDIITGIAG KEIKSVQQLQ LLVADMPVGS PVEIEVFREG
RAKKLSIIPA SADSAAGAKP KSVETETAWV GLSVEELPRD IRLKGLQGVV VTSVEPGSLA
ADSGIQQGDV VVSVNQRKIA GVNDYAKAMK DAEKKGSVAL LVRRGDASIY FAIRIK