Gene Gura_4143 details


Gene Information

Locus tag: Gura_4143
Symbol:
ID: 5165718
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter uraniireducens Rf4
Kingdom: Bacteria
Replicon accession: NC_009483
Strand:
Start bp: 4801991
End bp: 4803034
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 63%
IMG OID: 640551621
Product: peptidase M48, Ste24p
Protein accession: YP_001232859
Protein GI: 148266153
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
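
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch in Python, assuming that Start bp and End bp are 1-based, inclusive positions (the record does not state the convention) and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
start_bp = 4801991
end_bp = 4803034
gene_length_bp = 1044
protein_length_aa = 347

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is the stop codon and encodes no residue.
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # prints: 1044 0 347
```

Under these assumptions the computed span and protein length agree with the Gene Length and Protein Length fields.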


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.724866
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTCCCG TCCGCGCCTA TTACTACGAC GGCAAGATTT CTGCCCGGCG CATTGTCTCG 
CTGGAGCGCA TCGGTGAAAA CCTGCGGCTC CATGGCGACG GGGTGGATGT GAACTACCCC
GTTGCATCGG TCCGCGTCTC TCCCCCCATC GGCCATGTAC GCCGCTCGCT CCGTTTCCCC
AATGGAATCC TCTGCGAGAT CATCGACGAC GCGTCGCTTG GAGAACTACT GGGAAACGAC
GGCGGCCTTA TCCCGCGACT GCTCGCACGC TGGGAGCGAA GCATCCCGTT GGCCCTGTTG
GCAATGGTAC TGACCGTCGC AACAATTGTG CTGTTCATCA AGTACGGCCT GCCGGCGGTT
GCCCGTCACG TGGTCCATGC GGTCCCCGCC TCCTCGGAGG CGAATCTGGG CAGGGAATCA
CTGGCTTTTC TCGACAAATA CATGATGCAG CCGTCAAAGT TGCCGGAGGC CAGGCGGCGA
GAGGTCATGG CCCTCTTCAA AAGGATGCGG GACACCTTGC CGGAAGCCAA TGGCTACCGT
CTGGAATTCC GCTCCAGCGA TCAAATCGGC GCCAACGCCT TCGCCCTGCC CGGCGGGACG
ATCGTCGTCA CCGACGGCAT GGTGGAGCTG GCAAAGCGCG ACGAGGATCT GACCGGCGTC
TTAGCCCACG AGGCGGGTCA TGTCCATAAC CGCCATGCCC TGCGCCACGT CCTGCAAAGC
ACCGGCAGCG GACTCCTCAT CGCGGCCGTC ACCGGGGACA TCACCTCCAT CACCTCCCTT
TCGGCGACGC TGCCGACGGC GCTCGTCAAT GCCGGCTATT CACGAGAATT CGAAAACGAG
GCGGATGATG CGGCGGTTGC CTACCTGGTC AAGGCCGGGA TAGAGCCGAA GACCTACGCG
GAGATGCTGG CCAGGCTTCA GGCAGAACAC GACAAGCGGG CAGGCAAGAA GGGAGATACC
CGGCACTGGA GCCCCGCGGA TCTCTTCGCG TCCCACCCGG AGACCTCCGA GCGGATCAGA
CGGGTGCTCG GGAAGCGCAA ATGA
 
Protein sequence
MSPVRAYYYD GKISARRIVS LERIGENLRL HGDGVDVNYP VASVRVSPPI GHVRRSLRFP 
NGILCEIIDD ASLGELLGND GGLIPRLLAR WERSIPLALL AMVLTVATIV LFIKYGLPAV
ARHVVHAVPA SSEANLGRES LAFLDKYMMQ PSKLPEARRR EVMALFKRMR DTLPEANGYR
LEFRSSDQIG ANAFALPGGT IVVTDGMVEL AKRDEDLTGV LAHEAGHVHN RHALRHVLQS
TGSGLLIAAV TGDITSITSL SATLPTALVN AGYSREFENE ADDAAVAYLV KAGIEPKTYA
EMLARLQAEH DKRAGKKGDT RHWSPADLFA SHPETSERIR RVLGKRK
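
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch in Python, not the method used by the database. It assumes that translation table 11 assigns the same amino acids as the standard genetic code (the two differ in which start codons are permitted), and `translate` is a hypothetical helper written for this illustration. Only the first and last lines of the gene sequence are copied in; both begin on a codon boundary because each full line holds 60 bases.

```python
from itertools import product

# Standard codon table in TCAG order; assumed here to match the amino acid
# assignments of translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above.
first_line = "ATGAGTCCCG TCCGCGCCTA TTACTACGAC GGCAAGATTT CTGCCCGGCG CATTGTCTCG"
last_line = "CGGGTGCTCG GGAAGCGCAA ATGA"

print(translate(first_line))  # prints: MSPVRAYYYDGKISARRIVS
print(translate(last_line))   # prints: RVLGKRK
```

The two outputs match the first 20 and last 7 residues of the protein sequence listed above.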