Gene Gura_0954 details

Gene Information

Locus tag: Gura_0954
Symbol:
ID: 5166233
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter uraniireducens Rf4
Kingdom: Bacteria
Replicon accession: NC_009483
Strand:
Start bp: 1129587
End bp: 1130888
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 57%
IMG OID: 640548450
Product: peptidase M23B
Protein accession: YP_001229733
Protein GI: 148263027
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0739] Membrane proteins related to metalloendopeptidases
TIGRFAM ID:
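
The coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Residues encoded by a CDS of the given length, excluding the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # one codon is the stop codon


# Values taken from the record above.
length = gene_length(1129587, 1130888)
print(length)                           # 1302
print(expected_protein_length(length))  # 433
```

Both results agree with the Gene Length and Protein Length fields under these assumptions.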


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000982445
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAACCT TTATATCCAT CGTGATCGTG GTGGTGTTGG GCATCACCGC TTATTATTTT 
TTGGGCACGC CAGCGCCTGC GGTGTCTCTG ACGCCCGATG GCGGGCCGAT CTCCGCAAGC
CGCGAACTGG CCATTAAGCT GGACGCATCC GGAGCCCTGC TTAAAAAACT TACGGTGAGG
GCCCTTCAAG GCGATAAGAC GGTCGATGTC CTGGTCAAGG ATTATCCCAA AGGCACCCAC
CAAGCCGGAG AAACCTTCAA TTTCAGTCGG GCCGGCCTCA GGGAAGGCCC CTTCCAGCTG
CAGATCAAGT CGACAGATTC TCCCCTGCAT TTCGGCGCCG GCAACAGTAC TACACTTCTC
CGCTCTTACG ACATGCAGAA TACGCCGCCA GTTGTCGCGG TGTTGAGCGC TGCCCACAAC
ATCACCCGCG GAGGCGCCGG TCTGGTGGTA TACACGCTCC CCAGAGAGGT TGAGAAAACA
GGTGTAAAGT ACGCTGACCA GTTCTCTCCC GGGTACAGGC AGGGGGGGGG CTTTTACGCC
TGTCTCTTTC CTTTCCCTTA CAACATGGAC CCCAAACAAT ACGTTCCGAG GGTGATAGCC
GTTGACCGTG CCGGCAACGA GCGGCTCATG GGAATCAACT TCCATCTCCT TCCAAAAACC
TTTTCCGCGG ACAGGATCAA TCTGTCCGAC TCCTTTCTGG AGAAGGTTGC CGCTGAGTTC
AAGAACAAAT TTCCGCAGGC CGCAACACCA CTGGAAATCT TCCAGAAGGC TAACAACCAG
CTGCGGGAGC AGGATGCCAA AGCCCTTAAT GAATTCGCAA GGCAAACCTC GCCCGTGCCG
CTCTGGCAGG GGGAGTTCCT GCGGTTGCCC AACTCCGCCC CACGCGGTTC CTTCGGCCAG
TTCCGCATTT ATCTGTATCA GGGAAAAGTC GTAGATCAGC AGACTCACCT GGGGGTCGAC
CTGGCCTCCC TTGCCCATTC CCCTGTGCCT GCTGCCAATC GCGGCAGGGT TGTCCATGCC
GGCGACCTGG GCATTTACGG TCAGTGCATC GTAATCGACC ATGGCCTGGG TCTGCAGACC
CTCTACGGCC ACTTGAGCCA AATGGTAGTT AAGGAAGGGG ACAACGTGGA AAAAGGCCAG
ACTATCGGCA ACACCGGGGC CACCGGCATG GCCGCCGGCG ACCACCTCCA CTTTGGAGTC
ATCGTATCCG GTGTTCCGGT CAATCCGATC GAATGGTGGG ACCCGTCCTG GATCAGAAAC
AATGTCACGA ACAAGCTCGA TATGTATAAA ACATCAAAAT GA
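
A GC content figure like the one reported for this gene can be recomputed from a sequence printed in 10-base blocks. This is a minimal sketch with a hypothetical helper, `gc_content`. It assumes GC content is measured over the coding sequence alone, which the record does not state.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    dna = "".join(sequence.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# Small illustrative input (not from the record): 6 of 8 bases are G or C.
print(gc_content("ATGC GCGC"))  # 75.0
```

Pasting the full gene sequence block into `gc_content` gives the value to compare against the GC content field.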
 
Protein sequence
MRTFISIVIV VVLGITAYYF LGTPAPAVSL TPDGGPISAS RELAIKLDAS GALLKKLTVR 
ALQGDKTVDV LVKDYPKGTH QAGETFNFSR AGLREGPFQL QIKSTDSPLH FGAGNSTTLL
RSYDMQNTPP VVAVLSAAHN ITRGGAGLVV YTLPREVEKT GVKYADQFSP GYRQGGGFYA
CLFPFPYNMD PKQYVPRVIA VDRAGNERLM GINFHLLPKT FSADRINLSD SFLEKVAAEF
KNKFPQAATP LEIFQKANNQ LREQDAKALN EFARQTSPVP LWQGEFLRLP NSAPRGSFGQ
FRIYLYQGKV VDQQTHLGVD LASLAHSPVP AANRGRVVHA GDLGIYGQCI VIDHGLGLQT
LYGHLSQMVV KEGDNVEKGQ TIGNTGATGM AAGDHLHFGV IVSGVPVNPI EWWDPSWIRN
NVTNKLDMYK TSK
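
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its set of permitted start codons, and this gene starts with ATG). The `translate` helper is hypothetical and stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, also used by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA string in frame 1, ignoring whitespace, up to the first stop."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGAACCT TTATATCCAT CGTGATCGTG"))  # MRTFISIVIV
# Last 12 bases of the gene sequence above, including the TGA stop codon.
print(translate("ACATCAAAAT GA"))                      # TSK
```

The two fragments reproduce the first ten and last three residues of the listed protein sequence.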