Gene Gura_3125 details


Gene Information

Locus tag: Gura_3125
Symbol:
ID: 5165368
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter uraniireducens Rf4
Kingdom: Bacteria
Replicon accession: NC_009483
Strand:
Start bp: 3693132
End bp: 3694202
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 58%
IMG OID: 640550610
Product: peptidase M42 family protein
Protein accession: YP_001231859
Protein GI: 148265153
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1363] Cellulase M and related proteins
TIGRFAM ID:
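
The start, end, and length values listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence on this page ends in TGA). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 3693132
end_bp = 3694202
gene_length_bp = 1071
protein_length_aa = 356


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coded_residues(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert coded_residues(gene_length_bp) == protein_length_aa
```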


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.612233
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCGATG CATCCTTTGA ATTTTTACAG AAACTGCTGG AGGCTCCCAG TCCGTCAGGA 
TACGAGCAAC CGGCCCAGCG GATATTCCGC GCTTACATCG CCCCCTTTGC CCAAGTCAGC
ACCGATGTCA TGGGTAATGT CTTCGGTGCC ATCGAGGGCA TTGGTGTGGA GCGGCCGCGG
GTCATGCTGG TCGGTCACTC CGATGAAATA GGTTTTCAGG TCAAGTACTT GGACGACAAC
GGCTTTCTCT ATTTCGCCGC CATCGGTGGG GTTGACCCTC ATATTACCCC CGGTCAGCGG
GTGAATATCC ATGGCTGCAA GGGGGTTGTT CCGGGGGTGA TCGGCAAGAA GCCGATCCAT
CTGATGGAGA CCAAGGAGCG GGAGACCGTG GTGAAGCTCG ATGCCCAGTA CATAGATATC
GGCGCTGCTA ACAGGAAAGA GGCGGAAACG CTGGTCCGGG TCGGCGACCC GGTGACCTTT
GCCGTCGGCC TGGAAAAGCT TCATGGCGAC CGGGTCACTT CCCGCGGTTT CGACGACAAG
GCGGGGAGTT TTGTCGTTGC GGAGGTGTTG CGGCAGGTGG CGGCGTTAAG CGCAAAACTG
CCGGTCGATC TCTTCGGCGT GTCATCGGTG CAGGAGGAAG TGGGGCTGCG AGGAGGAACA
ACGAGCAGCT ATACGGTGAA TCCTGATATC GGTATCTGTG TCGAAGTCGA TTTTTCCACT
GACCAGCCGG ACGTTGATAA AAAGCACAAC GGTGAAGTGG GGATCGGCAA GGGACCGATT
CTGCCGCGGG GGGCGAACAT AAACCCTGTC CTGTTCGAGC TTCTTGCCGA TACGGCGGCG
CGCGAGAACA TACCGGTGCA GTTTACCGGC ATCCCAAGGG CGACCGGCAC CGATGCCAAC
GTCATGCAGA TATCCCGCGG CGGTGTTGCC ACGGCCCTGG TGAAAATACC CCTCCGCTAC
ATGCATACTC CTGTGGAGGT ACTTTCTTTG GCCGATCTGG AAAACGCCGT CAGGCTGATT
GTCGCGACTC TGGCGCGAAT AACGGATAAA CAGACATTTA TCCCGAGCTG A
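
A GC content figure like the one in the Gene Information table could be recomputed from a sequence printed in 10-base blocks, as above. This is a minimal sketch, not the method the database uses; `clean_sequence` and `gc_percent` are hypothetical helper names, and how the database rounds its percentage is not stated on this page.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to print sequence in blocks."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)
```

Passing the full gene sequence as one multi-line string to `gc_percent` gives the value to compare against the table.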
 
Protein sequence
MRDASFEFLQ KLLEAPSPSG YEQPAQRIFR AYIAPFAQVS TDVMGNVFGA IEGIGVERPR 
VMLVGHSDEI GFQVKYLDDN GFLYFAAIGG VDPHITPGQR VNIHGCKGVV PGVIGKKPIH
LMETKERETV VKLDAQYIDI GAANRKEAET LVRVGDPVTF AVGLEKLHGD RVTSRGFDDK
AGSFVVAEVL RQVAALSAKL PVDLFGVSSV QEEVGLRGGT TSSYTVNPDI GICVEVDFST
DQPDVDKKHN GEVGIGKGPI LPRGANINPV LFELLADTAA RENIPVQFTG IPRATGTDAN
VMQISRGGVA TALVKIPLRY MHTPVEVLSL ADLENAVRLI VATLARITDK QTFIPS
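
The protein sequence should correspond to the gene sequence translated with translation table 11. The sketch below builds the codon table from the usual NCBI layout (bases in T, C, A, G order) and checks the first line of each sequence against the other. Table 11 uses the same codon-to-residue assignments as the standard code and differs in which codons may act as start codons; since this gene begins with ATG, no alternative start codon handling is included. The helper names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order: first, second, third base each run over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to print sequence in blocks."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean_sequence(dna)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon
            break
        residues.append(residue)
    return "".join(residues)


# First line of the gene sequence and the first two protein blocks from this page.
first_gene_line = "ATGCGCGATG CATCCTTTGA ATTTTTACAG AAACTGCTGG AGGCTCCCAG TCCGTCAGGA"
first_protein_blocks = "MRDASFEFLQ KLLEAPSPSG"
assert translate(first_gene_line) == clean_sequence(first_protein_blocks)
```

Because each full line of the gene sequence holds 60 bases, every line starts in frame and can be checked independently in the same way.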