Gene Gura_3693 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_3693 
Symbol 
ID5162963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp4320453 
End bp4321667 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content55% 
IMG OID640551178 
Productpeptidase U32 
Protein accessionYP_001232419 
Protein GI148265713 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00154214 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGC CCGAACTCCT TGCCCCAGCC GGCAACATGG AAAAACTCCG TATCGCCGTC 
CATTACGGCG CTGATGCCGT CTACCTGGGG GGAAAAAGCT TCGGTTTGCG GAACCTGGCC
GATAATTTCT CCACCGCCGA ACTGGCGGAA GCCGTGGTCT ATGCCCATGA ACGGGGGGTT
AAGGTCTACC TCACGGTCAA CGCCTACCCC GACAACGATG ATATAGGCGA ATTGCTGCAT
TATCTGGAAG AAGTGCGCCC TATCCCCTTC GACGCCTACA TTGCCGCAGA TCCCGGTGTC
ATTGAAACCA TCAGGGAAAT CTCGCCGGAA CGCGACATCC ACCTCTCAAC TCAGGCCAAC
ACCACAAACT GGAAAAGCGC TCTTTTCTGG CAGAAACAGG GGATACGGCG TATTAACCTT
GCCCGCGAGA TGTCCCTTGA AGGGATGCGC CAAGTCAGGG AGAGGACCGA CATCGAACTG
GAGGCCTTTG TTCACGGAGC CATGTGCATC TCCTATTCGG GGCGCTGCCT CCTGTCCAGC
GTCATGAGCG GCAGAAACGC CAACAAGGGT GAATGCACCC AGCCCTGCCG CTGGAACTAC
GCCATCGTCG AAGAAACGAG ACCCGGCGAG TATTTCCCGG TCATGGAGGA TGAAAACGGC
ACTTTTATCT TCAACTCCAA AGACCTCTGC CTACTTACCT ACCTGCCGGA ACTGGCGGGC
GCTGGGGTGG ATTCCCTGAA AATCGAAGGA AGAATGAAGG GGATCTATTA CGTTGCCTCT
GTCGTGAGAA TTTACCGCCA GGCCCTGGAC CGTTACTTCG CAGAGCCGGA AACCTACCGC
TGTGATCCCG ACTGGCTGGA GGAACTCTGC AAGATCAGCC ACCGCGGCTA CACAACGGGC
TTTTTCCTCG GCCCGCCAAA AGATATTGAC CACCAGTACC ACTCCAGCTA TATTAGAAAC
CATGAATTTG TCGGCATAGT AGAAGAGCCA CTGCCGGACG GCGCCATTAT ATTGGAAGTC
AGGAACAGGA TAAAAACCGG AGACACCCTG GAATTCATCG GTCCCGCCAT GTCGTCTTCC
TTCCACGAAA TGAAAGAGAT CATCACCGAC CGGGGAGAAA GGGTTGAAGC TGCCAACCCT
AACCAACGCA TCATTGTCAG GACCGCTTTC GCAGCAGAGA AATATGACCT GGTCCGACGG
GAAAAATCCT TGTAG
 
Protein sequence
MKKPELLAPA GNMEKLRIAV HYGADAVYLG GKSFGLRNLA DNFSTAELAE AVVYAHERGV 
KVYLTVNAYP DNDDIGELLH YLEEVRPIPF DAYIAADPGV IETIREISPE RDIHLSTQAN
TTNWKSALFW QKQGIRRINL AREMSLEGMR QVRERTDIEL EAFVHGAMCI SYSGRCLLSS
VMSGRNANKG ECTQPCRWNY AIVEETRPGE YFPVMEDENG TFIFNSKDLC LLTYLPELAG
AGVDSLKIEG RMKGIYYVAS VVRIYRQALD RYFAEPETYR CDPDWLEELC KISHRGYTTG
FFLGPPKDID HQYHSSYIRN HEFVGIVEEP LPDGAIILEV RNRIKTGDTL EFIGPAMSSS
FHEMKEIITD RGERVEAANP NQRIIVRTAF AAEKYDLVRR EKSL