Gene Gura_4178 details

Gene Information

Locus tag: Gura_4178
Symbol:
ID: 5165975
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter uraniireducens Rf4
Kingdom: Bacteria
Replicon accession: NC_009483
Strand:
Start bp: 4839530
End bp: 4840987
Gene Length: 1458 bp
Protein Length: 485 aa
Translation table: 11
GC content: 54%
IMG OID: 640551656
Product: transglutaminase domain-containing protein
Protein accession: YP_001232894
Protein GI: 148266188
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID:
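
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 4839530
end_bp = 4840987


def gene_length(start, end):
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1458
print(protein_length(length_bp))  # 485
```

Both values agree with the Gene Length (1458 bp) and Protein Length (485 aa) fields of the record.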


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00683366
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGTTGT ACCGCCAACT GTCCCTTTTT ATCATCACCC TTTTTCTTGT TGTCTCCCCC 
CTCCTCTCTT TTGCCGCATC AATCCCGCGC CTGAGCGCCC CCCCTATTGG CGAACGCTGG
TACAGCGTCA ACATGGGCGA TGAACGGGTC GGATTTTCCC ATCTGAAAAT CACTGAAACA
GCCGATGGCT ACGAGATTTT CAGCGAAGGG AGCGTAAAAA TGCTGGTAAT GGGCTTTTCA
CGCGAGGCTG TGGCGCGGGA AACCTATCTG GTAAACAAGG ATCTGTCGTT AAAATCTTTC
AGCGTGGAAC AGACCATTGA CGGCAGCCCG ATGAAACTGA AGGGTGAAGT TACCGGCAAA
GGGGTAAAGG TTGTCATAGA ATCGGCCGGG AATAAAAAGG AGAAGACCCT CAAGGCAAAG
GGGAAACTCC TGCCGCCGCC CGCCGTGAAC CTGTATCCGT TCATGCAGGG CGCTATGCCC
GGCAAACCAT GCCGTGTCCA GATGCTGGAT GTGGAAGGGG TAAAGGTCAA AGAGGTAAAG
ATCCAGGTGA TCGGGGAGGA GATTCTCCCT GGCGGCGTGA AAGCCATTCA TTTCCAGAAT
GATTTATATA CCTTTGTCGA TAACGATGTC TGGCTGGACG CGGCAGGGAA CACCATCAAA
GAATCGGTGC GTGACGGCCT GGTTGTAACC CAGGCTGAAG ATGCGCAGAG TGCCGGGAGA
TTCATTGCCG AGGCAGTCCT GGCCAAGAAG GACCTGATTT TAGACTTCAG CCTGATAAAG
GTTGATACAC CGATTAAAAA TCCAGGGGAG CTGAAAAAAC TCGAGATCTC TTTCTCAGGT
ATCCCCACCG CTATCCCGCT TCTGCAAGGA GCGGGACAAA AGGGGGACAG ACTGGCAGAC
GGCAGCGTCA GGTTCACCCT GGAAATCGCC CCATATAAGG CAAAGACATC GCCTGCCGCC
TATGACAAAA CGGCATTCGC CCCCTACCTG GAGTCAAGTG AGCGGATTCT CGCGGATAAT
CCTGAAATAA TCAGCAAGGC AACGGAGATT GTCGGAGCAG AAAAAGACCA GTTGAAGATC
GTGGAAAAAC TCACCAACTG GGTCGCCACA ACGGTGAAGG GAGCAGTAAC CGACAGCCAG
TCACCACTGG AAACCCTGAA GAAGGGGAGC GGCAACTGCC AGTCACACGC ACGGCTCTAT
ACCTCACTGG CAAGGGCCGC CGGCATTCCG ACCAGATTCG TCTCGGGGCT TGTCTATGCG
CCTGGGCAGG GATTTCTCTA CCACAGCTGG GCAGAAAGCT ACCTGGGCGA ATGGGTGGCC
GTGGACCCCA CCTTCGGCCA GTTGCCGGTT GATGCAGGCC ACATAAAGCT GGTTGAAGGT
GACTCCCCCG AAGATATGTC CCTGCTGGCC GGTGTCGTCG GCAAGCTCAA GGCCAGAGTG
ATCGAACAGA AATACTGA
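
The gene sequence is listed in blocks of ten bases, so the spaces and line breaks have to be removed before any calculation. The following is a minimal sketch of how a GC percentage such as the GC content field could be recomputed. The helper names `clean_sequence` and `gc_content` are hypothetical, and only the first row of the sequence is used here. Paste all rows to check the whole gene.

```python
def clean_sequence(text):
    """Remove the spaces and line breaks that group the sequence into blocks of ten."""
    return "".join(text.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence from the record.
first_row = "ATGTCGTTGT ACCGCCAACT GTCCCTTTTT ATCATCACCC TTTTTCTTGT TGTCTCCCCC"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_content(seq), 1))
```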
 
Protein sequence
MSLYRQLSLF IITLFLVVSP LLSFAASIPR LSAPPIGERW YSVNMGDERV GFSHLKITET 
ADGYEIFSEG SVKMLVMGFS REAVARETYL VNKDLSLKSF SVEQTIDGSP MKLKGEVTGK
GVKVVIESAG NKKEKTLKAK GKLLPPPAVN LYPFMQGAMP GKPCRVQMLD VEGVKVKEVK
IQVIGEEILP GGVKAIHFQN DLYTFVDNDV WLDAAGNTIK ESVRDGLVVT QAEDAQSAGR
FIAEAVLAKK DLILDFSLIK VDTPIKNPGE LKKLEISFSG IPTAIPLLQG AGQKGDRLAD
GSVRFTLEIA PYKAKTSPAA YDKTAFAPYL ESSERILADN PEIISKATEI VGAEKDQLKI
VEKLTNWVAT TVKGAVTDSQ SPLETLKKGS GNCQSHARLY TSLARAAGIP TRFVSGLVYA
PGQGFLYHSW AESYLGEWVA VDPTFGQLPV DAGHIKLVEG DSPEDMSLLA GVVGKLKARV
IEQKY
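
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code, and that the gene is read from its first base on the listed strand. The names `CODON_TABLE` and `translate` are hypothetical, and a codon containing a character other than A, C, G or T would raise a `KeyError`.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate("ATGTCGTTGT ACCGCCAACT GTCCCTTTTT"))  # MSLYRQLSLF
# Last row of the gene sequence, ending in the TGA stop codon.
print(translate("ATCGAACAGA AATACTGA"))  # IEQKY
```

The two outputs match the first ten and the last five residues of the listed protein sequence.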