Gene Gura_3992 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_3992 
Symbol 
ID5163387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp4643014 
End bp4646181 
Gene Length3168 bp 
Protein Length1055 aa 
Translation table11 
GC content49% 
IMG OID640551471 
Producthypothetical protein 
Protein accessionYP_001232709 
Protein GI148266003 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00061561 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCCGCA AATATTCGAG GACGATAGCA GCTGTTATAT TGTGCTTTTT CACCTGGACT 
TCGGGTGGGG TGTTCAGCAT CGCCAATGCG GCGCAGGTTG AGGCCAGGAA GGCAAAGACG
CAACGTCAGG AGAAAAAGCC GGAAGGGGCG GAGGAGAAAT TTTCTAAAGT TACCGGGGAG
ATGGAGGCGA TACTTGCCGA CCCGAAGGAG GACATTGAAG CAAAGAAAGG TCGCCTGCGG
CTCAAAAAGG CGGAGGTGGA CGCACTCGAC ACGGATATCC GCAAGCAGTT CGCCGAGACG
GAGAAGCGGC TTAAGGATGC CAAGTTACCG GCCGAGATTC TGGAGCGCCA TCGCCGGTTC
GTCAAGCACT ATGACGACAA TCTGGCTGAG TTGAAGGGGA ATATCGAGCG GGTTGAGAAG
GCCAAAGATA AGAAAGAAGC AGAGGCTGAG ATCGAGAAGA CCCGGAAACA CCTGGAAAAG
GTGAAGGCTC CGACACGACA TCAGCCCCTT GATCCGAACA ACCTTCCCCA CCGGCAGCCG
AAGGTGATAA AGCGGGAGCC GCGGCTGAAG AAGGAGGAGT TTGAGCGGGA TTTAAAGAAG
GACAAGAACG CATGGAGAAA TGAAAAGCGG ATCATGGTCG CCTCGGCGGG GTCTTTGGCC
GGGTTGCTTA CCTCCAGCAT GGTCAATGCC GTTACCCCAC CCACTGCCGC TGATCTGGCG
GAGACTGTTG ACGTCCAGTT GACTCCTGAG ATCAGGGCCA AGGCTCTGGA GTTGGGGAAC
AACCCGGTTA AGATTTACGA GTGGGTGCGG AACAACGCTG AATATGAACC GTATTATGGT
TCCCTAAAAG GGGCGCAGCA GACGCTGATG GAAAAATCCG GGAATAACTA CGACCAGGCC
AGCTTGTTGA TTGCTCTGTT AAGAGCGGCA AATATCCCAG CCAAATATGT TACAGGAACC
ATTGAAATAC CAATCGAGAG GGTAATGTCA TGGATTGGAG TCAAGGACCC AACTACGGCG
GCAAATATAC TGGCAACAGG GGGGATACCT GCAAAAGCAG GCGTGAGCGG TGGAAAAATT
TCAACGATCC GTCTGGAGCA TGTCTGGGTG GAGGCGTATG TCAACATGTA CCCATCCTTT
GGCGCCAAGA ATGGCCCTGG TAACGCCTGG ACACCCATTG ACCCGAGCTT CAAGGAGCAT
GAACTCAATA CCTCCGTGGA TATGTCAAAG ATTGTTAAGT TCAATGAGTC CGACTATCTG
CGGGCTCAGA GCAAATTGCC CCCTTCACTT ACCTACCTGT TCGCCTTGGA AGACTACCAT
ACTGCCAACT ACCAGGGTGG CATGTATGAG ATGTTTTACC TGAAAAAGAT CAAAGAGCAC
GAATTCGGCG TCTTTCTTGG GACGCTGCCG TACAAAACGG TCGTTGTCGG CAGCAAGTTC
GCAAGTATCG ATACATTACT GCGACACAAG ATTTCAATAG GACTGCGCGA TCCCACAACA
GATGATGCAA CAGCCATAAC GAAAGACGTC TGTGAGCTGA CCGGAAAGCA AATCACCGTA
TCCTACACTC CCGCCACAGA TCAAGACAGC GCGGTAATGA CCAACTATGG AGGATTGCTG
AAGACACCGG CCTATCTGAT AAAAGTCAAG GCGCGGGTGA AAGTTGACGA TACGGTTGTC
CTGGAAGGGC CGCCGATAGG CATGGGAGAA TCGTTGAAGC TTAACCTACA GTTTTCCACG
CCCGGCAGTT TCGGCGACTC CATCGAAACC GAAATGGCAG CTGGCCTCTA TTACAACATC
GGGCTTTCCG CCCTGAATGT CGCAAGCAAA CAGGCACTGG GGGGACTAGA CAACTCGGAA
AACCTCGTCG GCACCTTCTA CGATTCAATC AACAGCGGGG ACAATCAGGT CGGCAAGGTA
CTGCACAACA TCGGCCTGCA ATACTTCACC CACACGAATA ATGCCTCAAA GGTGCTCGAA
GGTGTGATGC ACATTTATAA CACGAGAGCG GTAAATGCCG GGTTCGTCTC AGTATCGGCG
AAGTACCGGG AGTTTTTCGG GCTCATGGTA TCGCCCCCCA TCATTTCGGG GCTGGTTATC
GACATCCCGA GATACATTCA ATCTCCATTT TCCATCATCG GCGACAAGGA GCAAGAAAAG
GCGTTCACCA AAATCCAAGG CCTCAATTCA TCGTACTTTG AGCATGCTAT TTGGGAAAGC
TTTTCCGGGA TTGATTCAGT ATCTACGGTA AAACTACTTC AGCTGGCAAA CGAGGCGGGG
CAGACGATTT ACACGATCAA TAGCGCAAAT GTAAGTCAAA CCCTGCCGCT GCTGAACCAG
ACGCAACAGG TAAAAGATGA CATTCAAAAT GCTGTTGCTG CAGGTAAAGA GGTGACAATT
CCTGGAAGCA AGATCACTCG CAATGAGTGG AGCGGTACAG GATTCATAGT GAGGAACCCG
GATACTGGCG AAGGCGCTTA CATGATTTCG AATGGGCTGG CAGGAGGAGG AAGTACCAGT
TCACCTTCAT CAATATCACT GTTGGGGAGG CTCCTTTCGC GTCAATATGC AGCTCTTGCA
CTAGGAAAAC CAGCAAACCA GTTCGCCTTA AATAGGGGGA TAGGGAGATG GATTGGAGCT
GGATATGATG AGATTATGAT TTTTGCCATC GGTGGTATGT TACTTTCTAA AGGATTTATA
CCAAGGCATG AGTATACTTT TTCTAAAGAA GAATTGTTAA GTCTTATCAA CAGGCCAGAT
AATTGGATTG TATATTATAG CGGACATGGA AATACGAGCC CAGACTATGG GGATATGTTA
ATACCAGGTT ATGATAAAGA CGGTAAACCG GAGTATGTTT ATTCTGGCGA TATTACAAAA
GCTAATGCAC GTGTTGTTTT CCTTGACAGC TGCCGTTCTG GAGATCGGGG TAGTTTTATG
AATTCATTTG GTGTCGATAA TCTAGTATTC ATGGGGTGGA CTCACTCTGT AGAATATTTT
GAATCGGGAG ATTTTGCTTT TGATTGGTGG CGTAGCTTTA TTTCAGGTAA ATCCGCTGCG
ATATCAGCTG CAGGTATTGC TGATGGTAAA TACGTAACTC CAAGTTCAGA CGCTAATGAA
CCATATGTGG TTATAAAAGG TGGATCATTA ACTTTGGATT CACTTTGA
 
Protein sequence
MFRKYSRTIA AVILCFFTWT SGGVFSIANA AQVEARKAKT QRQEKKPEGA EEKFSKVTGE 
MEAILADPKE DIEAKKGRLR LKKAEVDALD TDIRKQFAET EKRLKDAKLP AEILERHRRF
VKHYDDNLAE LKGNIERVEK AKDKKEAEAE IEKTRKHLEK VKAPTRHQPL DPNNLPHRQP
KVIKREPRLK KEEFERDLKK DKNAWRNEKR IMVASAGSLA GLLTSSMVNA VTPPTAADLA
ETVDVQLTPE IRAKALELGN NPVKIYEWVR NNAEYEPYYG SLKGAQQTLM EKSGNNYDQA
SLLIALLRAA NIPAKYVTGT IEIPIERVMS WIGVKDPTTA ANILATGGIP AKAGVSGGKI
STIRLEHVWV EAYVNMYPSF GAKNGPGNAW TPIDPSFKEH ELNTSVDMSK IVKFNESDYL
RAQSKLPPSL TYLFALEDYH TANYQGGMYE MFYLKKIKEH EFGVFLGTLP YKTVVVGSKF
ASIDTLLRHK ISIGLRDPTT DDATAITKDV CELTGKQITV SYTPATDQDS AVMTNYGGLL
KTPAYLIKVK ARVKVDDTVV LEGPPIGMGE SLKLNLQFST PGSFGDSIET EMAAGLYYNI
GLSALNVASK QALGGLDNSE NLVGTFYDSI NSGDNQVGKV LHNIGLQYFT HTNNASKVLE
GVMHIYNTRA VNAGFVSVSA KYREFFGLMV SPPIISGLVI DIPRYIQSPF SIIGDKEQEK
AFTKIQGLNS SYFEHAIWES FSGIDSVSTV KLLQLANEAG QTIYTINSAN VSQTLPLLNQ
TQQVKDDIQN AVAAGKEVTI PGSKITRNEW SGTGFIVRNP DTGEGAYMIS NGLAGGGSTS
SPSSISLLGR LLSRQYAALA LGKPANQFAL NRGIGRWIGA GYDEIMIFAI GGMLLSKGFI
PRHEYTFSKE ELLSLINRPD NWIVYYSGHG NTSPDYGDML IPGYDKDGKP EYVYSGDITK
ANARVVFLDS CRSGDRGSFM NSFGVDNLVF MGWTHSVEYF ESGDFAFDWW RSFISGKSAA
ISAAGIADGK YVTPSSDANE PYVVIKGGSL TLDSL