Gene Gura_3334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_3334 
Symbol 
ID5166740 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp3918757 
End bp3920544 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content60% 
IMG OID640550820 
Producthypothetical protein 
Protein accessionYP_001232064 
Protein GI148265358 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000666243 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAGC AACTTTGGAA GATGCTCCTT CTCACCTCGT TCATCCTGAG CGCCCTGGGG 
ATTTCGGGAT GTGGCGGCGG AGGGAGTACG GCAACGACAT CGGGAACCGC AATAACAGTA
AGCGGTACCG CACTGGCAGG GGCGCCCCTT TTCGGGAACG CCTGGATCAA GGATGCGAAC
GGCAAGAAAA AGGGGCCGGT TGCCATCGAC AAAAATGGAA ATTTCGCCTT CACCAACATG
ACCGGCATGC AGGCCCCCTT CATCCTCCAG GCCGACGGCA CCGCCGGCAC CAACAGCTAC
CGCCTCTGTT CCATGGCCAC CGGCGGCGGA ACCGCCAATA TCAACCCCAT GACCAACCTT
GCCGTCGCCG CGGTGACCGG CAAAGACCCG GCTGCGGTCT TCACCGACCC GACGGCCAAC
AACATCAAGA ACTCCATCAA CGACACGGCC GTGGCAGCCG CCATCGCACA GATCAAGGCG
ATGCTCAAGC CGATCCTCGA TGCGTACAAC GCCTCTGCCG TCGACCCGCT GAAGGGTAAC
GTAGCAGCCA CCAACTCCGG GTTGGACGGC GTCTTCGACG TGGTGAAGAT CAACGTCACT
CCCGACAACA CCGGTGCGGC GCAGGTATCC GTCAGCAACA ACCTTACCAA CTCGACCATA
TTGCAGACGC AAACGGTCTC CACGGCGGTC GCCAACACCA CACAGACGGC GGCGACAATC
ACAACCCAGA CGACCGGCTT GTCCACCGAC GCCGCCAACC TGCAGGCAAT TACCGCGCAA
CTTCAATTGC TGGCAACCGA GCTGGGCAAG ACCACCCCGA ACCTCGACCC CTTCTTTGCC
ACCAATTTCG GGATCAACAG CGGGCTCGAC CGGGCCCAAT CCATCCTGCA ATTGGCCCCC
CCGGGGAAAA TCACCGGCAT CTCCCCGATC AGCGTCGTGC AGAAAACCGC AAACGGCGCG
AGCTTTGACT ACGAAGTCTC CTTCCTGGCT TACTTCGCCG ACGGCTCAAA CGGCGCCCCC
GATGACAACT TCATCTTCAC CAATGAAGGC GGCGCCTGGA GGCTGAAAGG GAATAACTAT
AAATCCTACG TCAGGATTCA ACCGCAAGCC TACAGATGGA TCGATGCGGT CGGCACGACG
ACCGTCAAGA CCGGTCTCGA CGTCGAAGCC GAAGACCCGG GCGTGATCGG CATCGCCACC
ATTACGATTA CCGGCCCCGG TCTCCCCGCC GCCGGACTCA CCATGACATC GGTCGGTGTG
GGTGTCACCT ATTTCAATAT CATCCAGGCT CAACAGGACC CCACCCTGAA CACGCTGACC
AACCAATGGA ATTTCCTCCC GCTGAGTGAT GCGACGATCA CTGGTACCTT TGCCGCCACG
GCTGCGCCGT TCACCTACAC CTTTACGGTC AAGGATGCGA ACGGCGCCAC CATCGAGACC
AGGACAAAGA AGCTCGCAGT GGGACCGCTC CTCTCTGCCA CACTTGACGC CACCTACTTC
CCGACCATCA GTGGCCTGGC TTCCCAAGCC ATGTCCCTCT TGACCGGCAA GAGCTCGATT
TCCTTCTTCT TTGCCAAGCC GACCGCCTAC ACGGTCCAGG AGCAGCGAGC CAATCTCAGT
TTCTGGAACG AGACCAGCAA CGGATATTAC GATACCGAGC CTCTCCTTAC CGACACCCAG
GCCACCATCA CAGGCGGTAT CCCCTCGACG CTCCAGGGGG CATGGCTCAG CATGGGAGCA
AGGGATGGCA GCGGCCGGAG ATTTGATGCC GTCTTGATTT TCAAGTAG
 
Protein sequence
MKQQLWKMLL LTSFILSALG ISGCGGGGST ATTSGTAITV SGTALAGAPL FGNAWIKDAN 
GKKKGPVAID KNGNFAFTNM TGMQAPFILQ ADGTAGTNSY RLCSMATGGG TANINPMTNL
AVAAVTGKDP AAVFTDPTAN NIKNSINDTA VAAAIAQIKA MLKPILDAYN ASAVDPLKGN
VAATNSGLDG VFDVVKINVT PDNTGAAQVS VSNNLTNSTI LQTQTVSTAV ANTTQTAATI
TTQTTGLSTD AANLQAITAQ LQLLATELGK TTPNLDPFFA TNFGINSGLD RAQSILQLAP
PGKITGISPI SVVQKTANGA SFDYEVSFLA YFADGSNGAP DDNFIFTNEG GAWRLKGNNY
KSYVRIQPQA YRWIDAVGTT TVKTGLDVEA EDPGVIGIAT ITITGPGLPA AGLTMTSVGV
GVTYFNIIQA QQDPTLNTLT NQWNFLPLSD ATITGTFAAT AAPFTYTFTV KDANGATIET
RTKKLAVGPL LSATLDATYF PTISGLASQA MSLLTGKSSI SFFFAKPTAY TVQEQRANLS
FWNETSNGYY DTEPLLTDTQ ATITGGIPST LQGAWLSMGA RDGSGRRFDA VLIFK