Gene Gura_2220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_2220 
Symbol 
ID5166813 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp2578094 
End bp2579704 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content62% 
IMG OID640549714 
Producthypothetical protein 
Protein accessionYP_001230977 
Protein GI148264271 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000188992 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGT TTATGGTTCT TTTCCTTTCA CTTCTCTTCT CCGCGTTTCT GTGCGGCGCC 
GCCCTTGCCC ACGAGAGTCA CGACGACATC CTGGGAAAGG TGCCCAGGGA GCACTGGACC
TGGAAGGAGA TCGCGGAGCT TGGCGCGAAG TACGGCGCCC TGAAAAAGCT CCCAGAAGGG
GAGTCAATTG AGAAAAAGGA TCTGGCAGCG TCGTTGCTGG CGGTGATGGA GAAGGCGCTG
GAGAGGTGCG AGAGGGAGGG GAGCGAGGCG GTCCCCGCCG AGGACCTGGA GCGGATTGCC
ACCCTGCACG AGGCGCTGAA GGAGGAGATG GCCCGGTATG AAGGGTACCA GCTCCGGCGG
GATGCGATCG AAAAGATGCT GGCCAAGCCG GAAGTGCCGG AGTTCGCATA CAAGGTCGGG
GTCAACGGCT TTCTGCGCGG CGAGGGGGTC GGCAATTTCA CCCTGACCGA TTTCAGCTAC
GTCCCCGGCC ACGGCGAGGG GCGCTTCCTC TACCGGGTGA AGCCGTACGC CTACTGGCAC
CCGACCGACT ACCTGGATAT CCACGCGGAG GGGCAGGGGT ACGGCTTCAC CGGCGGGAGC
CACCAGGAGT ACAATAAGTT CTCCTTGTAT CAGGGGTTTG TTGAGGCGAA ACTCCCCGGC
AGCGAGCTGT TCGCCCTGAA GGGGGGGCGC CAGGAATTCA GCTACGGAAG CACCTTCATC
CTCGGCCCCG ATTCGTTCTA TGACGGCCTC TCGTTCGACG CGGCCAGGCT GCGGATAAGA
CCGGTCGAGC CGCTCACCGT CGATCTCCTG GTGGGTGCCT ACGCCACCCC CTTCTCCGGT
GGCATCGAGG GTAACCTGGC CGGAGCCTAC GCCACCTATG CATTTTCCGA GGGGAACGCC
GTCGAGGCCT ATGCGTTCCG CGACTCCGGC TCGACCGACC ACCATGCCGG GGAACACCTC
GCCACCTGGG GGGCCAGGTT TACGGGGAAG GCAGGCCCGG TCGCCGTCGA ATTCGAGCCG
GTCTACCAGT CGGGGCGGAC CTTCAACAGC GCGCGGGAGG CCAATGATCG GATCGACGCC
TTCGGCGGCC ATCTCGATCT CTCCGCAGAG TCGGTCCTGG CCGGCTACAA CAACAAGTTC
TTCGCGAGCT ATGCCTACGG CTCAGGGAGC AGCAACGCGG CCAATGGCGT CTCGGTCGCC
AGGGAATTCA GGACTCCCAA CAACGACAAC GCGCTGGTGG GGGACATGAG CGTCATCGGG
GACATGTCCG GCGTCACCGT CAACGGCCAT CATGCCAGCG GCCTGCAGAT CTATACCCTC
GGCTGGGGAG TGGACCTGAC AAAGGATCTG AATTTCTCCG CCACCGGCCG CTATTTCCTC
GCCAACAATG TCGAGGACGG CCTGAGCCGC CGTCTCGGCC TGGAGACCGA CTTTACCCTG
ACTTACAACC TGGCGGAGGG GCTCTCTTTC CTCGTCGGTT ATGACCGATT CTTTACCGGC
GGATTTTTCC GGGATGCCTC CGGGAGCGGC GAGGATATCG ATTACGGTTA TTTCATGGTA
CAGTTCGATC TCTCCAAGAG TAAACCGCGG ATGAAACCTG TCAAAGGGTA G
 
Protein sequence
MKKFMVLFLS LLFSAFLCGA ALAHESHDDI LGKVPREHWT WKEIAELGAK YGALKKLPEG 
ESIEKKDLAA SLLAVMEKAL ERCEREGSEA VPAEDLERIA TLHEALKEEM ARYEGYQLRR
DAIEKMLAKP EVPEFAYKVG VNGFLRGEGV GNFTLTDFSY VPGHGEGRFL YRVKPYAYWH
PTDYLDIHAE GQGYGFTGGS HQEYNKFSLY QGFVEAKLPG SELFALKGGR QEFSYGSTFI
LGPDSFYDGL SFDAARLRIR PVEPLTVDLL VGAYATPFSG GIEGNLAGAY ATYAFSEGNA
VEAYAFRDSG STDHHAGEHL ATWGARFTGK AGPVAVEFEP VYQSGRTFNS AREANDRIDA
FGGHLDLSAE SVLAGYNNKF FASYAYGSGS SNAANGVSVA REFRTPNNDN ALVGDMSVIG
DMSGVTVNGH HASGLQIYTL GWGVDLTKDL NFSATGRYFL ANNVEDGLSR RLGLETDFTL
TYNLAEGLSF LVGYDRFFTG GFFRDASGSG EDIDYGYFMV QFDLSKSKPR MKPVKG