Gene Gura_0967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_0967 
Symbol 
ID5166756 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp1147769 
End bp1148728 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content64% 
IMG OID640548463 
Productrhomboid family protein 
Protein accessionYP_001229746 
Protein GI148263040 
COG category[R] General function prediction only 
COG ID[COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0078595 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCCGG AGAGAGATAA CATCGAGGCA TGCGAAGAGG AGTGGCTGGC AATCCCGCCG 
GAGTTGGGAG TCTGGAAAGA CAGCGGCACA CTTTCCGAGC GGCAGGTACG CCTCTGGACC
CTGGTCCTGG ATGCACGCGG CGTGCCCTTC CGCACCGAGC GGAGCGCTAC GGGCTGGCAA
CTGCTGGTGC CGGTGGGCTA CCTCAATGCG GCCCGGGACG AGTTGCGCCT CTTTGAAAAG
GAAAACCGCA ACTGGCCCCC GCCCCTGCCT CCGGCAAGAA CCCTGACGGA GAACACCCTG
GCAACCATGT CGGTCCTGAT TCTCCTGGCC ACCTTCCACA ACCTTACCCT GCTCGACATT
TCTCTGCCCG GCCATCACCC GATCAACTGG ATCGCCCTCG GCAACGCACA CGCCGCCAAG
ATACTGGCCG GCCAATGGTG GCGGCCGATC ACCGCCCTCA CCCTCCACTC CAACTGGCAG
CACCTTCTCG GCAACCTGGC AATCGGCGGG GTCTTCATCA TCATCCTCTG CCGCGAGCTC
GGCTCGGGGC TGGCCTGGAG CATGCTCCTC GGCGCCGGCA TCCTCGGCAA CCTGGCCAAC
GCCTGCTTGC AGCTGCCGGA CCATAGCTCG ATCGGCGCCT CCACCCTCGT CTTCGGCGCC
GTCGGCATAC TCGCCGCCCT CAACATGGTG CACTACCGGC ACCACCTGCA AAAGCGCCGG
CTACTCCCCG TTGCTGCTGC CATGGCCCTG CTCGCATTGT TGGGCACAGA AGGTGAACAC
ACAGATCTGG GTGCACACCT GTTCGGCTTT GTCTTCGGCA TAGGTCTTGG CCTGGTTACG
GAATACCTGG CAGGGAAGTA CGGGCGGCCC GGGCGGCGGA TCAACGCCCT GCTGGCGCTG
GCCGGAGCCG TTGTGGTGAT AGCGGCCTGG TGGGGGGCGC TGGGTCATTT TGCACTTTAG
 
Protein sequence
MDPERDNIEA CEEEWLAIPP ELGVWKDSGT LSERQVRLWT LVLDARGVPF RTERSATGWQ 
LLVPVGYLNA ARDELRLFEK ENRNWPPPLP PARTLTENTL ATMSVLILLA TFHNLTLLDI
SLPGHHPINW IALGNAHAAK ILAGQWWRPI TALTLHSNWQ HLLGNLAIGG VFIIILCREL
GSGLAWSMLL GAGILGNLAN ACLQLPDHSS IGASTLVFGA VGILAALNMV HYRHHLQKRR
LLPVAAAMAL LALLGTEGEH TDLGAHLFGF VFGIGLGLVT EYLAGKYGRP GRRINALLAL
AGAVVVIAAW WGALGHFAL