Gene RPB_1803 details

Gene Information

Locus tag: RPB_1803
Symbol: ureC
ID: 3908884
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2063240
End bp: 2064952
Gene Length: 1713 bp
Protein Length: 570 aa
Translation table: 11
GC content: 65%
IMG OID: 637883697
Product: urease subunit alpha
Protein accession: YP_485422
Protein GI: 86748926
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0804] Urea amidohydrolase (urease) alpha subunit
TIGRFAM ID: [TIGR01792] urease, alpha subunit
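
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for RPB_1803.
length = gene_length_bp(2063240, 2064952)
print(length)                     # 1713
print(protein_length_aa(length))  # 570
```

Both results agree with the Gene Length and Protein Length fields of the record.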


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.659823
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.557794
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCACCA GAATCTCGCG TTCCGTCTAT GCCGACATGT TCGGTCCGAC CACCGGCGAC 
CGCGTCCGGC TCGCCGACAC CGATCTGATC ATCGAGGTCG AGAAGGACTA CACGACCTAT
GGCGAGGAGG TGAAGTTCGG CGGCGGCAAG GTGATCCGCG ACGGCATGGG GCAGAGCCAG
GTCACCAACA AGGACGGCGC CGCCGACACG GTCATCACCA ACGCGGTGGT CGTCGATCAC
TGGGGCATCG TCAAGGCCGA CGTCGCGATC AAGGCCGGCA TGATTCATGC GATCGGCAAG
GCCGGCAATC CGGACATCCA GCCCAATGTC GATATCATCA TCGGGCCGGG CACCGATATC
ATCGCCGGCG AGGGCAAGAT CCTCACCGCC GGCGGCTTCG ACAGCCACAT CCATTTCATC
TGCCCGCAGC AGATCGAGCA CGCGCTGATG AGCGGCGTCA CCACGATGCT CGGCGGCGGC
ACCGGCCCGT CGCACGGCAC CTTCGCGACG ACGTGCACGC CGGGGCCGTG GCACATCGGC
CGGATGATTC AGTCGTTCGA TGCCTTTCCG GTCAATCTCG GCATTTCCGG CAAGGGCAAC
GCGGCGCTGC CCGGCGCGCT GATCGAGATG GTCGAGGGCG GCGCCTGCGC GCTGAAGCTG
CACGAGGACT GGGGCACGAC GCCGGCGGCG ATCGACAACT GCCTCACCGT CGCCGACGAT
CACGACGTGC AGGTGATGAT CCATTCCGAC ACGCTGAACG AGTCGGGCTT CGTCGAGGAC
ACCATCAAGG CGTTCAAGGG CCGCACCATC CACGCCTTCC ACACCGAAGG CGCCGGCGGC
GGTCACGCGC CGGACATCAT CAAGGTCGCC GGCCTGGCGA ACGTGCTGCC GTCGTCGACC
AATCCGACCC GGCCGTTCAC CCGCAACACC ATCGACGAGC ATCTCGACAT GCTGATGGTG
TGCCACCATC TCGATCCGTC GATCGCCGAA GATCTGGCCT TCGCCGAGAG TCGCATCCGC
AAGGAGACGA TCGCGGCGGA GGATATTCTT CACGACCTCG GCGCGCTGTC GATGATGTCG
TCGGACAGTC AGGCGATGGG CCGGCTCGGC GAAGTCATCA TCCGCACCTG GCAGACCGCC
GACAAGATGA AGAAGCAGCG CGGAAGCCTG CCGCAGGACT CGTCGCGCAA TGATAATTTC
CGGGTCAAGC GCTACATCGC CAAGTACACC ATCAATCCGT CGATCGCGCA TGGCGTGTCG
AAGCTGATCG GTTCGGTCGA GACCGGCAAG ATGGCGGATC TGGTGCTGTG GTCGCCGGCG
TTCTTCGGCG TCAAGCCGGA TTGCATCATC AAGGCGGGCA TGATCGTGGC GGCGCCGATG
GGCGATCCCA ACGCCTCGAT CCCGACGCCG CAGCCGGTGC ACTACCAGCC GATGTTCGGC
GCCTATGGCC GCGCGCTCAC CGCGTCGTCG GTGGTGTTCA CCTCGCAGGC CGCCGCGGCC
GGCCCTCTCG CGCGCGACCT CGGCATCGCC AAGGCGCTGT ATCCGGTCAG CAATGTCCGT
GGCGGCATCT CGAAGAAGAG CATGATCCAC AACGACGCCA CGCCGACCAT CGAGGTCGAT
CCCGAAACCT ACGAAGTCCG CGCCGACGGC GAACTCCTGA CCTGCGCCCC CGCCGAAGTG
CTGCCGATGG CGCAGCGCTA CTTCATGTAC TGA
 
Protein sequence
MSTRISRSVY ADMFGPTTGD RVRLADTDLI IEVEKDYTTY GEEVKFGGGK VIRDGMGQSQ 
VTNKDGAADT VITNAVVVDH WGIVKADVAI KAGMIHAIGK AGNPDIQPNV DIIIGPGTDI
IAGEGKILTA GGFDSHIHFI CPQQIEHALM SGVTTMLGGG TGPSHGTFAT TCTPGPWHIG
RMIQSFDAFP VNLGISGKGN AALPGALIEM VEGGACALKL HEDWGTTPAA IDNCLTVADD
HDVQVMIHSD TLNESGFVED TIKAFKGRTI HAFHTEGAGG GHAPDIIKVA GLANVLPSST
NPTRPFTRNT IDEHLDMLMV CHHLDPSIAE DLAFAESRIR KETIAAEDIL HDLGALSMMS
SDSQAMGRLG EVIIRTWQTA DKMKKQRGSL PQDSSRNDNF RVKRYIAKYT INPSIAHGVS
KLIGSVETGK MADLVLWSPA FFGVKPDCII KAGMIVAAPM GDPNASIPTP QPVHYQPMFG
AYGRALTASS VVFTSQAAAA GPLARDLGIA KALYPVSNVR GGISKKSMIH NDATPTIEVD
PETYEVRADG ELLTCAPAEV LPMAQRYFMY
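
The gene sequence begins with GTG while the protein begins with M. Under translation table 11, GTG can serve as an alternative start codon and is read as methionine at the initiation position. The following is a minimal sketch of that translation, not the database's own pipeline. It assumes the codon assignments of table 11 match the standard code and recognizes only ATG, GTG and TTG as start codons (table 11 lists further, rarer ones). The names `translate`, `CODON_TABLE` and `START_CODONS` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading a recognized start codon as M and stopping at the first stop."""
    seq = "".join(dna.split()).upper()  # drop the spaces and newlines used for display
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGAGCACCA GAATCTCGCG TTCCGTCTAT"))  # MSTRISRSVY
```

The printed residues match the first ten residues of the protein sequence in the record.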