Gene RPD_3500 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3500 
SymbolureC 
ID4024014 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3890446 
End bp3892158 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content64% 
IMG OID637963704 
Producturease subunit alpha 
Protein accessionYP_570624 
Protein GI91977965 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0804] Urea amidohydrolase (urease) alpha subunit 
TIGRFAM ID[TIGR01792] urease, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGTGA AAATCTCGCG GTCCGTCTAT GCCGACATGT TCGGCCCGAC CACCGGCGAT 
CGCGTCCGGC TTGCCGACAC CGATCTGATC ATCGAGGTCG AGAAGGACTT CACGACCTAC
GGCGAGGAGG TGAAGTTCGG CGGCGGCAAG GTGATCCGCG ACGGCATGGG GCAGTCGCAG
GTGACCAACA AGGACGGCGC CGCCGACACG GTCATCACCA ACGCGCTGAT CGTCGATCAC
TGGGGCATCG TCAAGGCCGA CGTCGCAATC AAGGCCGGGA TGATCAGCGC GATCGGCAAG
GCCGGCAATC CGGACATCCA GCCGGGCGTC GATATCATCA TCGGCCCGGG CACCGACGTG
ATCGCGGGCG AGGGCAAGAT CCTCACCGCC GGCGGCTTCG ACAGTCACAT CCATTTCATC
TGCCCGCAGC AGATCGAACA TGCCTTGATG AGCGGCGTCA CCACCATGCT CGGCGGCGGC
ACCGGGCCAT CGCACGGCAC CTTCGCGACC ACCTGCACGC CGGGGCCGTG GCATATCGGC
CGGATGATTC AGTCGTTCGA TGCCTTCCCG GTCAATCTCG GCATTTCCGG CAAGGGCAAC
GCGGCGCTGC CCGGCGCGCT GATCGAGATG GTAGAGGGCG GCGCCTGCGC GCTGAAGCTG
CACGAGGACT GGGGCACGAC GCCGGCGGCG ATCGACAATT GCCTCACCGT CGCCGACGAT
CACGACGTGC AGGTGATGAT CCATTCCGAC ACGCTGAACG AGAGTGGCTT CGTCGAGGAC
ACCATCAAGG CGTTCAAGGG CCGCACCATC CACGCTTTCC ACACCGAGGG CGCCGGCGGC
GGCCACGCGC CGGACATCAT CAAAGTTGCC GGCCTGGAGA ACGTGCTGCC GTCCTCGACC
AATCCGACCC GGCCGTTCAC CCGCAACACC ATCGACGAGC ATCTCGACAT GCTGATGGTG
TGCCATCATC TCGATCCGTC GATCGCCGAG GATCTGGCGT TTGCCGAAAG CCGCATCCGC
AAGGAGACGA TCGCGGCGGA AGACATCCTG CACGATCTCG GCGCGCTGTC GATGATGTCG
TCGGACAGCC AGGCGATGGG CCGGCTCGGC GAAGTCATCA TCCGCACCTG GCAGACCGCC
GACAAGATGA AGAAGCAGCG CGGTTCGCTG TCGCAGGATT CCGCCCGCAA CGACAATTTC
CGCGTCAAGC GCTACATCGC CAAATACACC ATCAATCCGG CGATCGCGCA TGGCGTGTCG
AAGCTGATCG GTTCGGTCGA GACCGGCAAG ATGGCCGACC TCGTGCTGTG GTCGCCGGCG
TTCTTCGGCG TCAAGCCGGA TTGCATCGTC AAGGCGGGCA TGATCGTGGC GGCGCCGATG
GGCGATCCGA ATGCCTCGAT CCCGACGCCG CAGCCGGTGC ACTACCAGCC GATGTTCGGC
GCTTACGGCC GCGCGCTCAC CGCGTCGTCG GTGGTGTTCA CCTCGCAGGC TGCCGCAGCC
GGCCATCTTG CCCGTGACCT CGGCATCGCC AAGGCGCTGT ATCCGGTCAG CAATGTCCGT
GGCGGCATCT CGAAGAAGAG CATGATTCAC AACGACGCCA CGCCGAACAT CGAGGTCGAT
CCCGAAACTT ACGAAGTCCG AGCCGACGGC GAGTTGCTGA CCTGCGCGCC GGCCGAGGTG
CTGCCGATGG CGCAGCGCTA TTTCATGTAT TGA
 
Protein sequence
MSVKISRSVY ADMFGPTTGD RVRLADTDLI IEVEKDFTTY GEEVKFGGGK VIRDGMGQSQ 
VTNKDGAADT VITNALIVDH WGIVKADVAI KAGMISAIGK AGNPDIQPGV DIIIGPGTDV
IAGEGKILTA GGFDSHIHFI CPQQIEHALM SGVTTMLGGG TGPSHGTFAT TCTPGPWHIG
RMIQSFDAFP VNLGISGKGN AALPGALIEM VEGGACALKL HEDWGTTPAA IDNCLTVADD
HDVQVMIHSD TLNESGFVED TIKAFKGRTI HAFHTEGAGG GHAPDIIKVA GLENVLPSST
NPTRPFTRNT IDEHLDMLMV CHHLDPSIAE DLAFAESRIR KETIAAEDIL HDLGALSMMS
SDSQAMGRLG EVIIRTWQTA DKMKKQRGSL SQDSARNDNF RVKRYIAKYT INPAIAHGVS
KLIGSVETGK MADLVLWSPA FFGVKPDCIV KAGMIVAAPM GDPNASIPTP QPVHYQPMFG
AYGRALTASS VVFTSQAAAA GHLARDLGIA KALYPVSNVR GGISKKSMIH NDATPNIEVD
PETYEVRADG ELLTCAPAEV LPMAQRYFMY