Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1803 |
Symbol | ureC |
ID | 3908884 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2063240 |
End bp | 2064952 |
Gene Length | 1713 bp |
Protein Length | 570 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637883697 |
Product | urease subunit alpha |
Protein accession | YP_485422 |
Protein GI | 86748926 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0804] Urea amidohydrolase (urease) alpha subunit |
TIGRFAM ID | [TIGR01792] urease, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.659823 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.557794 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCACCA GAATCTCGCG TTCCGTCTAT GCCGACATGT TCGGTCCGAC CACCGGCGAC CGCGTCCGGC TCGCCGACAC CGATCTGATC ATCGAGGTCG AGAAGGACTA CACGACCTAT GGCGAGGAGG TGAAGTTCGG CGGCGGCAAG GTGATCCGCG ACGGCATGGG GCAGAGCCAG GTCACCAACA AGGACGGCGC CGCCGACACG GTCATCACCA ACGCGGTGGT CGTCGATCAC TGGGGCATCG TCAAGGCCGA CGTCGCGATC AAGGCCGGCA TGATTCATGC GATCGGCAAG GCCGGCAATC CGGACATCCA GCCCAATGTC GATATCATCA TCGGGCCGGG CACCGATATC ATCGCCGGCG AGGGCAAGAT CCTCACCGCC GGCGGCTTCG ACAGCCACAT CCATTTCATC TGCCCGCAGC AGATCGAGCA CGCGCTGATG AGCGGCGTCA CCACGATGCT CGGCGGCGGC ACCGGCCCGT CGCACGGCAC CTTCGCGACG ACGTGCACGC CGGGGCCGTG GCACATCGGC CGGATGATTC AGTCGTTCGA TGCCTTTCCG GTCAATCTCG GCATTTCCGG CAAGGGCAAC GCGGCGCTGC CCGGCGCGCT GATCGAGATG GTCGAGGGCG GCGCCTGCGC GCTGAAGCTG CACGAGGACT GGGGCACGAC GCCGGCGGCG ATCGACAACT GCCTCACCGT CGCCGACGAT CACGACGTGC AGGTGATGAT CCATTCCGAC ACGCTGAACG AGTCGGGCTT CGTCGAGGAC ACCATCAAGG CGTTCAAGGG CCGCACCATC CACGCCTTCC ACACCGAAGG CGCCGGCGGC GGTCACGCGC CGGACATCAT CAAGGTCGCC GGCCTGGCGA ACGTGCTGCC GTCGTCGACC AATCCGACCC GGCCGTTCAC CCGCAACACC ATCGACGAGC ATCTCGACAT GCTGATGGTG TGCCACCATC TCGATCCGTC GATCGCCGAA GATCTGGCCT TCGCCGAGAG TCGCATCCGC AAGGAGACGA TCGCGGCGGA GGATATTCTT CACGACCTCG GCGCGCTGTC GATGATGTCG TCGGACAGTC AGGCGATGGG CCGGCTCGGC GAAGTCATCA TCCGCACCTG GCAGACCGCC GACAAGATGA AGAAGCAGCG CGGAAGCCTG CCGCAGGACT CGTCGCGCAA TGATAATTTC CGGGTCAAGC GCTACATCGC CAAGTACACC ATCAATCCGT CGATCGCGCA TGGCGTGTCG AAGCTGATCG GTTCGGTCGA GACCGGCAAG ATGGCGGATC TGGTGCTGTG GTCGCCGGCG TTCTTCGGCG TCAAGCCGGA TTGCATCATC AAGGCGGGCA TGATCGTGGC GGCGCCGATG GGCGATCCCA ACGCCTCGAT CCCGACGCCG CAGCCGGTGC ACTACCAGCC GATGTTCGGC GCCTATGGCC GCGCGCTCAC CGCGTCGTCG GTGGTGTTCA CCTCGCAGGC CGCCGCGGCC GGCCCTCTCG CGCGCGACCT CGGCATCGCC AAGGCGCTGT ATCCGGTCAG CAATGTCCGT GGCGGCATCT CGAAGAAGAG CATGATCCAC AACGACGCCA CGCCGACCAT CGAGGTCGAT CCCGAAACCT ACGAAGTCCG CGCCGACGGC GAACTCCTGA CCTGCGCCCC CGCCGAAGTG CTGCCGATGG CGCAGCGCTA CTTCATGTAC TGA
|
Protein sequence | MSTRISRSVY ADMFGPTTGD RVRLADTDLI IEVEKDYTTY GEEVKFGGGK VIRDGMGQSQ VTNKDGAADT VITNAVVVDH WGIVKADVAI KAGMIHAIGK AGNPDIQPNV DIIIGPGTDI IAGEGKILTA GGFDSHIHFI CPQQIEHALM SGVTTMLGGG TGPSHGTFAT TCTPGPWHIG RMIQSFDAFP VNLGISGKGN AALPGALIEM VEGGACALKL HEDWGTTPAA IDNCLTVADD HDVQVMIHSD TLNESGFVED TIKAFKGRTI HAFHTEGAGG GHAPDIIKVA GLANVLPSST NPTRPFTRNT IDEHLDMLMV CHHLDPSIAE DLAFAESRIR KETIAAEDIL HDLGALSMMS SDSQAMGRLG EVIIRTWQTA DKMKKQRGSL PQDSSRNDNF RVKRYIAKYT INPSIAHGVS KLIGSVETGK MADLVLWSPA FFGVKPDCII KAGMIVAAPM GDPNASIPTP QPVHYQPMFG AYGRALTASS VVFTSQAAAA GPLARDLGIA KALYPVSNVR GGISKKSMIH NDATPTIEVD PETYEVRADG ELLTCAPAEV LPMAQRYFMY
|
| |