Gene Information |
Locus tag | P9303_29811 |
Symbol | ureC |
ID | 4778412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2632903 |
End bp | 2634627 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640088505 |
Product | urease subunit alpha |
Protein accession | YP_001018976 |
Protein GI | 124024669 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0804] Urea amidohydrolase (urease) alpha subunit |
TIGRFAM ID | [TIGR01792] urease, alpha subunit |
|
|
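The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the reported protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 2632903
end_bp = 2634627
reported_gene_length = 1725      # bp
reported_protein_length = 574    # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# An unspliced CDS should contain a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length rows of the table.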
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTACC GGATGGATCG CCAGGCCTAC GCCGAAACCT ACGGCCCAAC CACAGGCGAC CGCATGCGCC TGGCAGATAC CGAATTGATC CTGGAGGTGG AACGCGATTT CACCACCTAC GGCGAAGAAG TCAAATTCGG TGGAGGGAAA GTGATCCGCG ACGGCATGGG GCAATCCCAG CAATCCCGCG CCAATGGTGC TGTTGACACC GTGATCACCA ATGCCCTGAT CCTCGACTGG TGGGGGATCG TCAAAGCAGA TATCGGCCTG CGAGATGGAC GGATCGTTGC CATCGGCAAG GCTGGCAATC CCGACATCAC TGACGGGATT GACATTGTGA TCGGCCCAGG CACTGAAGCC ATTGCTGGAG AGGGTCACAT CGTGACAGCC GGCGCCATCG ATAGCCACAT CCATTTCATC TGCCCACAGC AAATCGAGAC TGCCCTGGCT AGCGGCGTTA CCACCATGCT CGGTGGTGGT ACAGGTCCGG CAACAGGCAC CAATGCCACT ACCTGTACAC CTGGCTCCTT TCACATCAGC CGCATGCTCC AAGCAGCAGA AGGATTGCCG ATGAATCTTG GCTTCTTTGG TAAAGGCAAT GCCAGTACAA CTGAAGCTCT CGAGGAACAA GTGCTAGCCG GAGCCTGCGG CCTCAAACTC CACGAAGACT GGGGTACCAC TCCCGCAGCT ATTGACTGCT GTCTTTCGGT AGCCGATCGC TTCGATGTCC AGGTCTGCAT CCACACAGAC ACGCTCAATG AAGCCGGTTT TGTAGAAGAC ACAATCCGAG CCATCGGCGG ACGCACCATC CACACCTTCC ATACCGAAGG CGCCGGTGGA GGCCACGCAC CAGACATCAT CCGTATCTGT GGTGAAAGCA ACGTGCTGCC CAGCTCCACA AATCCAACCC GGCCTTACAC CCGCAACACC CTGGAAGAGC ACCTCGACAT GCTCATGGTT TGCCACCACC TAGATCCAGC GATTCCTGAA GATGTGGCCT TTGCCGAATC GCGCATCCGT CGCGAAACAA TCGCTGCGGA AGATATTCTC CACGACCTCG GTGCCTTCAG CATCATTGCC AGTGATTCCC AAGCGATGGG ACGAGTCGGA GAGGTGATTA CAAGAACATT CCAGACCGCT CACAAGATGA AAGTTCAGAG AGGCCCTCTG CCAGAAGATG CTGCAAATCC ACGTGGCACT CGTAACGACA ACAACCGCCT AAAGCGCTAC ATCGCCAAGG TAACGATCAA CCCCGCTATT GCTCACGGCA TTGACAACCA TGTTGGCTCA GTAGAGGTAG GCAAACTGGC AGACTTGGTG CTCTGGAAGC CAGGCTTCTT CGGCGTCAGG CCAGAACTTG TGATCAAGGG CGGGTCAATC ATCTGGGCGC AAATGGGCGA TGCTAATGCC TCGATCCCAA CACCTGGACC AGTCCATGGC AGACCAATGT TTGCAGCATT CGGCAAAGCC CTTGCCCCCA GCTGCCTCAC CTTCCTGAGC CAAGCGGCCA TCGAAACAGA TCTTCCAAAC AAGCTGGGGC TGCAACGTGC CTGCATTCCC GTTCTGAACA CACGCACAAT CGGCAAAGCA GAGATGCACA ACAACAATTC ACTACCAAAA GTAGAGGTAG ATCCACAAAC TTACGAGGTG TTCGCCGACG GCGAATTACT CACCTGCGAC CCCGCAGAAG AACTACCAAT GGCCCAGCGA TATCTCCTAA TCTAA
|
Protein sequence | MAYRMDRQAY AETYGPTTGD RMRLADTELI LEVERDFTTY GEEVKFGGGK VIRDGMGQSQ QSRANGAVDT VITNALILDW WGIVKADIGL RDGRIVAIGK AGNPDITDGI DIVIGPGTEA IAGEGHIVTA GAIDSHIHFI CPQQIETALA SGVTTMLGGG TGPATGTNAT TCTPGSFHIS RMLQAAEGLP MNLGFFGKGN ASTTEALEEQ VLAGACGLKL HEDWGTTPAA IDCCLSVADR FDVQVCIHTD TLNEAGFVED TIRAIGGRTI HTFHTEGAGG GHAPDIIRIC GESNVLPSST NPTRPYTRNT LEEHLDMLMV CHHLDPAIPE DVAFAESRIR RETIAAEDIL HDLGAFSIIA SDSQAMGRVG EVITRTFQTA HKMKVQRGPL PEDAANPRGT RNDNNRLKRY IAKVTINPAI AHGIDNHVGS VEVGKLADLV LWKPGFFGVR PELVIKGGSI IWAQMGDANA SIPTPGPVHG RPMFAAFGKA LAPSCLTFLS QAAIETDLPN KLGLQRACIP VLNTRTIGKA EMHNNNSLPK VEVDPQTYEV FADGELLTCD PAEELPMAQR YLLI
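The relationship between the Gene sequence and Protein sequence rows can be illustrated with a short translation routine. This is only a sketch: it assumes the codon assignments of NCBI translation table 11, which are the same as those of the standard code (the tables differ in permitted start codons, which this sketch ignores), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical. The 10-character blocks used in the record are joined by stripping whitespace.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. Assumption: table 11
# assigns the same amino acids to codons as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks and last two blocks of the gene sequence shown above.
print(translate("ATGGCCTACC GGATGGATCG CCAGGCCTAC"))  # MAYRMDRQAY
print(translate("TATCTCCTAA TCTAA"))                  # YLLI
```

The two printed fragments correspond to the first and last blocks of the Protein sequence row. Passing the full Gene sequence string to `translate` and `gc_percent` would allow the Protein sequence and GC content fields to be compared in the same way.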