Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_08951 |
Symbol | ureC |
ID | 4717601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 768414 |
End bp | 770123 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640078607 |
Product | urease subunit alpha |
Protein accession | YP_001009286 |
Protein GI | 123968428 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0804] Urea amidohydrolase (urease) alpha subunit |
TIGRFAM ID | [TIGR01792] urease, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTACA AAATTGATAG AAAAACTTAT GCTCAAACTT ACGGACCTAC TACAGGAGAT AGAGTAAGGC TTGCTGATAC CGAACTGTTT ATAGAAGTAG AAAAGGATTT AACTACATAC GGAGATGAAG TTAAATTTGG TGGAGGTAAA GTTATTCGAG ATGGGATGGG ACAGTCTCAA GTAAGAAGAG CTGATGGAGC TGTAGATACC GTAATAACTA ATGCTTTGAT CGTAGATTGG TGGGGAATTA TTAAGGCTGA TGTGGGTATA AAAGATGGAA TGATTTTTGA AATTGGTAAG GCTGGCAATC CAGATATCCA GGATAATGTT GATATTGTTA TTGGTGCATC AACAGAAGTA ATAGCTGGAG AGGGGCATAT TCTTACTGCA GGTTCAATAG ATACCCATAT TCACTTTATC TGTCCCCAAC AAATTGAGAC AGCACTATCC TCTGGAATTA CAACCATGTT GGGAGGAGGA ACAGGACCTG CAACTGGCAC AAATGCGACT ACTTGTACTC CTGGTTCTTT TCATATTTCA AGAATGCTTC AATCTGCAGA AGCATTTCCT ATGAATTTAG GTTTTTTTGG AAAAGGAAAC TCAACAAACG AGATCAATCT TATTGATCAG GTTGAGGCTG GTGCTTGTGG TTTGAAGCTT CATGAAGATT GGGGGACCAC CCCTTCTACA ATAAATTCTT GTCTAAATGT TGCAGATAAA TTTGACGTAC AAGTATGTAT TCATACTGAT ACTTTGAATG AGGCAGGCTT TGTTGAAGAT ACCATCAACG CTATTGCAGG AAGAACTATT CATACTTTTC ATACCGAAGG AGCAGGTGGA GGTCATGCTC CAGACATTAT AAAAATCTGT GGAGAAAAAA ATGTTCTTCC TAGTAGTACA AATCCAACAA GACCTTATAC AAGAAACACA TTAGAAGAAC ATCTTGACAT GTTAATGGTT TGTCATCATT TAGATTCTAA AATCCCAGAA GACATTGCAT TTGCTGAGTC AAGGATAAGA AGAGAGACTA TTGCAGCTGA GGATATCTTG CATGATTTAG GTGCCTTTTC AATAATTGCT AGTGATTCTC AAGCTATGGG AAGAGTTGGC GAAGTAATTA CAAGAACTTT TCAAACCGCA CATAAAATGA AAGTCCAAAG GGGGCCGCTA TCGCAGGATT CTGATAGAAA CGATAACTAT AGAGTGAAGA GATATATTTC TAAAGTCACA ATTAATCCTG CAATAGCTCA TGGTATTGAT AAACATGTTG GGTCTATAGA AAAGGGTAAA ATTGCAGATT TGGTATTGTG GAAACCTTCC TTTTTTGCGG TGAAGCCTGA ATTAGTTGTT AAAGGAGGAT CTATAGTTTG GTCCCAAATG GGTGATGCAA ATGCTTCAAT TCCTACTCCA GGTCCCGTAC ATGGTCGGCC TATGTTTGCA AGTTTCGGCC AATCTCTAAT TAAGAGTTCT TTTACCTTTC TAAGTAAAAA TTCAATTGAA CAAAATATTC CAAATAAATT AGGCTTACAA AAAAAATGTA TTGCCGTAGA AAATACAAGA AATATCAATA AATCAAACTT AAAACTTAAT ACTAAACTAC CAAATATTTC AGTTGATCCT CAAACTTATG AAGTTTTTTC TGATGGAGAA CTTCTTACTT GTGAACCACT TGATGAAGTC CCAATGGCTC AGAGGTATTT TTTGCTTTAG
|
Protein sequence | MSYKIDRKTY AQTYGPTTGD RVRLADTELF IEVEKDLTTY GDEVKFGGGK VIRDGMGQSQ VRRADGAVDT VITNALIVDW WGIIKADVGI KDGMIFEIGK AGNPDIQDNV DIVIGASTEV IAGEGHILTA GSIDTHIHFI CPQQIETALS SGITTMLGGG TGPATGTNAT TCTPGSFHIS RMLQSAEAFP MNLGFFGKGN STNEINLIDQ VEAGACGLKL HEDWGTTPST INSCLNVADK FDVQVCIHTD TLNEAGFVED TINAIAGRTI HTFHTEGAGG GHAPDIIKIC GEKNVLPSST NPTRPYTRNT LEEHLDMLMV CHHLDSKIPE DIAFAESRIR RETIAAEDIL HDLGAFSIIA SDSQAMGRVG EVITRTFQTA HKMKVQRGPL SQDSDRNDNY RVKRYISKVT INPAIAHGID KHVGSIEKGK IADLVLWKPS FFAVKPELVV KGGSIVWSQM GDANASIPTP GPVHGRPMFA SFGQSLIKSS FTFLSKNSIE QNIPNKLGLQ KKCIAVENTR NINKSNLKLN TKLPNISVDP QTYEVFSDGE LLTCEPLDEV PMAQRYFLL
|
| |