Gene A9601_08951 details

Gene Information

Locus tag: A9601_08951
Symbol: ureC
ID: 4717601
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 768414
End bp: 770123
Gene Length: 1710 bp
Protein Length: 569 aa
Translation table: 11
GC content: 37%
IMG OID: 640078607
Product: urease subunit alpha
Protein accession: YP_001009286
Protein GI: 123968428
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0804] Urea amidohydrolase (urease) alpha subunit
TIGRFAM ID: [TIGR01792] urease, alpha subunit
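
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon (not part of the protein).
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(768414, 770123)
print(length, "bp")
print(protein_length_aa(length), "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.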


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTTACA AAATTGATAG AAAAACTTAT GCTCAAACTT ACGGACCTAC TACAGGAGAT 
AGAGTAAGGC TTGCTGATAC CGAACTGTTT ATAGAAGTAG AAAAGGATTT AACTACATAC
GGAGATGAAG TTAAATTTGG TGGAGGTAAA GTTATTCGAG ATGGGATGGG ACAGTCTCAA
GTAAGAAGAG CTGATGGAGC TGTAGATACC GTAATAACTA ATGCTTTGAT CGTAGATTGG
TGGGGAATTA TTAAGGCTGA TGTGGGTATA AAAGATGGAA TGATTTTTGA AATTGGTAAG
GCTGGCAATC CAGATATCCA GGATAATGTT GATATTGTTA TTGGTGCATC AACAGAAGTA
ATAGCTGGAG AGGGGCATAT TCTTACTGCA GGTTCAATAG ATACCCATAT TCACTTTATC
TGTCCCCAAC AAATTGAGAC AGCACTATCC TCTGGAATTA CAACCATGTT GGGAGGAGGA
ACAGGACCTG CAACTGGCAC AAATGCGACT ACTTGTACTC CTGGTTCTTT TCATATTTCA
AGAATGCTTC AATCTGCAGA AGCATTTCCT ATGAATTTAG GTTTTTTTGG AAAAGGAAAC
TCAACAAACG AGATCAATCT TATTGATCAG GTTGAGGCTG GTGCTTGTGG TTTGAAGCTT
CATGAAGATT GGGGGACCAC CCCTTCTACA ATAAATTCTT GTCTAAATGT TGCAGATAAA
TTTGACGTAC AAGTATGTAT TCATACTGAT ACTTTGAATG AGGCAGGCTT TGTTGAAGAT
ACCATCAACG CTATTGCAGG AAGAACTATT CATACTTTTC ATACCGAAGG AGCAGGTGGA
GGTCATGCTC CAGACATTAT AAAAATCTGT GGAGAAAAAA ATGTTCTTCC TAGTAGTACA
AATCCAACAA GACCTTATAC AAGAAACACA TTAGAAGAAC ATCTTGACAT GTTAATGGTT
TGTCATCATT TAGATTCTAA AATCCCAGAA GACATTGCAT TTGCTGAGTC AAGGATAAGA
AGAGAGACTA TTGCAGCTGA GGATATCTTG CATGATTTAG GTGCCTTTTC AATAATTGCT
AGTGATTCTC AAGCTATGGG AAGAGTTGGC GAAGTAATTA CAAGAACTTT TCAAACCGCA
CATAAAATGA AAGTCCAAAG GGGGCCGCTA TCGCAGGATT CTGATAGAAA CGATAACTAT
AGAGTGAAGA GATATATTTC TAAAGTCACA ATTAATCCTG CAATAGCTCA TGGTATTGAT
AAACATGTTG GGTCTATAGA AAAGGGTAAA ATTGCAGATT TGGTATTGTG GAAACCTTCC
TTTTTTGCGG TGAAGCCTGA ATTAGTTGTT AAAGGAGGAT CTATAGTTTG GTCCCAAATG
GGTGATGCAA ATGCTTCAAT TCCTACTCCA GGTCCCGTAC ATGGTCGGCC TATGTTTGCA
AGTTTCGGCC AATCTCTAAT TAAGAGTTCT TTTACCTTTC TAAGTAAAAA TTCAATTGAA
CAAAATATTC CAAATAAATT AGGCTTACAA AAAAAATGTA TTGCCGTAGA AAATACAAGA
AATATCAATA AATCAAACTT AAAACTTAAT ACTAAACTAC CAAATATTTC AGTTGATCCT
CAAACTTATG AAGTTTTTTC TGATGGAGAA CTTCTTACTT GTGAACCACT TGATGAAGTC
CCAATGGCTC AGAGGTATTT TTTGCTTTAG
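
The gene sequence is laid out in blocks of ten bases separated by spaces, so it needs to be cleaned before any analysis. As a minimal sketch, the helpers below strip the whitespace and compute a GC percentage of the kind reported in the GC content field. The function names are hypothetical, and how the database rounds its reported value is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage: paste the full gene sequence block between the triple quotes.
first_line = "ATGTCTTACA AAATTGATAG AAAAACTTAT GCTCAAACTT ACGGACCTAC TACAGGAGAT"
print(len(clean_sequence(first_line)), "bases in the first line")
print(f"GC of first line: {gc_percent(first_line):.1f}%")
```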
 
Protein sequence
MSYKIDRKTY AQTYGPTTGD RVRLADTELF IEVEKDLTTY GDEVKFGGGK VIRDGMGQSQ 
VRRADGAVDT VITNALIVDW WGIIKADVGI KDGMIFEIGK AGNPDIQDNV DIVIGASTEV
IAGEGHILTA GSIDTHIHFI CPQQIETALS SGITTMLGGG TGPATGTNAT TCTPGSFHIS
RMLQSAEAFP MNLGFFGKGN STNEINLIDQ VEAGACGLKL HEDWGTTPST INSCLNVADK
FDVQVCIHTD TLNEAGFVED TINAIAGRTI HTFHTEGAGG GHAPDIIKIC GEKNVLPSST
NPTRPYTRNT LEEHLDMLMV CHHLDSKIPE DIAFAESRIR RETIAAEDIL HDLGAFSIIA
SDSQAMGRVG EVITRTFQTA HKMKVQRGPL SQDSDRNDNY RVKRYISKVT INPAIAHGID
KHVGSIEKGK IADLVLWKPS FFAVKPELVV KGGSIVWSQM GDANASIPTP GPVHGRPMFA
SFGQSLIKSS FTFLSKNSIE QNIPNKLGLQ KKCIAVENTR NINKSNLKLN TKLPNISVDP
QTYEVFSDGE LLTCEPLDEV PMAQRYFLL
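
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds a codon table from the standard NCBI base ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs mainly in which codons may act as start codons. This simplified version does not handle alternative start codons (not needed here, since the gene begins with ATG) and stops at the first stop codon. The names used are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence until the first stop codon.

    Whitespace is ignored, so the blocked layout used on this page
    can be passed in directly.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 30 bases of the gene sequence shown on this page.
print(translate("ATGTCTTACA AAATTGATAG AAAAACTTAT"))
print(translate("CCAATGGCTC AGAGGTATTT TTTGCTTTAG"))
```

The two fragments translate to the first ten and last nine residues of the protein sequence listed above.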