Gene P9303_29811 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_29811 
SymbolureC 
ID4778412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2632903 
End bp2634627 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content56% 
IMG OID640088505 
Producturease subunit alpha 
Protein accessionYP_001018976 
Protein GI124024669 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0804] Urea amidohydrolase (urease) alpha subunit 
TIGRFAM ID[TIGR01792] urease, alpha subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTACC GGATGGATCG CCAGGCCTAC GCCGAAACCT ACGGCCCAAC CACAGGCGAC 
CGCATGCGCC TGGCAGATAC CGAATTGATC CTGGAGGTGG AACGCGATTT CACCACCTAC
GGCGAAGAAG TCAAATTCGG TGGAGGGAAA GTGATCCGCG ACGGCATGGG GCAATCCCAG
CAATCCCGCG CCAATGGTGC TGTTGACACC GTGATCACCA ATGCCCTGAT CCTCGACTGG
TGGGGGATCG TCAAAGCAGA TATCGGCCTG CGAGATGGAC GGATCGTTGC CATCGGCAAG
GCTGGCAATC CCGACATCAC TGACGGGATT GACATTGTGA TCGGCCCAGG CACTGAAGCC
ATTGCTGGAG AGGGTCACAT CGTGACAGCC GGCGCCATCG ATAGCCACAT CCATTTCATC
TGCCCACAGC AAATCGAGAC TGCCCTGGCT AGCGGCGTTA CCACCATGCT CGGTGGTGGT
ACAGGTCCGG CAACAGGCAC CAATGCCACT ACCTGTACAC CTGGCTCCTT TCACATCAGC
CGCATGCTCC AAGCAGCAGA AGGATTGCCG ATGAATCTTG GCTTCTTTGG TAAAGGCAAT
GCCAGTACAA CTGAAGCTCT CGAGGAACAA GTGCTAGCCG GAGCCTGCGG CCTCAAACTC
CACGAAGACT GGGGTACCAC TCCCGCAGCT ATTGACTGCT GTCTTTCGGT AGCCGATCGC
TTCGATGTCC AGGTCTGCAT CCACACAGAC ACGCTCAATG AAGCCGGTTT TGTAGAAGAC
ACAATCCGAG CCATCGGCGG ACGCACCATC CACACCTTCC ATACCGAAGG CGCCGGTGGA
GGCCACGCAC CAGACATCAT CCGTATCTGT GGTGAAAGCA ACGTGCTGCC CAGCTCCACA
AATCCAACCC GGCCTTACAC CCGCAACACC CTGGAAGAGC ACCTCGACAT GCTCATGGTT
TGCCACCACC TAGATCCAGC GATTCCTGAA GATGTGGCCT TTGCCGAATC GCGCATCCGT
CGCGAAACAA TCGCTGCGGA AGATATTCTC CACGACCTCG GTGCCTTCAG CATCATTGCC
AGTGATTCCC AAGCGATGGG ACGAGTCGGA GAGGTGATTA CAAGAACATT CCAGACCGCT
CACAAGATGA AAGTTCAGAG AGGCCCTCTG CCAGAAGATG CTGCAAATCC ACGTGGCACT
CGTAACGACA ACAACCGCCT AAAGCGCTAC ATCGCCAAGG TAACGATCAA CCCCGCTATT
GCTCACGGCA TTGACAACCA TGTTGGCTCA GTAGAGGTAG GCAAACTGGC AGACTTGGTG
CTCTGGAAGC CAGGCTTCTT CGGCGTCAGG CCAGAACTTG TGATCAAGGG CGGGTCAATC
ATCTGGGCGC AAATGGGCGA TGCTAATGCC TCGATCCCAA CACCTGGACC AGTCCATGGC
AGACCAATGT TTGCAGCATT CGGCAAAGCC CTTGCCCCCA GCTGCCTCAC CTTCCTGAGC
CAAGCGGCCA TCGAAACAGA TCTTCCAAAC AAGCTGGGGC TGCAACGTGC CTGCATTCCC
GTTCTGAACA CACGCACAAT CGGCAAAGCA GAGATGCACA ACAACAATTC ACTACCAAAA
GTAGAGGTAG ATCCACAAAC TTACGAGGTG TTCGCCGACG GCGAATTACT CACCTGCGAC
CCCGCAGAAG AACTACCAAT GGCCCAGCGA TATCTCCTAA TCTAA
 
Protein sequence
MAYRMDRQAY AETYGPTTGD RMRLADTELI LEVERDFTTY GEEVKFGGGK VIRDGMGQSQ 
QSRANGAVDT VITNALILDW WGIVKADIGL RDGRIVAIGK AGNPDITDGI DIVIGPGTEA
IAGEGHIVTA GAIDSHIHFI CPQQIETALA SGVTTMLGGG TGPATGTNAT TCTPGSFHIS
RMLQAAEGLP MNLGFFGKGN ASTTEALEEQ VLAGACGLKL HEDWGTTPAA IDCCLSVADR
FDVQVCIHTD TLNEAGFVED TIRAIGGRTI HTFHTEGAGG GHAPDIIRIC GESNVLPSST
NPTRPYTRNT LEEHLDMLMV CHHLDPAIPE DVAFAESRIR RETIAAEDIL HDLGAFSIIA
SDSQAMGRVG EVITRTFQTA HKMKVQRGPL PEDAANPRGT RNDNNRLKRY IAKVTINPAI
AHGIDNHVGS VEVGKLADLV LWKPGFFGVR PELVIKGGSI IWAQMGDANA SIPTPGPVHG
RPMFAAFGKA LAPSCLTFLS QAAIETDLPN KLGLQRACIP VLNTRTIGKA EMHNNNSLPK
VEVDPQTYEV FADGELLTCD PAEELPMAQR YLLI