Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_21691 |
Symbol | |
ID | 4777500 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1924892 |
End bp | 1926343 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640087679 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001018169 |
Protein GI | 124023862 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0470032 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAATGGG GCATGCAGCA GATCGATGGG CTGACCAATA CTCCCCGCTG GGTTGCCCAG GCGGTGGTGT ATCAGATCTT CCCTGATCGT TTTCGTTGTA GTGGCCGTGT CTTAGCTCAT CAGCATTTGG CTTTGCGCTG CTGGGGCAGT GACCCTTCTG AGCAGGGTTT TCAGGGGGGA GATCTCTACG GGGTGATCGA GGCCCTTGAT CATCTCCAGG CGCTTGGTAT CAGCTGCCTT TACTTGACAC CCGTCTTTAG CTCTGCCGCT AACCATCGCT ATCACGCTTA TGACTACTTG CAGGTGGATC CGCTTCTTGG TGGCAATGCA GCGCTAGAGG CTTTGATTGA GGCGGTGCAT CGCCGCGGGA TGCGCATCAT TTTGGATGGC GTGTTTAATC ACTGTGGTCG CGGGTTCTGG GCTTTTCATC ATCTTTTGGA AAATGGTGAG GCTTCGCCTT ATCGCGATTG GTTTGAGGTG CGGCAATGGC CGCTTCATCC CTATCCACGG CGTGGGCAGG ATTGTGGTTA CAGCTGCTGG TGGAACGATC CAGCCTTGCC AAAGTTCAAT CATGCCCATG CCCCTGTGCG TGAGTATTTG ATTGCTGTAG CCCGCTATTG GCTCGAGCAG GGAATCGATG GTTGGCGACT TGACGTTGCT GATGAGGTGC CTGCTGAGTT TTGGCTGGAG TTTCGGCAAA TGGTTAAGGC CGTGAATCCA GACGCTTGGA TCTTGGCTGA GATCTGGGGT GATGCGAGAT CGTGGCTACA GGGGCAGCAC TTTGATGGTG TGATGAATTA TCGGATGGGT TGGAGCAGCC TTTGCTGGGT TGCTGGTAAG CGATTACGCC GTCGGTATCG CAATCCTGCC TATCCCCTTG ACCCTCTGAG TGGGGAGGCT TTTGTTGAGC TATTGGCAAC AACGCTGGGT TGGTATCGAC CTGAGGTGAA CCGCAGCCAG TTGAACCTGC TTGATAGCCA CGATGTGCCG AGAGCTCTGC ACACACTTCA CGGTGATCTT GCGGCGTTGA AGTTGGCCTT GCTGTTGCTG TTTTTGCAAC CAGGGGCGCC TTGCATCTAC TACGGCACAG AGGCGGGTTT GCAGGGTGGC CCTGAACCAG GTTGCCGCGA AGGGTTTCCT TGGCATACGC CTTGGCCTGC AGACCTGCGC GATTTCATTC AGTCGTTGAG TGATCTGCGC CAACGTTGCC CAGCGTTTGC TGATGGCGGT TTGCAATGGC AACCGATTGG AGCTGATGCA CTTCATGCTT GGTGGATGCA GCCCGAGACA ACCACAACGC AAAGGGAGAC GTCGATTCAG GTGTGGGTCA ATCGCAGTCG CAGGTCATGG TTGCCGACGA AAGTCTCATC GACAGACCCT CTTTGGCTGG AAGGAGCATT TGAATGCAAT GGCCGGGGAT TAGGCCCTCA ATCAGCAGTG TTGTTGAGCT GA
|
Protein sequence | MQWGMQQIDG LTNTPRWVAQ AVVYQIFPDR FRCSGRVLAH QHLALRCWGS DPSEQGFQGG DLYGVIEALD HLQALGISCL YLTPVFSSAA NHRYHAYDYL QVDPLLGGNA ALEALIEAVH RRGMRIILDG VFNHCGRGFW AFHHLLENGE ASPYRDWFEV RQWPLHPYPR RGQDCGYSCW WNDPALPKFN HAHAPVREYL IAVARYWLEQ GIDGWRLDVA DEVPAEFWLE FRQMVKAVNP DAWILAEIWG DARSWLQGQH FDGVMNYRMG WSSLCWVAGK RLRRRYRNPA YPLDPLSGEA FVELLATTLG WYRPEVNRSQ LNLLDSHDVP RALHTLHGDL AALKLALLLL FLQPGAPCIY YGTEAGLQGG PEPGCREGFP WHTPWPADLR DFIQSLSDLR QRCPAFADGG LQWQPIGADA LHAWWMQPET TTTQRETSIQ VWVNRSRRSW LPTKVSSTDP LWLEGAFECN GRGLGPQSAV LLS
|
| |