Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_01471 |
Symbol | |
ID | 4776603 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 161853 |
End bp | 165002 |
Gene Length | 3150 bp |
Protein Length | 1049 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640085646 |
Product | hypothetical protein |
Protein accession | YP_001016167 |
Protein GI | 124021860 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTTAT ACCCTCAGCC TTGGCCTCCT TTCCCAGTTA ATGAAATTTA CCCAAAATCA GTGAGAGGGC TGATAGACGA TTGGGCTCGT GCACCAGTCA GGTCACGTGA TCAGGTAGTT GATTGTCAGC TAGCACATTT GGTGTCAATT GTTGTTCCAT GCTTTGACCC TGATCCATCT CAATTTAGCC AGTTGTTGTT TTCTTTGCAG CAGCAAGGCG ATCAGGAGTT TGATGTAGTA TTGATTAATG ATGGATCGGA TAATAATTCC TGGAGAAATA TTCAACTTAA ACTTAAGAAC TTTCCTTGGA TACGGGTAAT TAATCAGCCT GAGAATAAAG GTATATCGGC AGCGTTAAAC TTGGCTGTGG ATAATATCCA GACTCCCTAT ATCGCAATCG TTGATCAGGA CGACTTGTTG CATCCGGCTG CGGTTTCGAT TGTAAATGGC TATCTCAGAG AGAATGTTGA TTGTCGTTTG CTTTACACTG ATCACCTTGC ATTTCAGAAT GATGGAAGCA AGGCAACATA TACCCCTAAG TTCCCTTGGA ATCCAGATGC TCTGCTTGAA TTTAATTATT TAATTCATCT TAGCGTCATC AGTGTGGATT CCTATCGTGC TTGTGGCGGG ATGAACTCTT ACTTTGACGG CATTCAAGAT TGGGAATTTT ATCTCCGCTT GGCAAAAGGA TTGACTTCGC AGACTGTGGC TTATTTGCCT CTGCCTCTTT ATGCTTGGCG ACTCTCAGAT CAATCTGTGG CATCCTCCGC AACACCGAAG CAGGAATTGC GCAATAAAGC CTTGGAATTT TTACAGCTTG CTCATCAAGA GCTAGGCGAA GGTACTCGCG CGATGCAATC TCCAGATGAT TCCAGTCACT ACAAATTTCG TGTGGACCGT AGTGATATGT CTAAGTTGCC TGTTTCATGC AATGTTTTGT TGCTAGGTGA GCGAGATTCG GAGAATCCTC TTCATCAAAC CCTTCAATCG CTGCAAAGCT CAGAATTATG TTTTGGGCAA ATATTTCTCA TTCAGACTCC TAATAGTTCT TTCTCAGCCT CAACTTCTTC AATGGTAGAA GGTTTGGATG TCAATCGGGT TGAAATCGGT GAACTAGCAT CTTCAATCCC TGATGATCAA CCTTTATTGG TTTTGCAAGT TGGGGTCTCC TCCAAGGGCA AGATTGATCC AGGTTTGAAT GCTTGGCTGG AGCGCACATC TCGTTGGCAG GTAATTACAT TCCCCGTTTG TTCTTCGGTA GATCTTCCGC TATGTGTCTC GGCTGGATAT GCAAGGCTAC CTGCAGTCTC AGATACCTAT ATTCCTATAG GGCAAGCTTT TACTAAAAAG AACTACAACA ATAACTTCGC TTTTTTCTCT CATGTTCGCC CTGTAGACCT TCCATCGCCT GCTGTTCAGT TGTTGCGCTC TTCTGTGGTA CGGCAGACGT TAACAACATT TGCCTCAAGT TGGGATCAGA AGATGGATAT TAGGGCTAAA TGGTGGAGCT GTTTGATGGA GCTAAGTTGG GATTGTTGCT GTATCTCTGA TGTTTGGTTC GGATTAGATA CTTTTCTTTC TGATGCTGAG CATCAGAAAC TGGTCCAGAA GAGAGAGCAA TCGCTTTTTG CCATAGAATC TCAAAATTGG TTGGGAACTA TATCACCTTG GAAGAAAGCC ATATATTCAT ATTGGCTTAA AGGGTCTTTA CGTGACTTGA CTGCCAGAGC TCATCCATTG CAGGCTGAAT TCTTTTTTAG CATCCACATA CCATTGGAGA TTCCCGCCGA TAATGTTTTT GCGGGCGAGC GTAATTCTTC CAATCTTTCC CTATTGCCTC AGCTTTTGCA TAGATCTCTG GTGATATTGA TTCCAACCGA ACTGAATCCC CGTAGTAATG GTCATGCATG CATTCTTAGT TTGGCTCTAA AGCTTATAGA TGCAGGACAT TCTGTTTATC TCCTGCCGTT TAAGCCATTT AAGTTTTTCC GAGAGTTTTA TCCTAATTTG CCTGAGAGAT TTCAAGACTT ACCTTTTATC TCTGATCCGC AAGGATTTTC ACAAGCAATG CTTTTGGTTC CAGAATCAAC ACCTAGAAGT TTAGTTAAAA GATTACGTCC ATACTTTAAG CAACTTATTT GGTGGGTTTT GGCTCCTGCA GGAGTTCTCA CTGAGTTTCG TCCAAATATT CGGATTGGAG ACTTTCTGGT TGCATTCTCT GAATTTGCTT TGCCTGGGCA GTCTGATTAT CTTTTCATCC ATCCTGATGT TGACCAAGAT GTCGATCCTC TTTTCCCAAG ATATTTGAAA CAATATCGCC ACCAACCTCC CCATAAAAAG CGTCTTCTTC TCTATACTGG AAAAGGAAGG CTTAAAGCAC TGCCGCGCAA TTTGCATCGC AACTTGTTGC ATTATGAAGT GACTTTGGTG ACAAGATCTT TCCCATCCAC TAAGGCTGAG TTGACAGATT TATTGATCAA TTCGAGTGGC TTGATAAGTT GTGACCCAAT GACAAATTTA AACTTGGAGG CAGCTATGTT GGGATTACCA GTTTATCTTC TTGCCAACCC TTTCCCCGCA GAATACTTTC GAAATTTTCC CGTTGATTTA TCTTATTCGA TAACTGATTC TGCAGAAGAC TTTATAGTTC GCTTAGCAGA TAAAGGACCT TTAAAGCAGT TGCGGACAGT AGGCATGGAA TTGAAAAGTC GATCTGCTGT TGATATTTTC GATTTACTTT TGTTGAATCC GCCTTTGCAT AATGATCATG CTCTTGATCA TGCTCTTACT CTTCCTGTTG GAGCATACCG AGTGAACGAG AGTACTCTCT ACCAGATAGA GCAATATAGA AAGTTTCTTA TGCGAAGCCG TACAATTCAA GCTTTGAAGG AAGGGCAGTC TTTCTCTTCA GCTTTTTTAG GTGAGTATGT TGACAGTCTT AAGTACCCTT TCTGGGCGCA TTCATTGATT TGTCAAGGAT TAGCTAGATT GGATGATCTG GCAGATTTTT TGGCGACTAT CAGAGTTCTC TACCTATGCT TGCTCCTGTG GAAAAAGCTT GGTTTAGCTA CTCTTTTCAG ATTCTTCCTT AATAGAATAA TCAACTTTAA CCGAATCATT GCAGTCAGCA TGTTGGCGAA AAAAGCATAA
|
Protein sequence | MSLYPQPWPP FPVNEIYPKS VRGLIDDWAR APVRSRDQVV DCQLAHLVSI VVPCFDPDPS QFSQLLFSLQ QQGDQEFDVV LINDGSDNNS WRNIQLKLKN FPWIRVINQP ENKGISAALN LAVDNIQTPY IAIVDQDDLL HPAAVSIVNG YLRENVDCRL LYTDHLAFQN DGSKATYTPK FPWNPDALLE FNYLIHLSVI SVDSYRACGG MNSYFDGIQD WEFYLRLAKG LTSQTVAYLP LPLYAWRLSD QSVASSATPK QELRNKALEF LQLAHQELGE GTRAMQSPDD SSHYKFRVDR SDMSKLPVSC NVLLLGERDS ENPLHQTLQS LQSSELCFGQ IFLIQTPNSS FSASTSSMVE GLDVNRVEIG ELASSIPDDQ PLLVLQVGVS SKGKIDPGLN AWLERTSRWQ VITFPVCSSV DLPLCVSAGY ARLPAVSDTY IPIGQAFTKK NYNNNFAFFS HVRPVDLPSP AVQLLRSSVV RQTLTTFASS WDQKMDIRAK WWSCLMELSW DCCCISDVWF GLDTFLSDAE HQKLVQKREQ SLFAIESQNW LGTISPWKKA IYSYWLKGSL RDLTARAHPL QAEFFFSIHI PLEIPADNVF AGERNSSNLS LLPQLLHRSL VILIPTELNP RSNGHACILS LALKLIDAGH SVYLLPFKPF KFFREFYPNL PERFQDLPFI SDPQGFSQAM LLVPESTPRS LVKRLRPYFK QLIWWVLAPA GVLTEFRPNI RIGDFLVAFS EFALPGQSDY LFIHPDVDQD VDPLFPRYLK QYRHQPPHKK RLLLYTGKGR LKALPRNLHR NLLHYEVTLV TRSFPSTKAE LTDLLINSSG LISCDPMTNL NLEAAMLGLP VYLLANPFPA EYFRNFPVDL SYSITDSAED FIVRLADKGP LKQLRTVGME LKSRSAVDIF DLLLLNPPLH NDHALDHALT LPVGAYRVNE STLYQIEQYR KFLMRSRTIQ ALKEGQSFSS AFLGEYVDSL KYPFWAHSLI CQGLARLDDL ADFLATIRVL YLCLLLWKKL GLATLFRFFL NRIINFNRII AVSMLAKKA
|
| |