Gene Information |
Locus tag | P9303_13161 |
Symbol | |
ID | 4777090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1120881 |
End bp | 1124123 |
Gene Length | 3243 bp |
Protein Length | 1080 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640086824 |
Product | hypothetical protein |
Protein accession | YP_001017328 |
Protein GI | 124023021 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3325] Chitinase |
TIGRFAM ID | |
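The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the listed gene length matches that convention) and that the protein length excludes the terminal stop codon. The function names are illustrative, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record: Start bp, End bp.
length_bp = gene_length_bp(1120881, 1124123)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)  # 3243 1080
```

Both results agree with the Gene Length (3243 bp) and Protein Length (1080 aa) fields.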
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.421524 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | TTGCGCTGTA TTCAAGACAA GAATTCTTGG GTTAGCGGAT TTACTGCAAT GGCCTTTGAA CTTGGTGGGC AAACCTATGC GGTTAACGCC TCAGGGGCAG ACATCACGGG CTTCGATCCA TCTCGCGATC GTCTTGATTT TGGCGATATT TCCGTACACG GACTAATCCT TGGCAAGCTT GTCGATGACA CGGCGGTGCT TGTTAATCCT TGGCAAGATA GTGATTATCA AAGGATTCTT GATCACAACG GCAATGGAAT TAACTGGAAC CAGCTAACGC TTGAAAATTT CGCCCCGGTT GGAAATGAAC ACTTGCGGGA GGACATCGGC GGTGTGATGT CCTGGGAGTT GGGTATCGGT CCGCGCGAGG CGGACACGGT CTACATACGT TCCCATGAAT ATGGCGTCCA TGAGCGGGTT GAGAACTTTG ATCCCCAGAC CCAGAAGCTG AACTTTCTAT ATCTAGGTAC ACGTGAGCGC TTGTCATTGA CCGACACCGA CGAGGGCCTA CTGATTTCAG TGGACCCCTC ATCACAGAGC TTGCTGTTGG TGGGAGTGAA GCGTACTGAT TTGTATGCAG GCAACCTGGA GTTCCATTTT GACCAGGTAA TGGAAGACAA CCTTGAGGAA CCCTTTGGGG TTGCTGAGGA TGCCGTCAGC CTGGTGAGTC GGGAGTTATT GCTGACACCG CAGTCAATTG GAGGTGCAAC GACCGATGGC TACCAGGTGC GTTCTGGCCA GTTGGTTCAG GCGGCTGAAA CGCTAACCAT CAACGAGGTT GACCTCAGCA TGCATCACGG CACGGATCAC AGCGGTATGG ATCACAGCGC CATTGAGTCT GATATGTCTA CTGGTGATGG CGCGTTGGTC AGTAATGGTC CGCTGTCGCT TGAGGTGAGT GGTTCCCTGT ATTGGGGAGG CATGAGTGGA AAGCTAACGC TCACAAATTC CGGCAATACA GATCTAGATG GCTGGTCGGT GTCTTTCGTG ACTCCGCATA CAAACTTCCA GAGCTGGGCT GGAGATGCTC AGATTGAGTC GTTGGCGGAT GGTACCAACA GGATCACATT GAGACCTGCA TCCTGGAACC AGAGCATCGC AATTGGCCAG AGTATCGAGG TGAGTTTCAA CGCTCAGAGC GTGGGTCTGC CAAATAGTGG CAGTTTGAAC AGCGAACTGT TCTTTGCTGA CGGTCAGACA CAGATGCCAT CAGGCGGCAT CACTGTTGAG GCGGATCCTA TGCAGCCTCA GGAGGCTGAG ACGTCTAGTA CCGCGACGAC CACTGATTTT GAGCCTCAGA CGGGGACCAA CACCGATGAT AATCAAATCG GTATGGATCA CAGCGCCATT GGGTCTGATA TGTCTACTGG TGATGCCGCG TTGGCGAGCA ATGGTCCGCT GTCGCTTGAG GTGAGTGGTT CCCTGTATTG GGGAGGCATG AGTGGAAAGC TAACGCTCAC AAATTCCGGC AATACAGATC TAGATGGCTG GTCGGTGTCC TTCGTGACTC CGCATACAAA CTTCCAGAGC TGGGCCGGAG ATGCTCAGAT TGAGTCGTTG GCGGATGGTA CCAACCGGAT CACATTGACA CCTGCATCCT GGAACCAGAG CATCGCAATT GGCCAGAGTA TCGAGGTGAG TTTCAACGCT CAGAGCGTGG GTCTGCCAAA TAGTGGCAGT TTGAACAGCG AACTGTTCTT TGCTGACGGT CAGACACAGA TGCCATCAGG CGGCATCGCT GTTGAGGCGG ATCCTCTTCA GCCTCAGGAG GCTCAGACGT CTAGTACCGC GACGACCACT 
GATTTTGGGC CTCAGACGGG GATCAACGAC GACGCGCATC TATTGGAGGT GTCTTCTACG GCCATCGCAG ATGGGTCTAA GCGGATCGTG GGCTATTTCG AAGAGTGGGG TATCTACTCC CGCGACTTTT TGGTGCAAGA CATCAATGTC GAAGACTTGA CCCACATCAA CTACTCCTTT TTCGATGTTA AGGCCAATGG AGATGTCAAC CTTTTTGATT CTTGGGCTGC CACCGACAAG CGTTACAGCG CCGAGGAGCA AGTTAGCCGT ACCTTTAGTG CCGACGAGTG GGCCGCCCTG GACGATTCAC GTCGCTCCAG CTATACGTCT GGTTCTGAAT TTACGACTCG CACCAATGGG AATGGAAGCG TGAGCGTGAG TGGTGTACCA GTGGGCTGGG ACGTTAACGG TGAGCTTGCA GGCAACCTGC GTCAGTTTGC TCTTTTGAAG CAACTGAATC CCGACATCAG TCTTGGCCTT GCCCTTGGTG GTTGGACCTT GTCCGACGAG TTCAGCCTTG CCTTTGATGA TGTGCCCGGC CGTGAGAGGT TTACTGACAA CGTCATTTCA ACACTCGAGA CTTACGACTT TTTCAATACC GTTGATTTCG ACTGGGAGTA TCCAGGAGGT GGTGGTCTTA GCGGTAATGC TTCCAGTGAT CAGGACGGCG CTAACTTCGC GGCGACGCTG AAGGTTTTGC GTCAGAAGAT GGATCTCCTC GAGACTCGTA CCGGCGAGGA CTTCGAGATC TCAATTGCTA CCGCGGGAGG TCAAGAGAAG CTGGCTAATC TCAATCTGCC GGCAATTGAT GCTTACGTCG ATTTTTATAA TGTGATGACC TATGACTTCC ATGGCGGCTG GGAGTCTGTT ACAGGACACC AGGCTGCGAT GACGGCAGAT GCTGCTGGTT ATGACGTCGT GACTGCCATT CAGCAGTTCA GGAATGCTGG AATTGCCCCC GAGAAGGTGG TATTGGGAGC ACCGACTTAC ACGAGGGCAT GGGGTGGCGT CGACAGTGGT GAAAAGCTTG GTTATGGCGA GCTGGGCTCT GCAAGCTCTG CTCCCGGTTC ATATGAGGCT GGCAATTATG ACCAGAAGGA TCTTGTTACT GGCATCAATA ATGGCTCCTA TGACCTTGCC TGGGACGACG ATGCCAAGGC TGCCTATCTC TACAACGATC AGGAGCAGAT CTGGAGTTCG ATCGAGACAC CAAGCACAAT TGCAGGTAAA GCTGCTTACG TCGATGCCGC TGAGCTGGGC GGAATGATGT TCTGGGCATT ATCCAGCGAT AGTTCTGGTG AGCAGAGCTT GATTGGTGCT GCGTCCGATC TTCTTCGTGG CGGGGTCTCT CCTGATCTGG TTATTGCACG TAGTCCTGGT TTCGATGTTG TGTTCGGTGG TGATGGGCAG TTCAACATCA GCGACTTCAC CACTCTTGCC TGA
|
Protein sequence | MRCIQDKNSW VSGFTAMAFE LGGQTYAVNA SGADITGFDP SRDRLDFGDI SVHGLILGKL VDDTAVLVNP WQDSDYQRIL DHNGNGINWN QLTLENFAPV GNEHLREDIG GVMSWELGIG PREADTVYIR SHEYGVHERV ENFDPQTQKL NFLYLGTRER LSLTDTDEGL LISVDPSSQS LLLVGVKRTD LYAGNLEFHF DQVMEDNLEE PFGVAEDAVS LVSRELLLTP QSIGGATTDG YQVRSGQLVQ AAETLTINEV DLSMHHGTDH SGMDHSAIES DMSTGDGALV SNGPLSLEVS GSLYWGGMSG KLTLTNSGNT DLDGWSVSFV TPHTNFQSWA GDAQIESLAD GTNRITLRPA SWNQSIAIGQ SIEVSFNAQS VGLPNSGSLN SELFFADGQT QMPSGGITVE ADPMQPQEAE TSSTATTTDF EPQTGTNTDD NQIGMDHSAI GSDMSTGDAA LASNGPLSLE VSGSLYWGGM SGKLTLTNSG NTDLDGWSVS FVTPHTNFQS WAGDAQIESL ADGTNRITLT PASWNQSIAI GQSIEVSFNA QSVGLPNSGS LNSELFFADG QTQMPSGGIA VEADPLQPQE AQTSSTATTT DFGPQTGIND DAHLLEVSST AIADGSKRIV GYFEEWGIYS RDFLVQDINV EDLTHINYSF FDVKANGDVN LFDSWAATDK RYSAEEQVSR TFSADEWAAL DDSRRSSYTS GSEFTTRTNG NGSVSVSGVP VGWDVNGELA GNLRQFALLK QLNPDISLGL ALGGWTLSDE FSLAFDDVPG RERFTDNVIS TLETYDFFNT VDFDWEYPGG GGLSGNASSD QDGANFAATL KVLRQKMDLL ETRTGEDFEI SIATAGGQEK LANLNLPAID AYVDFYNVMT YDFHGGWESV TGHQAAMTAD AAGYDVVTAI QQFRNAGIAP EKVVLGAPTY TRAWGGVDSG EKLGYGELGS ASSAPGSYEA GNYDQKDLVT GINNGSYDLA WDDDAKAAYL YNDQEQIWSS IETPSTIAGK AAYVDAAELG GMMFWALSSD SSGEQSLIGA ASDLLRGGVS PDLVIARSPG FDVVFGGDGQ FNISDFTTLA
|
| |
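The gene sequence begins with TTG while the protein sequence begins with M. This follows from translation table 11 (the bacterial, archaeal and plant plastid code), in which TTG is an accepted start codon and an initiating codon is translated as methionine. The following is a minimal, standard-library sketch of that translation. The helper name `translate_cds` is hypothetical, and the start-codon set is deliberately restricted to three codons (ATG, GTG, TTG), which is a subset of the starts table 11 permits.

```python
from itertools import product

# Amino acids for the 64 codons in T, C, A, G order (third base varies fastest).
# Table 11 assigns the same amino acids as the standard code; it differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only these three table 11 start codons are handled in this sketch.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a recognised start codon in first position as M."""
    clean = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(clean) - len(clean) % 3, 3):
        codon = clean[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        if aa == "*":
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("TTGCGCTGTA TTCAAGACAA GAATTCTTGG"))  # MRCIQDKNSW
```

The output matches the first ten residues of the protein sequence. Applying the same function to the final in-frame bases of the gene (`TTCAACATCA GCGACTTCAC CACTCTTGCC TGA`) yields `FNISDFTTLA`, the last ten residues, with translation stopping at TGA.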