Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_12621 |
Symbol | |
ID | 5731309 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1137398 |
End bp | 1138513 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 641285631 |
Product | hypothetical protein |
Protein accession | YP_001551147 |
Protein GI | 159903803 |
COG category | [M] Cell wall/membrane/envelope biogenesis [S] Function unknown |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis [COG2246] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.172184 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00175811 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCAGATA TTTCTATCAG CAATTTAAGA CTCTCTATTG TTTTGCCAAC TTTTAGAGAG AGGGAAAATA TACCTCAAAT TGTTGAGCAA TTATTTAAGT TAGGAAGTTA TTATGAACTA GAGATTCTTA TCATAGATGA TGATTCCCGT GATGGTACTT TCGACTTTGT TAAAAATCTC TCTATAGAGG ATCATCGCGT AAGGATAATT AGACGGGTCG GTCGCTCTGG ACTGGCTAGT GCAATTAAAG AAGGCTTTTT AAATGCAACT GGTGATATTG TTGCTTTGAT GGATGCTGAT GGCCAACATC AACCAGTAGA TGTTTTTAAA GCAATTGATT ATTTAATTTC CAATAACTAT GACTTAGTAA TTGGGAGTAG ATTTTTAGAG AGAGCAAATA TTCTTGGATT AAGCCAAAGA AGGGTAGGTG GATCTTCTAT GGCAAACTAT GTTGCTAAAT TAAGCTTACC TAAAAATTAT AATCATATAA CTGATTATAT GAGTGGATGC TTTGTTTTAA GGTTAAATAA ATGCTTGCCA ATTATATACA AAGTAGATGT TAATGGCTTT AAATTTTTGT ATGAGTTTTT AGCTTTAACT AAAGGAAGGC TATGGGTTGG AGAGGTGCCA TTAAGTTTTC AGCCTAGATT ACATGGGACT TCAAAATTAG ATATATCTAT TGTATGGGAT TTTATGATCT CACTTTTGCA TACTTTGTCT TGCAGAATCC TTCCTAGACG GGCAATAAGC TTTGCAATTG TTGGTTTGAC TGGTGTTGGT GTTCAGTTGG TTGCAACTAA TATAATGATG AGATTTTTTC TTTTAACCTT TAAAGAAGCA CTTCCAATAG CTGTAATAAG TGCGGCAACC TCAAATTATC TAATTAACAA TGCATTGACT TTCAGGTCAA AAAGGCTAGC TGGCATAAGT TTATTAAAAG GACTTCTTAA GTTCCTTTTA GTTGCATCTT TCCCAGTAAT AGCAAATGTT GGACTTGCCA CAGCGTTTTA TAACATTGTG TCTGAGAATG AGACTTGGGC ACAGCTTGCT GGCATCTCAA TAGTATTTAT CTGGAATTAT GTTGCTTCTT CAAGGTTTGT CTGGAATACT CCTTAA
|
Protein sequence | MPDISISNLR LSIVLPTFRE RENIPQIVEQ LFKLGSYYEL EILIIDDDSR DGTFDFVKNL SIEDHRVRII RRVGRSGLAS AIKEGFLNAT GDIVALMDAD GQHQPVDVFK AIDYLISNNY DLVIGSRFLE RANILGLSQR RVGGSSMANY VAKLSLPKNY NHITDYMSGC FVLRLNKCLP IIYKVDVNGF KFLYEFLALT KGRLWVGEVP LSFQPRLHGT SKLDISIVWD FMISLLHTLS CRILPRRAIS FAIVGLTGVG VQLVATNIMM RFFLLTFKEA LPIAVISAAT SNYLINNALT FRSKRLAGIS LLKGLLKFLL VASFPVIANV GLATAFYNIV SENETWAQLA GISIVFIWNY VASSRFVWNT P
|
| |