Gene Information |
Locus tag | P9211_07781 |
Symbol | |
ID | 5731821 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 684462 |
End bp | 685622 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 641285142 |
Product | cellulose biosynthesis protein CelD |
Protein accession | YP_001550663 |
Protein GI | 159903319 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
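The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon (the gene sequence in this record ends in TGA). The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 684462
end_bp = 685622
reported_gene_length = 1161
reported_protein_length = 386

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last codon is the stop,
# which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1161 0 386
```

Under these assumptions the computed values agree with the Gene Length (1161 bp) and Protein Length (386 aa) fields.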
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0132299 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTCT TATCAATAAA GTCATTTAGT GCCACAGATC CTAGATTAGA ATTTGTTTGG AAAAGAATAG AATCAGAATG CGAATGTACA GTATTTCAAA GCTATGAATG GTTTCAAAGA TGGTATAAGT TTGAATTAAA AAGACAAAAA AAGCTTAAAA TCTTCATCTA CATAGTCTAT AGAAAACAAG ATGATCCTTT AGCTCTATTT CCCCTTGAAT TAACAAGTCA TGGTCCAGCT AATTTAATGA GGTTTGCAGG AAGAGATTTA GCAGAGTATA AATTACCTTT AATCATAGAG AACAAGCTAA CTTCAGAAGA AAAAGTTAAT TTATGGAAAA TCGTTATTAG TGAAATGCCA CCACATGATG CATTTCTGAT AGAAAGGTAT CCTTTAAAGC ATCAAAATAT TCCCAAACAA ATGAGTCTTG AAAAAATAAA TGCATTGAAA GCTGGAGAAG TCGAAAGCGT ATCTAGCTGT AAACATGAAA ATATAGACTT ATTCTTTAAC AATATTTCTA AACGCATGGT TAAAGATAAT AAGCGATACA CAAGAAGACT TAAAGAATAT GGAAAACTAG AGTATAAGTT TCTATATACA GAAGAAGAGT ACACTGAAAT TTTTACTAAG ATTTCAAATT CATTGGAAAG AAAAGCAAGG TCTCTAGGGA TATCATTTAC AGAAAAAATC TACTCATTAA TAAATTTCTA TACATCCTTT AACGACTATT CTGATGCTTC TAATTCAAGT TCATACCTCA TAGCTATAAC ATTAGACAAA CAAGTTCTTG CTGCTTGCTG GGCTCTCAAA TATAAAGAAG TATTATATTA TCTCTATCCA GGATATATGG ATACGCCTTA TATAAAATAT TCTCCAGGGA GACTATTATT AGAACATCTA ATTAATATAG CATTTGAAAA TGACGTAAAG GCGATAGATT TTACACTTGG TAAAGAGCCA TATAAATATA ATTGGGCTGG GAAAGATAAT TTATTAGTTG ATTACGTAAA ACCAACTACA TTTATAGGAT TAATATACTA TTTTTATCGA AGACTAGTAA ATTCTATAAA GACCAATTTA TATATTTTAA CTATAGCAAG ATTCGCTATT AGAAGGTTGG AATCAGTTCT TAACCTTTTC AAAACCATAT CTAAAAAATG A
|
Protein sequence | MKLLSIKSFS ATDPRLEFVW KRIESECECT VFQSYEWFQR WYKFELKRQK KLKIFIYIVY RKQDDPLALF PLELTSHGPA NLMRFAGRDL AEYKLPLIIE NKLTSEEKVN LWKIVISEMP PHDAFLIERY PLKHQNIPKQ MSLEKINALK AGEVESVSSC KHENIDLFFN NISKRMVKDN KRYTRRLKEY GKLEYKFLYT EEEYTEIFTK ISNSLERKAR SLGISFTEKI YSLINFYTSF NDYSDASNSS SYLIAITLDK QVLAACWALK YKEVLYYLYP GYMDTPYIKY SPGRLLLEHL INIAFENDVK AIDFTLGKEP YKYNWAGKDN LLVDYVKPTT FIGLIYYFYR RLVNSIKTNL YILTIARFAI RRLESVLNLF KTISKK
|
| |
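The GC content and protein sequence fields can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: it assumes the gene sequence is already given on the coding strand (it starts with ATG and ends with TGA even though the gene lies on the minus strand), and it uses the amino acid assignments of NCBI translation table 11. The helper names `clean`, `gc_percent`, and `translate` are hypothetical. Alternative start codons, which table 11 allows, are not handled because this gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for translation table 11, with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces that separate the 10-letter blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "ATGAAACTCT TATCAATAAA GTCATTTAGT"
print(translate(prefix))  # MKLLSIKSFS
```

The translated prefix matches the first block of the protein sequence in this record. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and protein sequence fields to be compared in the same way.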