Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_01381 |
Symbol | tagG |
ID | 4776410 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 152090 |
End bp | 152920 |
Gene Length | 831 bp |
Protein Length | 276 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640085637 |
Product | hypothetical protein |
Protein accession | YP_001016158 |
Protein GI | 124021851 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1682] ABC-type polysaccharide/polyol phosphate export systems, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGTGC CTCGTGCTGA AGATCAGCTA GCTCTTATTG AAACTGCTCC TGCGCAAGGT TTTGGCTTTA TTTTTGTGTT GCGCACGGCT CGGGCCTGGT GGTATAGCGC CTGGCTGCGA ACCTTGGCGC GATTTAGCCG CACTTACCTT GGCAGCTTTT GGCTTGGATT AAGCAATCTT TTGAGTGTGG CCCTTTTGGG GGCAGTGTAT GGCGCGGTTT TTAATATTGC CAATCCTTGG GACTATGTTG TTTATCTGGG ACTGGGTTAT ACGATCTGGG GTTTTATTAG CTTGACTGTG ATTAGTGCCA GCACGCTGTT TTCTACGCGT CGTGATCAGT TGATCAATAA TTCTCTGCCA GCAATTTTTT ATTGCCTTGA GGAATGGGCT TTTCAGGTTC AGACATTTGC ACAGAGTTTG GTGATTGTGA TGATCCCAAT TCTTTTTATT AAGCCGATGC TGTGGCTGCA TGCATTGACG AGTATTTGGT TGCCACTCAT CAATGTTTTG CTTTTTTGTT TTTGGATAAC GGCTTTGATG GCTGTGTTGG GTTCTCGCTT TAAGGATGTG GCTCAGCTGA TGCCGATCTT GATGCAATTA ACTTTTTTGC TTTCTCCAAT CCTGTACAAG AAAGAAGGTT TAGGTAGCAT GGCATTCGTT GCAGACCTCA ATCCTTTTTA TCGTATCCTT GCACCTTTGC GTACAGCTTT GATCGATGGA GATGTTTATT TTAGGGCCGA ATTTGCTACG CTTTTTGTGA ATATAATTTT AATATTGATT GCTTGCTATT GGCTGAAGCG CGTTCGCTAC AATATGCCTT TTTGGGTCTA G
|
Protein sequence | MRVPRAEDQL ALIETAPAQG FGFIFVLRTA RAWWYSAWLR TLARFSRTYL GSFWLGLSNL LSVALLGAVY GAVFNIANPW DYVVYLGLGY TIWGFISLTV ISASTLFSTR RDQLINNSLP AIFYCLEEWA FQVQTFAQSL VIVMIPILFI KPMLWLHALT SIWLPLINVL LFCFWITALM AVLGSRFKDV AQLMPILMQL TFLLSPILYK KEGLGSMAFV ADLNPFYRIL APLRTALIDG DVYFRAEFAT LFVNIILILI ACYWLKRVRY NMPFWV
|
| |