Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_28431 |
Symbol | |
ID | 4777546 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2512518 |
End bp | 2513642 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640088366 |
Product | TRAP-T family tripartite transporter, substrate binding protein |
Protein accession | YP_001018838 |
Protein GI | 124024531 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG4663] TRAP-type mannitol/chloroaromatic compound transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.986733 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACGTC GAAAACTACT CGGGTCTGGT GCTACGGCTG TCTCAGCAGC CGTCGGCGCA GGCATTCTCA GTGCCTGCAC AATCCGTAAA GAGGAAGAAA GCGGCAGTAA CAGCAGCCAG CCGAAGGTGC GCTGGCGCAT GGCCACCAGC TGGCCCCCCT CACTAGACAC GATTTACGGG GCGGCCGAAA CAATCAGCCA ACGGGTGAAT GAACTAAGCG GCGGCAACTT CCAAATCAAA ACCTATGCCG CCGGTGAGCT CGTACCAGGC CTAGAAGTCC TTGATGCGGT TCAGGCGGGC TCCGTCGAAT GCGGTCATAC CGCCAGCTAC TACTACATCG GCAAAAATCC CAGCTTTGCC TTCGGCACTT CCGTACCGTT TGGCCTTAGC GCACAACAAC AGAACGCTTG GCTCTACGAA GCAGGCGGTA ACGACGCCAT CAACAACCTC TATGCCGATT TCGGAGTGAT CAGCTTCCCC GCTGGCAACA CCGGTGCACA AATGGGCGGA TGGTTCAAGC GCAAACTCGA GGGCCTCAGT TCTCTACAGG GACTCAAAAT GCGTATCCCT GGCCTGGGCG GCAAGGTGCT TGCCCAGCTG GGTGTGAACG TCCAGGTTTT GCCTGGTGGA GAGATCTACC TGGCTCTAGA GCGTGGCGCG ATCGACGCCG CTGAATGGAC CGGCCCATAC GATGACGAAA AGCTTGGCCT AGCCAAAGCT GCACGCTTCT ACTACTACCC AGGTTGGTGG GAACCCGGCC CCACCCTTGC AGCTCTGGTC AACCAACAAG CCTGGAGCAA GCTGCCGAGC GAATATCAAG CGATGTTCAA CACCGCCTGT TACGAAGCCA ACCTCACCAT GCTCAGCCGA TACGACAACC TCAACGGGGC TGCTCTGCAA AGGCTTTTAA AGGGCAACAC CGAGCTGGTT CCCTATGACC AAAGCATCCT TAAAGCTGCC CAGGAAGCAG CCTTCCAGCT CTATAGCGAT ACCGCCGCAA AAGATGCCAG CTTCCGCAGC CTGCTTCAGC AATGGCAAGG CTTCCGCAAG GAGGTTTACG CCTGGAACAA CGTCAATGAG TTCTCATTCG CTCGCTTCAG TTACGACCAG CTGCAAGGAA CTTGA
|
Protein sequence | MQRRKLLGSG ATAVSAAVGA GILSACTIRK EEESGSNSSQ PKVRWRMATS WPPSLDTIYG AAETISQRVN ELSGGNFQIK TYAAGELVPG LEVLDAVQAG SVECGHTASY YYIGKNPSFA FGTSVPFGLS AQQQNAWLYE AGGNDAINNL YADFGVISFP AGNTGAQMGG WFKRKLEGLS SLQGLKMRIP GLGGKVLAQL GVNVQVLPGG EIYLALERGA IDAAEWTGPY DDEKLGLAKA ARFYYYPGWW EPGPTLAALV NQQAWSKLPS EYQAMFNTAC YEANLTMLSR YDNLNGAALQ RLLKGNTELV PYDQSILKAA QEAAFQLYSD TAAKDASFRS LLQQWQGFRK EVYAWNNVNE FSFARFSYDQ LQGT
|
| |