Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_18251 |
Symbol | |
ID | 5731561 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 1652235 |
End bp | 1653563 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641286212 |
Product | putative p-aminobenzoate synthetase |
Protein accession | YP_001551710 |
Protein GI | 159904366 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.240913 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTATTC CAATTAGAAA GCTTTGCGAG TGGTGTGATC CTGTTTACGT TGCTGAAAAT TTAATTGCTA ATTTTGGGGA AGATGGATTC ATCTGGCTAG ATAGTGATGG AAGCAAAATA GGAAGGTGGA TTGTTTTAGC AGCGGAGCCT ATAGATCAAA TTTGCTCTAG GGGCTTGCCT AGCCAATATT GTAATGCTAA TCCTTTTGAT TCTTTAAGAA GTTTAGAACC TGGCCATTGG ACTGGCTGGT TAAGTTATGA AGCTGGTGCA TGGATAGAGC CAAATAATCC TTGGAAAGAA GACTCGATGG CAACTTTGTG GATAGCAAGG CATGATCCTG TATTAAAATT TGATCTGAAA GAGCAAAAAC TTTGGATAGA AGGTTGTGAT CCCAAACGAT TATTAAAATT ATTTAACTGG ATAAAAGACC TTAAGAATAA TGAGGCGAAG CAAACCTCAG TACAATCAAA AAGTCCCATA CGTATTCCAC TTCGGTCTTG GGAATGGTTA ACCAATGAAA AAGAATACGC CGAAAAGGTT GAGATAATTC AAGAGTGGAT CAAAAAAGGT GATATTTTTC AAGCAAGCCT CTCTGCTTGC TGTAAAGGTA AAAAGCCACA AAATATGCTT GCCATTGATA TATTCAAAAA ATTGAGGCAT CATTGTCCCG CCCCATTCTC AGGAATTATT ATTGCGTCAG GAGAAGCAAG CGGTGAAGGC GTAATATCTA CCTCCCCTGA GCGATTTCTC AAAGTACTAC CTAATGGAAC AGTAGAAACA CGTCCTATTA AAGGAACTCG CCCCCGTCAA AGCAATGCAC AGAGGGATGC TGATATGGCA GCTGATTTAA TATGTAGTCA AAAAGATAGA GCCGAAAATG TCATGATTGT GGACCTTCTA AGAAATGATC TAGGTAAAGT TTGTCAACCA GGTAGTATCC AAGTCACAAA ATTAGTTGGA CTAGAAAGCT ACTCTCAAGT ACATCATCTA ACATCTGTAA TAAGTGGAAC TCTTAGAGAC GGCAAAACAT GGGTAGATCT CCTTGAATCA TGCTGGCCAG GAGGTTCAAT CAGTGGTGCA CCTAAGCTAA GAGCATGTCA AAGATTATAT GAACTTGAAC CTATTGCACG AGGTCCATAC TGTGGCTCAT TCATACATGT TGATTGGGAT GGTCAATTTG ATAGCAATAT TTTAATTCGA TCTCTCATGA TTAATAAATC GAATCTTCGT GTAAATGCAG GTTGCGGGAT CGTTGCAGAT TCAGATGCTA ACAATGAAGC GGAAGAACTG ACCTGGAAAT TATTGCCTTT ATTAAAAGCA TTGGATTGA
|
Protein sequence | MIIPIRKLCE WCDPVYVAEN LIANFGEDGF IWLDSDGSKI GRWIVLAAEP IDQICSRGLP SQYCNANPFD SLRSLEPGHW TGWLSYEAGA WIEPNNPWKE DSMATLWIAR HDPVLKFDLK EQKLWIEGCD PKRLLKLFNW IKDLKNNEAK QTSVQSKSPI RIPLRSWEWL TNEKEYAEKV EIIQEWIKKG DIFQASLSAC CKGKKPQNML AIDIFKKLRH HCPAPFSGII IASGEASGEG VISTSPERFL KVLPNGTVET RPIKGTRPRQ SNAQRDADMA ADLICSQKDR AENVMIVDLL RNDLGKVCQP GSIQVTKLVG LESYSQVHHL TSVISGTLRD GKTWVDLLES CWPGGSISGA PKLRACQRLY ELEPIARGPY CGSFIHVDWD GQFDSNILIR SLMINKSNLR VNAGCGIVAD SDANNEAEEL TWKLLPLLKA LD
|
| |