Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_13141 |
Symbol | |
ID | 4912828 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 1100559 |
End bp | 1101647 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640160903 |
Product | hypothetical protein |
Protein accession | YP_001091538 |
Protein GI | 126696652 |
COG category | [R] General function prediction only |
COG ID | [COG2516] Biotin synthase-related enzyme |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGAAA TAGGTAAAAT TATTACTGAT TTGCAAGTTA ATGGGATAAG TTCAACTCCC AAAAAGGGCA ACAGAGGTAG GAAAGGAGGA GCTGGTCCAT CGGATCATAG AGCTTTAACA ATTGAAGGAA AAACTGTAAT GGTTCCCGTT TATAATCACT TATCTAAGAA GTCTAATTAT CAACTTTCAG AAGAGTCTGA CGGACAATTT ATTCTCCAAA ATAGTGAAGA TTCAAAAATC AAGGAATTAT CTACAACAAA AGAACCAAAT TTTTATTCTT TAAAAACAAA AGATGGTATA CCTTATAAAT CGATTGCTCT TCTTCATAGC AAGGATGTAT TAGCCACAAC TATTCTCCAA AAATGTATTC GTTTCAGAAA TAGAGAAGAA TCTTGCCAGT TTTGTGCAAT AGAACAATCT CTGAAAAATG AACAAACCAT CGTAAGAAAA ACTCCAGATC AAATTGCTGA AGTTGCAGAA GCAGCTGTAA GACTTGATGG AATAAAGCAA TTAGTAATGA CAACAGGGAC CCCCAACACT AGCGATAGGG GAGCGAGAAT AATGGCAGAA GCAGCTAAGG CAGTTAAGGC TAAAGTTGAT ATTCCAATCC AAGGTCAATG CGAACCTCCT GATGATCCTA TTTGGTTTCA AAAAATGAAA GACTCAGGTG TAGATAGTTT AGGCATGCAT TTAGAGGTTG TAGAGGAGGA GATAAGAAAA AAAATTCTTC CCGGCAAATC TGAAATTCCT CTTGAAAGAT ACTATAAATC CTTTGAAGAA AGTGTCGCAG TATTTGGAAG GGGAGAAGTT TCTACATATT TATTAGCAGG ATTAGGTGAT AGCAAAGAAT CTCTAATAAA TTGCAGTAAA AAATTGATAT CTATAGGAGT TTATCCTTTT ATAGTTCCAT TTGTGCCAAT AGCAGGAACT CCTCTAGAAC ATCATCCCTC CCCAAGCACT GATTTCATGA TTGATATTTA TCAATCAGTC TCGCATTTAC TAAATGAAGG CAACATAAAA TCCGATGAAA TGTCAGCTGG TTGTGCCAAA TGCGGTGCCT GCTCAGCTTT ATCCCTATTT GAGAGTTAA
|
Protein sequence | MSEIGKIITD LQVNGISSTP KKGNRGRKGG AGPSDHRALT IEGKTVMVPV YNHLSKKSNY QLSEESDGQF ILQNSEDSKI KELSTTKEPN FYSLKTKDGI PYKSIALLHS KDVLATTILQ KCIRFRNREE SCQFCAIEQS LKNEQTIVRK TPDQIAEVAE AAVRLDGIKQ LVMTTGTPNT SDRGARIMAE AAKAVKAKVD IPIQGQCEPP DDPIWFQKMK DSGVDSLGMH LEVVEEEIRK KILPGKSEIP LERYYKSFEE SVAVFGRGEV STYLLAGLGD SKESLINCSK KLISIGVYPF IVPFVPIAGT PLEHHPSPST DFMIDIYQSV SHLLNEGNIK SDEMSAGCAK CGACSALSLF ES
|
| |