Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_00701 |
Symbol | |
ID | 4716752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 74367 |
End bp | 75422 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640077767 |
Product | hypothetical protein |
Protein accession | YP_001008465 |
Protein GI | 123967607 |
COG category | [S] Function unknown |
COG ID | [COG1873] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.273319 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTTGT CTAACACATC TGCTAATAAG AATCTCCCTA ACTCGGTTCC TAGTGAACGT TTATGGTTAA GGTCAGAATT AATGGGAACA CAAGTGATAA CTACTGATAC TGGAAGACGG CTAGGCGTAG TTGGCGAAGT TGTTGTTGAT ATTGATAGAA GAGAGGTGGT CGCTTTGGGA CTAAGAGATA ATCCACTTAC AAGATTTTTA CCAGGTTTGC CAAAATGGAT GCCTTTAGAA AGTATTAAGC AAGTTGGAGA TGTCATATTA GTTGACTCCC TAGATTCTTT AAGTGAAAGT TTTTCTCCAG AAAGGTATGG GAAGGTAATT AATTGTCAAG TGATTACAGA ATCTGGACAA CTTCTGGGAA GAGTTCTTGG CTTTTCTTTT GATATTGAGA CTGGGGATTT GATATCTCTT GTTATGGGTG CTGTTGGTGT TCCGCTTTTA GGCGAGGGAG TTTTAAGTAC TTGGGAAATA CCTGTTGAGG AAATTGTAAG TAGTGGTACT GATAGGATTA TTGTTTATGA GGGTGCGGAA GAAAAACTGA AGCAACTAAG TAGTGGACTA CTTGAGAAAC TAGGAGTCGG GGGTTCTTCA TGGGATGAAA GGGAAGTCAA TGGATACTCA GCAAATCTTG TACCTGTTGA GAATCAGTTA CTTTCAGGTT CTGAATCAGA ACAGCAAAAC AATTTGGTCG AGGAATATGA AGAAGTTGTT GAACAAGATG ATTATGAAGA TGATTATGAA GATGATTATG AAGATGAACT TGAATATATT GAAATAAAGG GTTCTGAAGC AGAACTAAAT AATAGAAAAA AGCTATACAT GGATAATGAT GATTCTGATC AGATCCAGAA TCAAAATATT GTTAATCAAA TAAATGAAAA AAATAATATT GATTTAAAGC AAAAAAAACA ATCAACTACT AATTTAGCTT CGAAAAGACC AATTCAAAAT GCAACTGAAA CTTTAGATAT TGAACCACTA AACGAACAAA ATTTAGTTCA AGATAATAAA AAATCAGAAA AGTTTGAAAT TGATGACCCC TGGTAA
|
Protein sequence | MSLSNTSANK NLPNSVPSER LWLRSELMGT QVITTDTGRR LGVVGEVVVD IDRREVVALG LRDNPLTRFL PGLPKWMPLE SIKQVGDVIL VDSLDSLSES FSPERYGKVI NCQVITESGQ LLGRVLGFSF DIETGDLISL VMGAVGVPLL GEGVLSTWEI PVEEIVSSGT DRIIVYEGAE EKLKQLSSGL LEKLGVGGSS WDEREVNGYS ANLVPVENQL LSGSESEQQN NLVEEYEEVV EQDDYEDDYE DDYEDELEYI EIKGSEAELN NRKKLYMDND DSDQIQNQNI VNQINEKNNI DLKQKKQSTT NLASKRPIQN ATETLDIEPL NEQNLVQDNK KSEKFEIDDP W
|
| |