Gene Information |
Locus tag | A9601_07871 |
Symbol | |
ID | 4717493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 684081 |
End bp | 685400 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 640078501 |
Product | hypothetical protein |
Protein accession | YP_001009180 |
Protein GI | 123968322 |
COG category | [S] Function unknown |
COG ID | [COG4487] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
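The coordinates and lengths listed above are mutually consistent. The sketch below checks this under two assumptions that the record does not state: coordinates are 1-based and inclusive, and the protein length excludes a single terminal stop codon. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one stop codon that is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(684081, 685400)
print(length, protein_length(length))  # prints: 1320 439
```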
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0459178 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGATA TTAAATGTCC TTCATGCGGC AAAACTTTCC GGATTGATCC CAGCAGCTTT GAAGAAATAC TTCTTCAGAT AAAAGACGAG GAGTTTAACA AACAAATAAA AGAAAGACTT ACTCTAGCTG AAGAAGATAA TAAAAAAGCT TTGGAAATTT TAAAACGAGA GTTAAAAATA CAGTTAATAG AGCAAAATCG TATTAAAGAG TCCGAAATCC AAACTCTTGA ATCTAAATTA AAAATAGCTG AAGAAAAGAA AACAAATGCT CTTAATGATT TAAAAAATCA AGCAACAAAT AAAATTAATT CACTGAATAA TGAATTAATC AAGTTAAAGG ATGAAATTAA AAATCAGTCT TTAATTTCAG AATTATCCTT AAAAAATAAA GTTAGTGAAG CTGTTAATAA TTTAGAAAAA GAAAACTCAT CATTAACAAA TTCCATTGAA AAGATGAGGC TTGAACATTC AATTAATGAA AAATTAATTG AAGAAAAGTT TAAAAGCAAA ATTAGTGAAA GGGACTTGAC TATTCAGGAG TTAAGAGAAA TGAAATCCAG ATTATCTACA AAGATGATAG GAGAAACATT AGAAATCCAT TGCGAAACCC AATTTAATCT GAATCGTGCC TCTGCGTTTA AAAACTCATA TTTCGAAAAG GATAATGATG CCACTTCTGG AAGTAAAGGG GACTATATAT TTAGAGAATT TGATGAAAAT AAAACTGAAG TTGTATCAAT AATGTTCGAG ATGAAGAATG AAAGTTTAAA TGGAACTAAT AAAAGAAAAA ACGAAGATTT TTTAAAAGAA TTAGATAAAG ATAGAAGGCA AAAATCTTGT GAATATGCAG TACTCGTTTC TCTATTAGAA CCAGATAGTG AACTATATAA TGCTGGCATA GTAGATGTTT CTCATAGATT CCCAAAAATG TTTGTCATAA GACCTCAATT TTTCTTACCC ATTATTTCTC TGTTAAGAAA TGCATCTATG GAAACCTTAA AATACAAATC ACAAATTGAT TTAATGAAAC GTGAGAATTT TGATATAACT AATTTTGAAA GTACTCTTGA GCAATTCAAA AATGCAGTTG GTAAAAATGT TTCTCTTGCC CAAGATAGAT TTAATGATGC AATTTCAGAA ATTGATAAAT CAATAACTCA TTTACAAAAA ACTAAGGAGG CTTTAGTTCT CTCAAAAAAA CATCTTTTAT CTGCTGACAG CAAATCTCAA GATTTGACAG TAAAAAAATT AACTAGAAAT AACCCAACCA TGAAGAAAAA GTTTAATGAT TTAAATAATT TCGAAGATGA AGTAGCCTAA
Protein sequence | MKDIKCPSCG KTFRIDPSSF EEILLQIKDE EFNKQIKERL TLAEEDNKKA LEILKRELKI QLIEQNRIKE SEIQTLESKL KIAEEKKTNA LNDLKNQATN KINSLNNELI KLKDEIKNQS LISELSLKNK VSEAVNNLEK ENSSLTNSIE KMRLEHSINE KLIEEKFKSK ISERDLTIQE LREMKSRLST KMIGETLEIH CETQFNLNRA SAFKNSYFEK DNDATSGSKG DYIFREFDEN KTEVVSIMFE MKNESLNGTN KRKNEDFLKE LDKDRRQKSC EYAVLVSLLE PDSELYNAGI VDVSHRFPKM FVIRPQFFLP IISLLRNASM ETLKYKSQID LMKRENFDIT NFESTLEQFK NAVGKNVSLA QDRFNDAISE IDKSITHLQK TKEALVLSKK HLLSADSKSQ DLTVKKLTRN NPTMKKKFND LNNFEDEVA
| |
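The gene sequence is shown on the coding strand (it begins with ATG although the gene lies on the minus strand), so it can be translated directly. The following is a minimal sketch, not the database's own pipeline: it assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). `CODON_TABLE` and `translate` are hypothetical names.

```python
from itertools import product

# Standard codon table in TCAG order; assumed to apply to translation table 11.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAAAGATA TTAAATGTCC TTCATGCGGC"))  # prints: MKDIKCPSCG
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces) is one way to cross-check the two fields.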