Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_20991 |
Symbol | |
ID | 4776836 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1859890 |
End bp | 1861350 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640087607 |
Product | hypothetical protein |
Protein accession | YP_001018099 |
Protein GI | 124023792 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTTCAA TCCTTTATCA CCTTTTAAAT CCTCAAAATC TTGAGAGAAA GAGCATCTCT AACCGTGGTT ATGCTGAATA CTTAGAGTAC GAATCCTGGC TTGATAGTAA GACTTTACCA TGGGCGTGGA ATTTGGCTGA TCATGAAGAT CACTTTCATA AAAATGGCCG TATCAGATTT AATGTATCTG GATGGAATAA TGAGCAACTT GCCATACCCC CAGATTCAGA TGGTTTGATC GATGAGTATA GAAATCTTGC TCGAGAAGCA TTTAAGCTCT TTGAGGAGAA CCTGGGCATA GACTTTGTCG AAACTAATGA GGAAGATGCC GACATCTTCT TCATCGATAA TCATAGGGAT GGTGGTTACT GGTCATATAC TCCATTAGAG GAAAGAAGTC TTTACTCCGA GAAGGCTTAT CCGGAACACA GCTTTATCAA TATTACAGTA GATGACTCGT TATCGGCGAA ACTGCATGGT GGGCTTTTTG CTACATTCAT TCATGAGATT GGCCATAGCT TGGGCCTTGG CCATGATGGC AATTACAACT TTGATGACTC TAACCCTAAA GCTAGTAATT ATGAAAATGC GGGCCCATAC CTGAACTCAT CGCAGCAATC GTCAATGATG TCGTATTTTA GAGTTCCATC TAAAGACGCA TTGAGAAGCT GGGGAGTTGA AATGATTAAC CCTAATATTG CTAATGCTCA TTTTGAACAT ACAAATACGC CAATGCCTAT TGACTGGTTG GCTTTAGACA ACATTTATAA ACAGCAAGGA TACGGTATAT CCAATTCCTT CAATGGCGAT ACTCTCTATG GCGCGGATAC TTCTATTCCT GCTGAAGTCA GCAACGTTTG GAACCAATTT GCTGATTTTA TTCATCACAA TGCATTTACC ATCGTTGATG GCTCGGGTCA TGACATCATC GATGTGAGCT TTTCTGGATT TGATCAGACT ATTGATCTTC GAGCCACTGA TCCTAATTCT GATTTTTTAT ACCCATCAGA CGTCAATGGA CTTAAAGGTA ATCTATACAT AGCAGCCAAT ACAGAGATTG AAGAGGCAAT TACAGGCTCC GGAAACGATC TTTTGATTGG CAACAAATTC AATAATATCC TAGATGGAGG ATCCGGTTCT GACGAGCTTT GGGGATTTAA AGGGGCGAAC ACTTTAAAGG CTGGTGATTT TGATGATGTC ACCGATAAGT TCTATATAAA AGCAAGTAAT ATGGTTGAAC AGTGCGATTT TCTTTTTCAG GTTGACTCTT CTGATAGGAT TTATATAGAC ACAAGCGATG ATGGACAGAT CACCTATCAA GATCATATTG AAGACCCCAA TGGTAGCAAT TATGTGGGTG TTGGCATCTT TGTAGATGCT GTTTTAGAGG CTTTAGTCAT TAATTCTGGC TTGACTTCAG ATCAGGTCAA TGATATTACC AAGGGCGGAG ACTTTTACTA G
|
Protein sequence | MISILYHLLN PQNLERKSIS NRGYAEYLEY ESWLDSKTLP WAWNLADHED HFHKNGRIRF NVSGWNNEQL AIPPDSDGLI DEYRNLAREA FKLFEENLGI DFVETNEEDA DIFFIDNHRD GGYWSYTPLE ERSLYSEKAY PEHSFINITV DDSLSAKLHG GLFATFIHEI GHSLGLGHDG NYNFDDSNPK ASNYENAGPY LNSSQQSSMM SYFRVPSKDA LRSWGVEMIN PNIANAHFEH TNTPMPIDWL ALDNIYKQQG YGISNSFNGD TLYGADTSIP AEVSNVWNQF ADFIHHNAFT IVDGSGHDII DVSFSGFDQT IDLRATDPNS DFLYPSDVNG LKGNLYIAAN TEIEEAITGS GNDLLIGNKF NNILDGGSGS DELWGFKGAN TLKAGDFDDV TDKFYIKASN MVEQCDFLFQ VDSSDRIYID TSDDGQITYQ DHIEDPNGSN YVGVGIFVDA VLEALVINSG LTSDQVNDIT KGGDFY
|
| |