Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_24161 |
Symbol | |
ID | 4778905 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2124332 |
End bp | 2126317 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640087937 |
Product | hypothetical protein |
Protein accession | YP_001018414 |
Protein GI | 124024107 |
COG category | [R] General function prediction only |
COG ID | [COG0457] FOG: TPR repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTATTG ATCTCTTGAG CACTATCTCA AGGCAGTTTG GGTTCAGATC ATATTGCTGT GCTGGGACGC ATAGATTTGG AGTAATATGC ATTAGAAGCG GGAGAAGCCT TTTGAATCAG GAAGAGGTTA TTCGGAAGCT GCAAGCTGGC ATGCAAGCCT TACAAGGTGG AAATCTTGAT ACGGCTGAGG AGATCTTCCG TTCCATTCTC ATTGTTGATG CAAAAGAGAT GCATTCTCTT CATTTCTTAG GTGTAGTTCT TTGTAAAAAG GGTGATATTT TCAATGGCGC TTTACTGATT GAAAGGTCAA TCTTGATTGA CCCTTCTCGA TTTTACCCTT ATTACAATTT GGGAAAATTG CTTGTTGCTG ATAAGCAGTA TGGGCGTGCT ATTCCAGTTT TGAAAGAGGC ATTAAAGCGA GATCAAAAAA GTTTTTCCGC ATGGAATCTA TTGTCTAAGG CTAGTTTTCA TGATGAGGAT TTTGCTGGTG CAGTTGACTC TGGCCAGCGT GCTTGTGAGC TAAGCCCAGA TAATCCAGAA GTATTTTTTG ATCTTGGTGT TTACTTTAAT GCTCTTAAGC AGCTGGATAA AGCCGTTAAT GCTTACCAAA AGGCCATTGT ATTTAAACCT GATTATCTTG AAGCATGGGT CAATATGGGT AATATATTGA CAAAGCAGGG AAAGCTTGAG GGAGCGATTA GATGTTTCCA AAAGGTGATA GATCTGAATC CAGATCTTGT CGATGCGTAT TTTAATATGG GTAATATACT AAAGGATCAT ACCAAATTTG AGGAGGCGAT TGGGAGTTAT CGGAAAGCGA TAGATTTGAA ACCAGATTTT GCAGATGTCT ATTTTGCTTT GGGTATGGCA TTGAAAGAGT TAGGTGATAT TGACTCTGCT TCTGCTGCCT TTGAAGACTA TTATCTTAGG GAACCAATTG TTCAAACTAT CTCGCTGCCT GCCATTGCCT ATGAGCCAGC GATGTTCATT TCTAGAGGTA AGCAGGTCAT CGTTCCTCAA TCTACAGACT TTATTCCATC GTATTTGTCT GATGAAATCC CTTTTGGGAT GCATTTAATG TATTTGCATA TCCCGAAAGT AGGTGGCGTA CGCTTCGGAA ATCCAATATC GGATTGTATA CGAAGGCTTT TTCTTGGAGA GAGTTTAGAC AAGTTTCAAG ATCTTTTGTC TTGCATCTTT CCATCGCGCG ACCTTTCTTT CATAGTAAGC CCTCGCATCG ATAGTGTGCC TATCAGAGAT GGAATAATAA GAGCATTCGA TTCTTGTGGC TTAGATAGTT CGGATTTTTC TTTCTTAATG CCGCATGGGA TTTCATCAGA TGAGTTATTT GTGATGTTAA ATAACTTAGG GGTCTTTCCG ATTCGACTGG CGACTTGGCG TGATCCTGAG AAACGTTTAA TATCTGCCTT GAATTATCTA TGGCGTATTT CCAAAGGCAA CATTAACAGT ATTCGCGATA GAATCAACTC GCGAGATCCT TTTCTTGATA ACGCTATATA TCGTGCTTGC TTTAGTACCT TTAATCGCCC CCTGGAGTCT TATTCAGCAT CTGAATTAAA AGTGGATTAC TTAATTGATA TTGGTGATTT CTCGGTAATG AATCAAGTGA TGAGTATATA TATGTCTCGA TGTCGTCTGC CAAATATAGT CATTAATAAG TCTGTCAATG TTACATCTGA GGATGACAAG ATGGAACATT CTTGCTTGCT TCAACTTGCA GAGGAGTGTA TAGACAAGGG TTTTATTGCT TATGACTCAA GTCCTGTTAT TCGAGATCTG GTGAAGAAGA AGTTACCTAC TGTATTTAGA CATGACTTGG ATCATTCTGA GGATACACTG AATCCGCTTA CCTTTGTGGT GAATTCAACC ACAAATATAG ATACGTCTGC TCAAATGTAT TTTTCTCTTA CTGAGGATCT AACCTCTAAG ATGGGGCAAG ATTCCTTAAA AGATATTTAC TCCTGA
|
Protein sequence | MIIDLLSTIS RQFGFRSYCC AGTHRFGVIC IRSGRSLLNQ EEVIRKLQAG MQALQGGNLD TAEEIFRSIL IVDAKEMHSL HFLGVVLCKK GDIFNGALLI ERSILIDPSR FYPYYNLGKL LVADKQYGRA IPVLKEALKR DQKSFSAWNL LSKASFHDED FAGAVDSGQR ACELSPDNPE VFFDLGVYFN ALKQLDKAVN AYQKAIVFKP DYLEAWVNMG NILTKQGKLE GAIRCFQKVI DLNPDLVDAY FNMGNILKDH TKFEEAIGSY RKAIDLKPDF ADVYFALGMA LKELGDIDSA SAAFEDYYLR EPIVQTISLP AIAYEPAMFI SRGKQVIVPQ STDFIPSYLS DEIPFGMHLM YLHIPKVGGV RFGNPISDCI RRLFLGESLD KFQDLLSCIF PSRDLSFIVS PRIDSVPIRD GIIRAFDSCG LDSSDFSFLM PHGISSDELF VMLNNLGVFP IRLATWRDPE KRLISALNYL WRISKGNINS IRDRINSRDP FLDNAIYRAC FSTFNRPLES YSASELKVDY LIDIGDFSVM NQVMSIYMSR CRLPNIVINK SVNVTSEDDK MEHSCLLQLA EECIDKGFIA YDSSPVIRDL VKKKLPTVFR HDLDHSEDTL NPLTFVVNST TNIDTSAQMY FSLTEDLTSK MGQDSLKDIY S
|
| |