Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_12681 |
Symbol | |
ID | 4777041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1090781 |
End bp | 1092808 |
Gene Length | 2028 bp |
Protein Length | 675 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640086776 |
Product | hypothetical protein |
Protein accession | YP_001017280 |
Protein GI | 124022973 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.222575 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGACC TTGACCTCCA AAAAATATTT CACAAACTGT TCACATCGCA GCACTCAAGC AACAACAAGT CGCCACATCA AGACCCGAAC AAAGGCATCA CTGCTAACAG CGATTTCGAA AACGAAAAGT TCACACACCC CCCCCAGAAG ATCACTGTCA AATCTATCGA TGACAGTGTT TTTAAGTACG ACACACCATT GGATAAGCAG ATCCTCGAAG CCAGTGGAAT CAATCCAAAG CTCTTTAATC CGAGCTGGCT CAAACCTGGA CTCGAATTTG AACCTCAACT TCCAGAACTC AAAGAACGAC GCATCACCGA CGCTTCAGGA GACAGCAGCC TGGCTGAAGT TTTCGAGGAT GGTGCCCTAA AGATCAATCT CGATCCCAAC AAATATCGAC ACCTCAATCA AATTGACATT GACCGCTTTG GTCGACGCCA CTCACAGCCA ATGAGTCTCG TGGACGTCAC TGTCACCGCG ACCGACATGG AAACGGGTAG CGTCATCAAC CTGGGAACGG AAGTGGTCGA CCCCATCCGT GGAACGGCCC TGGTCAATCT CAAGGACAAA CTGGATGCAA TCTCCACTTG GAGCCTGCGT GATCGATACG CCATTCATGT CGAAACATCA GTGAGCTCAG GATCCACTAA GCATGTTTTC GAATCATTTG AAGAACCGAT CACATTGATC AAACCGCTGC GGTGGGGTTC ACAGCACGTT GGTGATGAAA CTGGCAACGA CCTGATCTAC CAGGAAGTTG CTCCAGCAGC GGGATCAGGC CGGATCTACC GAGGCCGTGG CGGCACCGAT TTCCTCCACC TTGAGAACAT CAACAGTGGC GACGTTGTTT CGTTCAACGG ACGCGCTGGC ATTGATCCGG CCGAGGCTGC GGATCTCGGC CGTCAAGCCT TCTACGGCGG GACCGTTTTC GACAGCCTCA CCCTGAGCAA TGGCGATGAG CTTTATCTCC AAGGAATCGA ACGATTGCGC TTCAGCGATG CGACGATCGA TCTAACACCG AACCTTGATG CCACAAGCGA ACTCCAATGG AACACCAATG TCATGGATGT GCCAGGCGCC TGGCGCTTCA ACACCGGGTC CAACGATGTG GTGCTGGTGT CTCTGGATAG TGGCTTACGC GACGACGGCA GCGGTTCGCC AGCCTTCAAC GCCGAAATCG ACCATGTCGA TTACTTGACC GAAGTCAACA GCTATAACTC CGAGCATGGC CATCGGTCGA TGAGCCTGAT GGCCGCACGC CACGACGGCA CCGGTGTCGC CGGCATCGCT CCAGACAGTC AGCTCTGGAG CTTCAACGCC AAAACAGCGA ATCAGAACGG CATCGGCTTC GAAGCCGCCC TCGAACAAAC ACGTGAGCGA CGCGAGGGGC ACCAACGCAT CGTCTTCCAA GGAGGTGTTC AAGGCGACTG GCCATGGACA TCTGATGGAG CCAGCACAAG GGAAGAAGTC GAAGCAGAAT TCGCTGCCAG CCGACCCTGG GGCTTCTTTG CAATCGCAGC CGGCAATGGA GGTGGCCAGA CGTTCAGTGA GAACGATTAC TTAACGACCG TCTCTGGAGT TGCAGAAGCA GCGGTCAGTT ACGACAACAT CACCAGCGTC GGAGCCCTTG AAAGCGGTCT CCGAACGGAC ATTGACGGAC TGATCAACGC GACCGACGTC AGCCTGGCGA CCTATTCCAA TCGCGGGAGC AACCTCACAC TTGTTGCTCC TACGGACAGC TGGGCCATGG ACGTCGACGG GACCAGAAGC TGGTTTGATG GCACGTCAGC TTCCAACCCG AATCTGGCCG GTGTGGCAGC GCTGGTCTGG AGCGAGAACA ACGATCTGAC CGGCGGCCAA CTGCGAGAAA TCCTGATCAG CAGTGCCATG GACCTCGGCA CCGGGGGGGT TGACACCACC TTCGGCAACG GTCTGGCCAA TGCCGAAAGT GCTATCCGCC GGGCCCATGC GCTGGAAGCC AATCAGGAAC TGGCATTGTT CTGGGACAAC AACTCATTCC TTGCGTAA
|
Protein sequence | MTDLDLQKIF HKLFTSQHSS NNKSPHQDPN KGITANSDFE NEKFTHPPQK ITVKSIDDSV FKYDTPLDKQ ILEASGINPK LFNPSWLKPG LEFEPQLPEL KERRITDASG DSSLAEVFED GALKINLDPN KYRHLNQIDI DRFGRRHSQP MSLVDVTVTA TDMETGSVIN LGTEVVDPIR GTALVNLKDK LDAISTWSLR DRYAIHVETS VSSGSTKHVF ESFEEPITLI KPLRWGSQHV GDETGNDLIY QEVAPAAGSG RIYRGRGGTD FLHLENINSG DVVSFNGRAG IDPAEAADLG RQAFYGGTVF DSLTLSNGDE LYLQGIERLR FSDATIDLTP NLDATSELQW NTNVMDVPGA WRFNTGSNDV VLVSLDSGLR DDGSGSPAFN AEIDHVDYLT EVNSYNSEHG HRSMSLMAAR HDGTGVAGIA PDSQLWSFNA KTANQNGIGF EAALEQTRER REGHQRIVFQ GGVQGDWPWT SDGASTREEV EAEFAASRPW GFFAIAAGNG GGQTFSENDY LTTVSGVAEA AVSYDNITSV GALESGLRTD IDGLINATDV SLATYSNRGS NLTLVAPTDS WAMDVDGTRS WFDGTSASNP NLAGVAALVW SENNDLTGGQ LREILISSAM DLGTGGVDTT FGNGLANAES AIRRAHALEA NQELALFWDN NSFLA
|
| |