Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_28031 |
Symbol | |
ID | 4778543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2469874 |
End bp | 2471475 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640088326 |
Product | hypothetical protein |
Protein accession | YP_001018798 |
Protein GI | 124024491 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCAGG TCATGACATC ACAGTTGAAT GTCTCAGTTG CAGGGAGCGT GTGGTGGAAC GGGTTGACAG CCGAACTCAA CATCACCAAT ACGACCAGCG AAACTCTTTC AGAATGGAGT TATAGCTTCA TAACACCCCA CAAAATCTCA GGGATGCCAT GGGGTGTCAG CACATCGGCC GAGCAGCTCG TCAACGGACA AACGAAATAC ACACTGACAG GAATCGGTTG GGCGCAAACC ATTCCGGCTG GAGGGTCCGT CACTGTCGGC TTCAATGCGC AACAGGGCAA GGCACTCGGT ACAGAAGGAG TGCTGACCGC TGAATTGTTG ATGACTAAAG CGTCAGAAAT GGCAAGCACC GTTGCATCAT CTCTTGCGGT AAGTGATGCA CCAGCAGTTG AAGTGCAAGG GCATCCACAG AGCGAGAACG ACTCTGCAAT GGAAGGAATG CATGCACATA CAACCTCCGA CTCTGCTTTC ACCCTGATCA CCGCCTGGGG TGCATCAAGC GGCAGTGAAC ACACCACCCA CGATGAACTG ATGGGGGGAC GCACTCCCAT CACCACAGAA GCACATGTTG CGTATAACAA TCTCCGCACC TTTCTCGGAC TGGACCCTGC ATCCCTAGAA GACATCGGCA ACTGGGCCTT CGCTAATAAC CTCACCAATA ACTCCCAGGC CTGGGGCGAT GATCTCCAAG GTGTTGGTCT CTGGTACTCC ATGCAGGGGG CGAAAGTTGG CTGGATCGCC GATGAAAACT ACGATCCACA ATGGCTTGCC GATCTACAAC GCAGCGCACG TCTTGGTAGC CCCAATGACG TGATGAGCAT GGCGAGACAG ATCGCTAAAC CTGGCTTCAT CGATTACCTC GAGGGCATCG ATGGCGTCGA TCACTTCATC AACACTTTGA AGATGGAGCC CCATTTTGGT GGCTGGATGC ACGATAGGGC TCATGGATGG CTATCAATCG AAGACGTTGC TATCGCTCAT GACATCAACC ACCTCACAGT GCTCAGTCAT GACCAAACAC AACCGTTCAT GAATGACACC TTTGATTGGC CGCAATGGCC TGCCTTAGAG GTCTCCGATC AGGTCGTCAT CGACTACTTC CAAAGCATGG TGAGCCTGGG CGGCCCACTG GGATCAAACT TGGACGCTCT AGGTACACCG ATAAATGAGG AGAATGAGAA ACCTCAACAG GAACCAGTTG TTCTCGTTGA GCAAAGCCAG GTTTCACAGA TTGATCCAAT CACCGGTAGC GCGGTGGATG TTGAGGTGTC TGGTGATCTT TGGTGGGGTG GCTTCACCGC GGAGATCACC ATCACCAATA GCAGTGATCA GCGTCTGGAG AATTGGGCGG TAGGCTTCAA CAGTATTCAT CACTATTACG GCGAGTCCTG GGGTGTTGAT GTCGTTACCG AAGAGGTCGC TGATGATCTC TACAGTTATA AAATCTATGG AGCTGACTGG GGTCAGTCGA TCGGAGCTGG TCAATCGATG ACTGTGGGCT TCAACGCGCT AACGGGTATG GATCTGGAGC GTAGCGGTTC TCTCACCGCC GAGAGCCTAT TTGCCGAGGG CAGCGAGCCT GTACTGCTCT AA
|
Protein sequence | MNQVMTSQLN VSVAGSVWWN GLTAELNITN TTSETLSEWS YSFITPHKIS GMPWGVSTSA EQLVNGQTKY TLTGIGWAQT IPAGGSVTVG FNAQQGKALG TEGVLTAELL MTKASEMAST VASSLAVSDA PAVEVQGHPQ SENDSAMEGM HAHTTSDSAF TLITAWGASS GSEHTTHDEL MGGRTPITTE AHVAYNNLRT FLGLDPASLE DIGNWAFANN LTNNSQAWGD DLQGVGLWYS MQGAKVGWIA DENYDPQWLA DLQRSARLGS PNDVMSMARQ IAKPGFIDYL EGIDGVDHFI NTLKMEPHFG GWMHDRAHGW LSIEDVAIAH DINHLTVLSH DQTQPFMNDT FDWPQWPALE VSDQVVIDYF QSMVSLGGPL GSNLDALGTP INEENEKPQQ EPVVLVEQSQ VSQIDPITGS AVDVEVSGDL WWGGFTAEIT ITNSSDQRLE NWAVGFNSIH HYYGESWGVD VVTEEVADDL YSYKIYGADW GQSIGAGQSM TVGFNALTGM DLERSGSLTA ESLFAEGSEP VLL
|
| |