Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_20431 |
Symbol | |
ID | 4776292 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1798841 |
End bp | 1800976 |
Gene Length | 2136 bp |
Protein Length | 711 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640087554 |
Product | hypothetical protein |
Protein accession | YP_001018046 |
Protein GI | 124023739 |
COG category | [R] General function prediction only |
COG ID | [COG0658] Predicted membrane metal-binding protein |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.192171 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCTTGTTG CAGGCTTGGC CCTGACAGCA CTAGTGCTTG GTGCCCAACC ATTGCCTTTT CAAGCAATGG TTGTCGTTGC TGTGGTGTTG CTGATCACTG CTCGTTTCTA CTTGATTCGT CTCGGCATTA CCCAAAGCAG GGCTACAGCA TTGGCGTTGA TGTTGCCATT GTTGTTGGCA TGGGCTTGCT GGTGGGGGCA GCCGAGGCCT GGACCTTTGG ACCCTGTGCG TTTGTTGGCG AGTGCCGGTT CTTTTGATCG TTCCTCACGC CTACCGCTTC GCAGCAGCCT CGAAGGCCGG CTGCTGAGTG ATAGCCGATT GCTTCGTGAT AGCAGCTTTG TGGCTGATAG TTGCCGTGCA TTGTTGGCGG TGAACCGGAT CAATGGCCTG CCGAGACATG GCCGTAGTGA AGTTCAGCTA AGACCTTGCC CTCAGCTTTT GCAGCAGGGT TGGCGAATCC GTGTGCATGG TCAGTTGCGC GCTCCTGCTC TCGGGCCTCA TTCGCTACTG CCTGGGCCAG CGGAGCGGCT TCAGCAGCAG GGAAGTTGGA GCCAATTCTG GGGTGATCAG GTTGAGGTGT TGCAACGCCC CTGGACGCCC ATCGCCGATG CTCGCCGTGG GGTCGCCTTG CGCTTACAGC AACTGGCAGG TCCACGTTCG GGAGGATTGC TGGCGGCTTT GGTGTTGGGT CGTGCCCAAG TTGATCTGCC GCTTGATTTG GTTAAGGCGT TCCGCGTTGC TGGCCTTTCC CATGCCTTGG CGGCTTCGGG CTTTCATCTT TCTGTGCTGT TGGGAGCGTC GCTGGGGATT GCTCGGTTGT TGCCAAGGCC CTTTCGTTTG ACGTTTGCTG CAATGGCACT GATCAGCTTT TTGATGCTGG CCGGACCACA ACCGTCGGTT GTGAGAGCTG TCTTGATGGG TAGCACGGTG TTGTTGATCA ACGAGGGCGG AGGCCGCAGT CGTCCTCTAG GGGTCTTGCT AGCGACGTTG GTGCTCATGT TGTTGGTGAA TCCCGCCTGG GCGCGTTCGA TTGGTTTTCA ATTGAGTGCG GCAGCTACAG CCGGATTGGT CGTCACAGCA GGTCCCCTAG AGCAGGCTCT CTCCAAGCGT CTACCTACAT GGCTTCGAGG GTTGGCACCG GCGCTGGCTG TTCCTTTGGC CGCAATCGTC TGGACGCTGC CACTACAGAT TGTTCATTTT GGATCCACTC CTGTCTATGC CCTATTGGCG AATCTGCTGG CTGCTCCTGT GTTGGTGCTG CTCACCCTCT CAGCGATGGC GTTGGCCTGG TTGTGTTTGC TCTTGCCTGC TGGCTTGCTG ACTCCATTGC TGAACTGGAT CAGTTGGCCG ATCCAGCAGT TGGCTGGGTT ATTGATTGCT CTGGTGCGTT GGATCTGCAC TTGGCCAATG GCGCAGTTGT TCACCGGTCA TCCCCAGCCT TGGTTGGTGC TGTTGTTGGT GTTGGGATTG CTGCCTTGGT TGGTAGCGGG ATTGCGTCAC TGGCGTTACT GGGGAGTGGT TGCGTTGTTG GTTTGTTCGT TATCGCAGGC TGTTGTACAG ATGGGTGATG GATTCGTGGT GGTGCATCAG CGCAGTCGTC AATGGTTACT AGCTAGGCAT GGTGGGCGGG CGGTTTTGGT TAGTACGCAT GGAGATGGTC GTAGCTGTTG GCAGGCTCAT CGGCTGAGTG AAGCGTTTGG CCATGCCCGT CTTGATTGGG CGATGGTCTT GGATCCGGTT GCCAGTGAGG CGGCGTCCTG TTGGCGAAAT CTGGCCCATA CCGTGCTGGC TGAGCATCAG GGTGTGATGC CTTTGCAGGT TGGACAGCGG TTAGTCAGTC CTGGCCTGGA GGTGAGACCA ATCGCTGCAG CAGAGCAGAG TTTGCAGTTG CGAGCTGGTC GTTTGCGTTG GCATCTGTTG CCAACGCGTC AGGCCTATTG GAGTTGGCGT GATCGTCAGG GTTTGGATGG CGTGGAGCGT CAACCTTTAG TCAAATCAGA GGGTTTGACA GGGGTATGGC TTGGCTTTGT CCCAACCTCT CTTGAGCGGC GATGGTTGCT ACAACACACT GCAGGGCGGC TTTGGGTCAG TGGTAGAGCC AGCGGCTCCT TCAGCATGGT GGCCTCAGCG TTTTAA
|
Protein sequence | MLVAGLALTA LVLGAQPLPF QAMVVVAVVL LITARFYLIR LGITQSRATA LALMLPLLLA WACWWGQPRP GPLDPVRLLA SAGSFDRSSR LPLRSSLEGR LLSDSRLLRD SSFVADSCRA LLAVNRINGL PRHGRSEVQL RPCPQLLQQG WRIRVHGQLR APALGPHSLL PGPAERLQQQ GSWSQFWGDQ VEVLQRPWTP IADARRGVAL RLQQLAGPRS GGLLAALVLG RAQVDLPLDL VKAFRVAGLS HALAASGFHL SVLLGASLGI ARLLPRPFRL TFAAMALISF LMLAGPQPSV VRAVLMGSTV LLINEGGGRS RPLGVLLATL VLMLLVNPAW ARSIGFQLSA AATAGLVVTA GPLEQALSKR LPTWLRGLAP ALAVPLAAIV WTLPLQIVHF GSTPVYALLA NLLAAPVLVL LTLSAMALAW LCLLLPAGLL TPLLNWISWP IQQLAGLLIA LVRWICTWPM AQLFTGHPQP WLVLLLVLGL LPWLVAGLRH WRYWGVVALL VCSLSQAVVQ MGDGFVVVHQ RSRQWLLARH GGRAVLVSTH GDGRSCWQAH RLSEAFGHAR LDWAMVLDPV ASEAASCWRN LAHTVLAEHQ GVMPLQVGQR LVSPGLEVRP IAAAEQSLQL RAGRLRWHLL PTRQAYWSWR DRQGLDGVER QPLVKSEGLT GVWLGFVPTS LERRWLLQHT AGRLWVSGRA SGSFSMVASA F
|
| |