Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_15921 |
Symbol | |
ID | 4776100 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1397667 |
End bp | 1399469 |
Gene Length | 1803 bp |
Protein Length | 600 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640087101 |
Product | hypothetical protein |
Protein accession | YP_001017601 |
Protein GI | 124023294 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG2192] Predicted carbamoyl transferase, NodU family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGTTA TAAGTATCCA ATGGTATAGA AAAATGAAGG GTGCAATACT TGGAATATCC TCTGGTTTTC ATGATTCTGC TGCAGCACTA TTGAGCTACG AAGGCAAAAT TCTATTTGCT TCATCAGAAG AAAGATTTAC GAGAAGAAAA GGGGATAAGT CCTGGCCTAA ATCTGTCATA TTAGAAATTT TGGACTATGC AGAACTTCAT GATTTAACAA TCAAAGGAAT TTGCTACTAC GAGAATCCAA TCAAGAGATT GAGGTGGTCC TTAATCCAGT CTTTTCGGGC GTCGATTCCA CTCCGTGAGA AAATTCGCAG AGCTCGCCTT TTATCTTCTA GTTATATTGA TTTGACAAAT CATTTAGACT CTCTTTTAAG TCAATTAAAT CTATCTTCAT CAGAGCTCTT CATTAGCGAC CACCATATAA GCCATGCCGC AGCTTCATTT GCTTTCAGTA GCCACAATTC AGGATTTGTA TGTGTGTTGG ATGCCTTTGG CCAAGATTGC TCTGGAATTA TAGGTTATCT ATCACCATCT AAAAGCCTTA GAACATTTAA GGTTTTATCT GTAGATCAAT CAGTAGGTCT CTTTTACTCA GCAATTACTT CAATTTGTGG ATTCAAGATT TTAACTGGAG AGTATAAAGT GATGGGCTTG GCACCCTATG GAAAACCCAT ATTTTTTGAC AAGCTTGTCA AGATCTTTGG TTATCCATCG ATTAACCAGT TCAGTACAAG TATTCTTGAT CCATTCCTGC CTGCATTGGC TTCCAAGTTA TTACGCTCAA AACTCGGTAT TCCTTCGAGA GAGCAAGAGG GGCCCATCGA TTCTGTTTAT ATGGATTTGG CGGCTTCTGC ACAGAAATAT CTAGAATACC TCGTAGTAGA TATTTTTTAC ACATACCTCC CTAAAGTTCC TGGAAGTTGT AGTCAACATA TATTTCTAGG TGGAGGTGTA GCTTTGAATT GTAAGCTTAC GTACGTTCTG GAAAACACAT TCCCAAATTA CACCTTTTCA ATTTGTCCCT CTGCTGGTGA TAGCGGTTCA TCAGTTGGAG CATGCTATGC ACATTTGATG GAATTTAATT CGACCAATCT TCATGCTAAC TATTCGGTTC ATCTTGGCTA TAGTAATGAT AAACGATATA TAAAGAGCTC CCTGAAATCA CTTGGTTTTA AAATTTCCAT CTATGAGGGC TCTTCTCTCG CTAGGACAAT TTCATCACTT TTGATTAAGG GCAAGGTTGG AGCAATTTGT ACCGGCCCAT CTGAATTTGG ACCTCGAGCT CTAGGCAGTC GTTCAATTCT TGCAAACCCA AATGATAACA ATGCTATTTC ATTCGTTAAT AGATCTATTA AATCGAGAGA AGACTTCAGA CCCCTTGCAC CTGTTACGAC CTTAGAAATT TATCGTGAGT TATTTGACGA GAACTCTGTA AATCAGCTTT TGTATTACAT GTTGACTCTT GTCAAAATAC CTTCTAATGT TATAAATATT ATACCTTCTG CCGTACACGT AGACGGAACT GGTCGTCTTC AAGTTTTAAG AGCTAATCAG AATCCTTTTC TTCACGAGAT CATCACATGT TTTTATCGTG AATCAGGGGT TCCCGCATTG ATTAATACAA GCTTTAATCA AAGAGGTGAG CCATTGGTAA ATACTACAGT AGATGCGTTG AGATGCTTTT GTTCAACTGA ACTCGATTTT CTGTGCATTG AAAGTGAGCT TCTTATCAAA TCAGAACAAC ATTCAAATAT TGTTTCTGGC TTTAGGCAGA GATCTTCATT CCCTTTGGAC TGA
|
Protein sequence | MKVISIQWYR KMKGAILGIS SGFHDSAAAL LSYEGKILFA SSEERFTRRK GDKSWPKSVI LEILDYAELH DLTIKGICYY ENPIKRLRWS LIQSFRASIP LREKIRRARL LSSSYIDLTN HLDSLLSQLN LSSSELFISD HHISHAAASF AFSSHNSGFV CVLDAFGQDC SGIIGYLSPS KSLRTFKVLS VDQSVGLFYS AITSICGFKI LTGEYKVMGL APYGKPIFFD KLVKIFGYPS INQFSTSILD PFLPALASKL LRSKLGIPSR EQEGPIDSVY MDLAASAQKY LEYLVVDIFY TYLPKVPGSC SQHIFLGGGV ALNCKLTYVL ENTFPNYTFS ICPSAGDSGS SVGACYAHLM EFNSTNLHAN YSVHLGYSND KRYIKSSLKS LGFKISIYEG SSLARTISSL LIKGKVGAIC TGPSEFGPRA LGSRSILANP NDNNAISFVN RSIKSREDFR PLAPVTTLEI YRELFDENSV NQLLYYMLTL VKIPSNVINI IPSAVHVDGT GRLQVLRANQ NPFLHEIITC FYRESGVPAL INTSFNQRGE PLVNTTVDAL RCFCSTELDF LCIESELLIK SEQHSNIVSG FRQRSSFPLD
|
| |