Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_02811 |
Symbol | |
ID | 4778671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 294098 |
End bp | 296611 |
Gene Length | 2514 bp |
Protein Length | 837 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640085785 |
Product | hypothetical protein |
Protein accession | YP_001016301 |
Protein GI | 124021994 |
COG category | [N] Cell motility [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF [COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.370164 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAAGGAT TCGGGGAAAA AAGCGAAGCA GGGAAGCAGA AAAAAATAAA TAACGAAAAA AAAGGATTAC TTTATTTCAA TAAAGCTGTT AAAAGCCATG CACAAGGCGA CATCCAGCAA GCCAAACTCT TATATCTTAA ATCAATAGCG AATGGCTTAG AGAACGAATC CCTATATACA AACCTAGGCG TAATTTACAA AAACGAAGGA GACTTTAAAG AGTCAGGCAG ATGCTATAGA TCCGCTTTGC GGATCAATCC ATTCTCATGC GATGCTTACA CCAACCTGAG CTCTCTTGCA ATCGCAGAAA ACGAATTCAC ATCAGCCTTG GATCTCGCGA ACAAAGCCAT AAAGTTAAAT CCTAATTGTG ATGTTGCCAA CTTAAATGCA GGAAAAGCGC TTTTAGAGCT CGGTGATCTT GAACAGGCTC TTGCCTCAAC CCTCAAATCT CTAGAACTCC AGCCAGATAA CCACACTGCC CACATGAACC TGGGCAGCAT TTACCAAGAT CTCGGTGAGC TCGATCAAGC TCTTGCTTCC ACTATCAAAT ATCTAGAGCT CAAGCCCGAC AACCCCGATG CCCTCATGAA CCTGGGCGGC ATCTACAAAG ATCTCGGTCA GCTCGATCAA GCGCTCGCAT CAACGCTGAA GAACCTAGAG ATCAAGCCTG ATAACCCCAC TGCCCACATG AACCTGGGCG TCATCTACAA AGATCTCGGC AACCTGGATC AAGCTCTTAC CTCAACCCTC AAATCTCTAG AACTCCAGCC AGATAACCAC ACTGCCCACA TGAACCTGGG CAGCATTTAC CAAGATCTCG GCAACCTCGA TCAAGCTCTC ACCTCAACCC TCAAATCTCT AGAGCTCAAA CGCGATAACC CGGACGCCCT CACGAACCTG GGCGGCATCT ACAAAGATCT CGGCAACCTA GATCAAGCTC TCACCTCAAC CCTCAAATCT CTAGAGCTCA AACCCAATAA CCCGGACGCC CTCACGAACC TGGGTGGCAT CTACAAAGAA CAAGGGCAAC TCGATCAGGC TCTTACTGCT TACAAAAAAG CAAGCACACT TGCACCGAAG GAGTTGAGGC ATGTAGCAGC GTCAACGCTG TTTTTCAGTG ATCTGCACAA AGACAATGAT GAGATCAACT CCGAAAGAAC TGCATACAGG CAAGGCATTA AACAACTCGC GCGGAGCTCT ACAGAGATGG AGCAACCCAA ATCAAGTTAC TCAACTGATA TGTTCTGGAT TGCCTACCAC AACAGGGATG ACGACAGAGA GATTCTCGAA AGCCTAGGGA GAGCCCTGGC TTCTCTGCAA AAAGGAACCC TCACAAAAGC GATCAGTGGA GCAGGAAGAA ATTTAGCAAG TAGGGGAAAA ATAAGGCTAG GCATCTGCTC TGACTACCTC CGTTCTCATA GCATCGGCAA GCTATATGCA GGAATGATTA AGGAGTTTAA AGACCGCGGA TTTAACATCA CTATTTTTAG GGGACCCCAA TCAAAAACCG ACGAAGAAAG TCTAAGAATA GACTCTTACG CAGTTTCATC AATTAAGCTT CCGGAATCAC CACAAGCAGC TTGCGAGATC ATCAGAAATG AACACCTAAA TGTTCTCCTA TATCCAGATA TTGGGATGTC CCCCTATACG TACATTCTTG CCATGTTCCG ACTTGCTCAG GTGCAAGTTA CAGGCTGGGG GCATCCAAGC ACGACAGGCC TGAAGACAAT GGACTACTTT TTGTCGTGCG AACCTATTGA GCCAGACAAC GCTCAATCAA AATATACGGA ACAGCTAATA AAGCTCAAAA AACTACCTTG CATCTATACA CCTCCGGAGA CCACGGCAAT ATCTAGCTCC CGAGACAAGT TCATGCTGCC ATCGGATAAA ATCTTAATCG GAATTCCTCA GAGCCTATTC AAGTTTCACC CTGATTATGA TGTGGTCTTA GAAGAGATTC TTTACAGGCT TCCCAATGCA AAGTTTGTCT TGATCGAAGG GCAGAACAAG TCGCAAACGG AGCGCCTCAA GAACCGATGG GCAACTAAGG CACCAAAGAC ATTAGAGAAT GCGATATTCC TGCAAACAAT GCCCCAGGCA GATTATTTGT GTCTGCTAAA AACCGTAGAC ATTCTACTAG ACCCGATTTA CTTCGGAAGT GGCAACACAT TTTATGAGTC GATGGCAGTT GGCACACCTC TGGTAACCAT GCCTGGAGAC TATATGCGAG GTCGAATCGT AGCTGGTGGT TACAAGCAGA TGAAACTTGA GAATGCACCT ATTGCCGCAA ACACTCAGGA ATATATTGAG ATTACCGTCA TGCTTGCGGA GAATGTTGAA TCAAGGAAAT GCTTAAAAAA GCAAATTGAG GCTCGTGCTC AGAAGTATCT TTTTAATGAT CAGGAAGCAG CTAATGAAAT AATCGAGTTC CTCCAGGCAG CTGTGGATTG CAGACATAAG ACAGGCGGTC TTCTGCCTAT AGGCTGGATT CCATCTCAGA GACCAAGCCA GTGA
|
Protein sequence | MKGFGEKSEA GKQKKINNEK KGLLYFNKAV KSHAQGDIQQ AKLLYLKSIA NGLENESLYT NLGVIYKNEG DFKESGRCYR SALRINPFSC DAYTNLSSLA IAENEFTSAL DLANKAIKLN PNCDVANLNA GKALLELGDL EQALASTLKS LELQPDNHTA HMNLGSIYQD LGELDQALAS TIKYLELKPD NPDALMNLGG IYKDLGQLDQ ALASTLKNLE IKPDNPTAHM NLGVIYKDLG NLDQALTSTL KSLELQPDNH TAHMNLGSIY QDLGNLDQAL TSTLKSLELK RDNPDALTNL GGIYKDLGNL DQALTSTLKS LELKPNNPDA LTNLGGIYKE QGQLDQALTA YKKASTLAPK ELRHVAASTL FFSDLHKDND EINSERTAYR QGIKQLARSS TEMEQPKSSY STDMFWIAYH NRDDDREILE SLGRALASLQ KGTLTKAISG AGRNLASRGK IRLGICSDYL RSHSIGKLYA GMIKEFKDRG FNITIFRGPQ SKTDEESLRI DSYAVSSIKL PESPQAACEI IRNEHLNVLL YPDIGMSPYT YILAMFRLAQ VQVTGWGHPS TTGLKTMDYF LSCEPIEPDN AQSKYTEQLI KLKKLPCIYT PPETTAISSS RDKFMLPSDK ILIGIPQSLF KFHPDYDVVL EEILYRLPNA KFVLIEGQNK SQTERLKNRW ATKAPKTLEN AIFLQTMPQA DYLCLLKTVD ILLDPIYFGS GNTFYESMAV GTPLVTMPGD YMRGRIVAGG YKQMKLENAP IAANTQEYIE ITVMLAENVE SRKCLKKQIE ARAQKYLFND QEAANEIIEF LQAAVDCRHK TGGLLPIGWI PSQRPSQ
|
| |