Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PMN2A_1233 |
Symbol | |
ID | 3606626 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL2A |
Kingdom | Bacteria |
Replicon accession | NC_007335 |
Strand | + |
Start bp | 1732125 |
End bp | 1734854 |
Gene Length | 2730 bp |
Protein Length | 909 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 637688108 |
Product | TPR repeat-containing protein |
Protein accession | YP_292426 |
Protein GI | 72383071 |
COG category | [N] Cell motility [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF [COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.161918 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGACCGAAG AAAGAAAGAA TCAAAAGCAA GAAGGCTCTA AAGTAAAAAC ATTCCCAGTC TCTTTTGCTT TAGGTGAAAT TAAAGAAAAC ATCACTATTA CGACTAATAG CGCTTCTAAT ACTTCAAAGG AACAAAAAAG ATTTGGAGAC CAGAGTAAAA CTATAAAGAA AAAAGATACC AATACAATTA CTAAACCTTC CAAAGATCAA ATCATTAATC AAGCATTTAA ATTTCATTCA CAAGGCAATA TAAAAGAAGC AGCAAAAAAT TATCAATATT TCATTAATCA AGGTTTCTCT GACCACATGG TTTTTTCTAA TTATGGAGCA ATATTAAGAG ATCTTGGTAA TTTACAAGAT GCAGAATTAT ATACTCGCAA AGCAATTAAA ATTAATCCTA ATTACGCATT GGCTTACTCT AATCTGGGAA ATGTATTAAA AGATCTTGGT AAGTCACAAG ATGCAGAATT GTCATACCGC AAAGCAATTC AAATTAATCC TAATTACGCA GATGCACATT ACAATCTGGG AATAATATTG AAAGAACTTG GTAATTTACA AGACGCAGAA TTGTCATATC GCAAAGCAAT TCAAATTAAT CCTAATTACG CAGATGCATA TTCCAATCTG GGAAATGTAT TAAAAGATCT TGATAATTTA CAAGACGCAG AATTGTCATA CCGCAAAGCA ATTCAAATTA ATCCTAGTTA CGCAGACGCA TATTCAAATC TGGGAAATGT ATTAAAAGAT CTTGGTAATT TACAAGACGC AGAGTTGTCA TACCGCAAAG CAATTCAAAT TAATCCTGAT TACGCAGAGG CGCATTTCAA TCTAGGAAAT CTATTAAAAG ATCTTGGTAA ATTACAAGAC GCAGAGTTGT CATATCGCAA AGCAATTCAA ATCAAATCTG ATTACGCTGA GGCGCATTAC AATCTAGGAA TCATATTGAA AGACCTTGGT AATTTACAAG ACGCAGAATT TTATAATCGC AAAGCAATTC AAATCAAACC TGATTACGCT GAGGCGCATT TCAATCTGGG AATCATATTG AAAGACCTTG GTAATTTACA AGACGCAGAA TTCTCATATC GCCAAGCAAT TCAAATCAAA CCTGATTACG CAGATGCCTA CTCCAATCTG GGAAATGTAT TAAAAGATCT TGGTAAGTTA AAAGACGCAG AATTGTCATA TCGCAAAGCA ATTCAAATCA AACCTGATTA CGCTGAGGTT TATTCCAATC TGGGAAATGT ATTAAAAGAT CTTGGTAATT TACAAGACGC AGAATTTTCA TACCGCAAAG CAATTCAAAT CAAACCTGAT TACGCAGATG CTTACTCCAA TCTGGGAAAT ATATTAAAAG AGCTAAGTAA TTTCACTGAC GCTATAAATC AATTCAAGGA TGCACTAAAA TTGAACAATG AATTAACATC AGCTCAGACT GGTTTAATGT CAACTCAGGG TAATATATGC GATTGGAGTG ATGAGGAGAC TCATAATAAA TGGCTTAAAT CACTTGGTAT TAAAGGAAAA GCTATTAATC CATGGGGATT ACTTTCATTA GAAGATAATC CTTTAAATCA TTTAAAAAGA TCTAAGAAAT TTTATAAAGA AAAATATGTA CGCGCAACTC AATATATTAA ACCTTCCCCA AAAAGTTTAA TTCATATAGG TTATTTCTCT GCTGATTTCA GGACTCATCC TGTAATGCAA CTAATTGCTC CTTTACTTGA GCTACATGAT AAATATAGGT TTAAAATATA TTTATACTCA TTTGCACCAA AAGAAGATGA ATATACTGAA AGAGCAAAAA AGTCTGGATG CATATTTAGA AACATCAAAA ATTTAAATGA TATTGAAGCA GTTGAATTAG CAAGAAGTGA TCAGTTAGAT ATTGCAGTAG ATCTCATGGG ATATACCAGA CACAATAGAA TGCCTATATT CTCATATAGG GTGGCACCAA TACAAATCAA TTACTTAGGT TATATTGGCA GTATTGGTTC AGATACTATT GATTATATTA TCGCGGATAA AATCACAATT CCGAGGGAGT ATGAAAAATT TTATTCCGAA AAAGTAATAC GAATGCCAAA TTGCTTTATA TGTGATGATC ATAAAAAAGA AATTACTAAG GAGTCCATAT CTCGTAAAGA TTTCAACCTT CCTGACCAAG GCTTTATATT TACTTGTTTT AATAATAATT ATAAAATAAC AAAAAAAGAA TTTAATATAT GGATGAACTT ACTTAGAAAA GTAGAAGGAA GTGTTCTTTG GTTATACAAA TCAAATCAAC TTTCCATGAA TAATTTATAC AAGGAAGCGA GTAAACGAAA AATAGATCGA GACAGAATAA TATTCGCTGA AAAATTACCA ATGAGCAAAC ATTTAGCTAG GCATTCTCTA GGTGATTTAG CACTTGATAC TTTTAATTGT AATGGAGGTA AAACAACTTG TGACGCGTTA TTGGCTGGCT TACCATTACT TACAAAGATA GGTCAGAGTT TTACTGCTAG GATGTCTGCC AGCCTGCTTA CATCATTGGG ACTTCCCGAA TTAATTACTT ATAGTGAAAG TGAATACGAA GATAAAGCTT TATATATTGC TAGCAACTCT GAAGAGATTA TTCGATTGAA ATCTAAATTA AACAAATCGA AAGAAACATC ACCACTTTTT AATTCAAAAT TATTTACGCA AGATCTTGAA AATATTTATC TTGATCTGGT AAAAAAATAA
|
Protein sequence | MTEERKNQKQ EGSKVKTFPV SFALGEIKEN ITITTNSASN TSKEQKRFGD QSKTIKKKDT NTITKPSKDQ IINQAFKFHS QGNIKEAAKN YQYFINQGFS DHMVFSNYGA ILRDLGNLQD AELYTRKAIK INPNYALAYS NLGNVLKDLG KSQDAELSYR KAIQINPNYA DAHYNLGIIL KELGNLQDAE LSYRKAIQIN PNYADAYSNL GNVLKDLDNL QDAELSYRKA IQINPSYADA YSNLGNVLKD LGNLQDAELS YRKAIQINPD YAEAHFNLGN LLKDLGKLQD AELSYRKAIQ IKSDYAEAHY NLGIILKDLG NLQDAEFYNR KAIQIKPDYA EAHFNLGIIL KDLGNLQDAE FSYRQAIQIK PDYADAYSNL GNVLKDLGKL KDAELSYRKA IQIKPDYAEV YSNLGNVLKD LGNLQDAEFS YRKAIQIKPD YADAYSNLGN ILKELSNFTD AINQFKDALK LNNELTSAQT GLMSTQGNIC DWSDEETHNK WLKSLGIKGK AINPWGLLSL EDNPLNHLKR SKKFYKEKYV RATQYIKPSP KSLIHIGYFS ADFRTHPVMQ LIAPLLELHD KYRFKIYLYS FAPKEDEYTE RAKKSGCIFR NIKNLNDIEA VELARSDQLD IAVDLMGYTR HNRMPIFSYR VAPIQINYLG YIGSIGSDTI DYIIADKITI PREYEKFYSE KVIRMPNCFI CDDHKKEITK ESISRKDFNL PDQGFIFTCF NNNYKITKKE FNIWMNLLRK VEGSVLWLYK SNQLSMNNLY KEASKRKIDR DRIIFAEKLP MSKHLARHSL GDLALDTFNC NGGKTTCDAL LAGLPLLTKI GQSFTARMSA SLLTSLGLPE LITYSESEYE DKALYIASNS EEIIRLKSKL NKSKETSPLF NSKLFTQDLE NIYLDLVKK
|
| |