Gene Information |
Locus tag | P9303_00171 |
Symbol | |
ID | 4776065 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 20686 |
End bp | 21966 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640085516 |
Product | pili biogenesis protein |
Protein accession | YP_001016039 |
Protein GI | 124021732 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1459] Type II secretory pathway, component PulF |
TIGRFAM ID | |
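The coordinate and length fields above are consistent with one another, and this can be checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes 1-based inclusive coordinates (the convention that reproduces the reported 1281 bp) and a CDS that ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(20686, 21966)
print(length, protein_length(length))  # 1281 426
```

Both results match the Gene Length and Protein Length fields reported above.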
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTCCT TCGTCGCCAC CTACGAATCA GCAAGTGGTC AAACACGCAC AATGACCATC AAAGCGGCCG ATCTAACAAC CGCAAAAAAG TTACTACGTC GTCGTGGAAT ACGAGCAACT GACCTAAAAG CAGCTCTCAA CAATAAAGGA CAAGAAAGTA GCAAAAAAAT TGACCGCCAA AAGAACACAA AATCTTCAAA CCTTGGACTA TTTTCCATCG ATCTAGCTGC CGCTTTCGAA AAGTCCCCAG GTGTAAAAGA CAAAGCAGTT TTCGCAAGCA AATTGGCAAC TCTGGTGGAT GCAGGCGTTC CGATCGTGCG CAGTCTCGAC CTAATGGCTA ATCAGCAGCG CTTGCCTATG TTCAAACGTG CACTGATGAA GGTGAGTCTT GATGTAAATG AAGGCAGTGC CATGGGCACT GCTATGAGAA TGTGGCCAAA GGTATTCGAC CAACTCAGTA TCGCGATGGT GGAAGCCGGA GAGGCTGGTG GTGTTTTGGA TGAATCTCTC AAGAGACTCG CCAAGCTATT AGAAGACAAT GCCCGACTCA AGAACCAAAT CAAAGGGGCT CTTGGTTACC CAATCACCGT GCTGGTGATA GCCATCCTCG TCTTCCTAGG CATGACAATC TTCTTGATCC CGACCTTTGC AGAAATCTTT GAAGATTTAG GTGCCGAGCT GCCCCTGTTC ACCCAGTTCA TGGTCGACCT AAGCAAATTA TTGCGTTCCT CCTTTTCCCT GCTATTAACA GGTGTCCTAC TGGTGTGCGC TTGGATTTTT AACCGCTACT ACTCCACTCA CCAAGGACGC CGTCAGATCG ATCGACTGAA GCTGAGAATC CCCCTATTCG GCAATCTGAT CATCAAAACT GCCACCGCTC AATTCTGCAG AATCTTCAGC TCATTGATCA GAGCAGGCGT ACCAATCCTG ATGTCACTAG AAATTGCCAG CGAAACCGCT GGCAATGCGA TCATTTCCGA CGCCATTCTT GAATCACGCA CCCTTGTGCA GGAAGGGGTA CTCCTTAGTG CTGCCTTGAT TCGCCAAAAA GTACTTCCAG ACATGGCCTT AAACATGCTG GCCATTGGCG AGGAAACCGG AGAGATGGAT CAAATGCTCA GCAAGGTGGC TGATTTTTAC GAGGATGAGG TCTCTACCTC GGTGAAGGCT CTTACCTCAA TGCTTGAACC AGCGATGATC GTTGTAGTGG GTGTCATCGT GGGCTCCATC CTGCTAGCGA TGTATCTCCC CATGTTCACC GTGTTTGATC AGATCCAGTA G
Protein sequence | MTSFVATYES ASGQTRTMTI KAADLTTAKK LLRRRGIRAT DLKAALNNKG QESSKKIDRQ KNTKSSNLGL FSIDLAAAFE KSPGVKDKAV FASKLATLVD AGVPIVRSLD LMANQQRLPM FKRALMKVSL DVNEGSAMGT AMRMWPKVFD QLSIAMVEAG EAGGVLDESL KRLAKLLEDN ARLKNQIKGA LGYPITVLVI AILVFLGMTI FLIPTFAEIF EDLGAELPLF TQFMVDLSKL LRSSFSLLLT GVLLVCAWIF NRYYSTHQGR RQIDRLKLRI PLFGNLIIKT ATAQFCRIFS SLIRAGVPIL MSLEIASETA GNAIISDAIL ESRTLVQEGV LLSAALIRQK VLPDMALNML AIGEETGEMD QMLSKVADFY EDEVSTSVKA LTSMLEPAMI VVVGVIVGSI LLAMYLPMFT VFDQIQ
| |
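The sequence fields are shown in space-separated blocks of ten characters, so they need light cleaning before analysis. The sketch below is illustrative only and is not taken from the source database. It strips the whitespace, computes GC content, and translates a CDS using the amino acid assignments of translation table 11, which are the same as those of the standard code. All function names are hypothetical. The displayed gene sequence begins with ATG and ends with TAG, so it appears to be in coding orientation already. The `reverse_complement` helper would therefore only be needed when slicing a minus-strand gene directly from the replicon.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, with codons in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a non-empty DNA sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def reverse_complement(seq: str) -> str:
    """Reverse complement, for genes read off the minus strand of a replicon."""
    return seq.translate(COMPLEMENT)[::-1]


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Note: table 11 also permits alternative start codons (for example GTG,
    TTG), which encode Met at initiation; that case is not handled here.
    """
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the Gene sequence field above.
prefix = clean_sequence("ATGACGTCCT TCGTCGCCAC CTACGAATCA")
print(translate(prefix))  # MTSFVATYES
```

Applying `clean_sequence` and `translate` to the full Gene sequence field should reproduce the Protein sequence field, and `gc_content` can be compared against the reported GC content.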