Gene P9303_00171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_00171 
Symbol 
ID4776065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp20686 
End bp21966 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content49% 
IMG OID640085516 
Productpili biogenesis protein 
Protein accessionYP_001016039 
Protein GI124021732 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1459] Type II secretory pathway, component PulF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTCCT TCGTCGCCAC CTACGAATCA GCAAGTGGTC AAACACGCAC AATGACCATC 
AAAGCGGCCG ATCTAACAAC CGCAAAAAAG TTACTACGTC GTCGTGGAAT ACGAGCAACT
GACCTAAAAG CAGCTCTCAA CAATAAAGGA CAAGAAAGTA GCAAAAAAAT TGACCGCCAA
AAGAACACAA AATCTTCAAA CCTTGGACTA TTTTCCATCG ATCTAGCTGC CGCTTTCGAA
AAGTCCCCAG GTGTAAAAGA CAAAGCAGTT TTCGCAAGCA AATTGGCAAC TCTGGTGGAT
GCAGGCGTTC CGATCGTGCG CAGTCTCGAC CTAATGGCTA ATCAGCAGCG CTTGCCTATG
TTCAAACGTG CACTGATGAA GGTGAGTCTT GATGTAAATG AAGGCAGTGC CATGGGCACT
GCTATGAGAA TGTGGCCAAA GGTATTCGAC CAACTCAGTA TCGCGATGGT GGAAGCCGGA
GAGGCTGGTG GTGTTTTGGA TGAATCTCTC AAGAGACTCG CCAAGCTATT AGAAGACAAT
GCCCGACTCA AGAACCAAAT CAAAGGGGCT CTTGGTTACC CAATCACCGT GCTGGTGATA
GCCATCCTCG TCTTCCTAGG CATGACAATC TTCTTGATCC CGACCTTTGC AGAAATCTTT
GAAGATTTAG GTGCCGAGCT GCCCCTGTTC ACCCAGTTCA TGGTCGACCT AAGCAAATTA
TTGCGTTCCT CCTTTTCCCT GCTATTAACA GGTGTCCTAC TGGTGTGCGC TTGGATTTTT
AACCGCTACT ACTCCACTCA CCAAGGACGC CGTCAGATCG ATCGACTGAA GCTGAGAATC
CCCCTATTCG GCAATCTGAT CATCAAAACT GCCACCGCTC AATTCTGCAG AATCTTCAGC
TCATTGATCA GAGCAGGCGT ACCAATCCTG ATGTCACTAG AAATTGCCAG CGAAACCGCT
GGCAATGCGA TCATTTCCGA CGCCATTCTT GAATCACGCA CCCTTGTGCA GGAAGGGGTA
CTCCTTAGTG CTGCCTTGAT TCGCCAAAAA GTACTTCCAG ACATGGCCTT AAACATGCTG
GCCATTGGCG AGGAAACCGG AGAGATGGAT CAAATGCTCA GCAAGGTGGC TGATTTTTAC
GAGGATGAGG TCTCTACCTC GGTGAAGGCT CTTACCTCAA TGCTTGAACC AGCGATGATC
GTTGTAGTGG GTGTCATCGT GGGCTCCATC CTGCTAGCGA TGTATCTCCC CATGTTCACC
GTGTTTGATC AGATCCAGTA G
 
Protein sequence
MTSFVATYES ASGQTRTMTI KAADLTTAKK LLRRRGIRAT DLKAALNNKG QESSKKIDRQ 
KNTKSSNLGL FSIDLAAAFE KSPGVKDKAV FASKLATLVD AGVPIVRSLD LMANQQRLPM
FKRALMKVSL DVNEGSAMGT AMRMWPKVFD QLSIAMVEAG EAGGVLDESL KRLAKLLEDN
ARLKNQIKGA LGYPITVLVI AILVFLGMTI FLIPTFAEIF EDLGAELPLF TQFMVDLSKL
LRSSFSLLLT GVLLVCAWIF NRYYSTHQGR RQIDRLKLRI PLFGNLIIKT ATAQFCRIFS
SLIRAGVPIL MSLEIASETA GNAIISDAIL ESRTLVQEGV LLSAALIRQK VLPDMALNML
AIGEETGEMD QMLSKVADFY EDEVSTSVKA LTSMLEPAMI VVVGVIVGSI LLAMYLPMFT
VFDQIQ