Gene P9303_19131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_19131 
Symbol 
ID4776461 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1678311 
End bp1681121 
Gene Length2811 bp 
Protein Length936 aa 
Translation table11 
GC content45% 
IMG OID640087422 
Producthypothetical protein 
Protein accessionYP_001017920 
Protein GI124023613 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.105709 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGACG TGGCGGCCAT CAAAGAACTC TCAACCGCTG GCCGCCATCA GGAATGCCTA 
CAAGCTTGCC AAAATTCTCT AAAGGAAAAG CCTGAAGAAA TATTTGCTTT TAAGTATGCA
GGGAAATCAC TACTGGCTCT AGGGCAATTC GAGAAGGCAC AGCAATCCCT AAAGAAAGCT
CATCAGCTTG ACAAGACCGA TCCAGAAATC GCCAAAGACA TCGGCAACAC CTTTTTAAAT
CTTGGCAACC AAGAGAATGC CCTTCAATGG TACGAGAAAG CGCTAGAAAT CAATAACAAG
TATGCTCCTG CTTTCAACAA TATTGCCAAC TTAAAGAGGC AGATCGGAAA TCACCAAGAA
GCAATCGATC TATTCAAGCG AGCCATACAA GCAGATCCAA AACTGATCCA GGCATATATC
GGAGCAGCCG CAAGCCATCT GGCACTTGGC AACCTGGATC AAGCCGAATC ATTCGCAACA
CAAGCATTAA AGATCAACGC AAATGCTCCA GAAGTCAACG AAATCCTTGG CATCGTCTTT
CAGAACAAAC AAAATCCCGA CCAAGCAATT GAGCACTATC AAAAAGAATT AAACATTAAT
CCACAAGCAC GCAATTCGCT TCTTAATTCA GGGCTGCTAC TCCTACAGAA TAGGCAGCCA
ACAGCAGCAC TTGAATCACT GGCAAAAGCA TCAGCTATTG CGCCAAGTGA ACAATGCTCA
CTTCTTTTAG CGCAAACCCA TCAAAAACTC GGACAACTCA AAGAAGCCAT TATCGAATAC
CAGAAACTGA ATATCAACCA ATCAAATAAT AAGCTCATCC CATTCAACCT TGGACTCTGC
TTGTTCGAAA TTGGCGACAA CAATCAAGCC ATAGGAGCAT TCCAGATAGC CATTCAACTA
GATGAGACAT TTATTGCCGC ATGGATAAAT ATAGGGACGG CTCTTAAAAG AGAAGGCCGC
AATCAAGAAG CCTTACAAGC AACACAAAAA GTTTTAGAGC TCAAGCCTGA TAACCCTGAT
GCACTAATGA ACCTGGGCGG AATCTACCAA GATCTCGGCA AGCTCGATCT AGCTCTTGCT
TCCACCCTTA AATCTCTAGA GCTCAAGCCT GATAACCCCA CTGCCCACAT GAACCTGGGC
GGAATCTACA AAGATCTCGC CAAGCTCGAT CTAGCTCTTG CTTCCACCCT TAAATCCCTA
GAGCTCAAGT CTGATAACCC CAATGCCCTA ATCAACCTGG GCGGAATCTA CAAAGATCTC
GCCAAGCTCG ATCTAGCTCT TGCTTCCACC CTTAAATCCC TAGAGCTCAA GCCTAATAAC
CCCGATGCCC TAATGAACCT GGGCGGAATC TACCAAGATC TCGGTGAGCT CGATCCAGCT
CTTGCTTCCA CCCTTAAATC CCTAGAGCTC AAGCCTGATA ACCCCGATGC CCTAATGAAC
CTGGGCGGGA TCTACCAAGA TCTCGCCAAG CTCGATCTAG CTCTTGCCTC CACCCTTAAA
TCCCTAGAGC TCAATCCTGA CAACCCCACT GCCCACATGA ACCTGGGCGG GATCTACCAA
GATCTCGGCA AGCTCGATCT AGCTCTTGCC TCCACCCTTA AATCCCTAGA GCTCAAGTCT
GATAACCCCA GTGCCCTAAT GAACCTGGGC GGAATCTACA AAGATCTCGG TGAGCTCGAT
CTAGCTCTTG CTTCCACCCT TAAATCCCTA GAGTTCAATC CTGATAACCC CAGTGCCCTA
ATGAACCTGG GCGGAATCTA CCAAGATCTC GGCAAGCTCG ATCTAGCTCT TGCTTCCACC
CTTAAATCCC TAGAGTTCAA TCCTGATAAC CCCGATGCCC TCATGAACCT GGGCGGGATC
TACAAAGACC TCGGCGAGCT CGATCTAGCT CTTGCTTGTC TCCAAGAGGC TAAAAAAAGC
GAAAAAGCAA AAGAAGAGTC TTCAATAAGA TTAGCCGAAA CTTATTACTG CATGGGCCTT
TATCGTGAAG GAGTAAATGC AATAAATGGT CTGCATAGTA AATCAGCGCT GAATCTTCTT
CTTTCCCTCT ACCTCTGCTT AGGAGAAAAG GCAAAACTTA ATCGGTGCGC AAACAATCTA
ACAAGCAAGC GTTGGCTCAA TCAGCAAGGC ATCGCAGCGA TAGATCATGC CAATGCTTTG
TACAATCAAA GCCTAGACAA CGGGCTTATC GAAAGCACTC TCGATTCCAT CTTTATTCAG
ACAATCAATA AGCAGGAATT CTCAAATTCA TTGATCGAAG AAATACTGAT GTACCTTTCA
AGCGGGTCAC TTCAATCCAG AAAGCAAGGA CTTCTAACCA ATGGGTCACA AACGTCTGGG
AACATACTTG ACATCCCCAA AAAGCCATTC CAAGAGCTAA AAAAACTACT AATAAGAAAG
ATCGACAACT ATAATCAATC AATCAAAGAC AACACCGACA AAGATTTCAA GGCTAACTTG
GAAAAAAATT GGTACATCCT GAAAGGGTGG GCAATCATCA TGGAAAAAGG CGGAAGCCTT
AAATCTCACA ACCACGAGAG AGGATGGCTA ACCGGAACCT TCTATTTGCA AATACCTGAA
AGAGAGGGCA ATTCAGAAGA AGGTGCTATT GAGTTTAGCC ATCAAGGCCC AAAGTACCCT
AAAGGAGATT CGCCATTCGA AAGAAGAGTA ATTCGACCAG AACCTAGGGA CTTAAATATT
TTCTCCTCAT CGCTTTTTCA CAGAACCATG CCCTTCCAAT CTACTACGCA AAGAATTTGC
ATAGCATTCG ACGTCACTAG AAATGAAAAG CTGTGGCACG TCTCCAATTA G
 
Protein sequence
MADVAAIKEL STAGRHQECL QACQNSLKEK PEEIFAFKYA GKSLLALGQF EKAQQSLKKA 
HQLDKTDPEI AKDIGNTFLN LGNQENALQW YEKALEINNK YAPAFNNIAN LKRQIGNHQE
AIDLFKRAIQ ADPKLIQAYI GAAASHLALG NLDQAESFAT QALKINANAP EVNEILGIVF
QNKQNPDQAI EHYQKELNIN PQARNSLLNS GLLLLQNRQP TAALESLAKA SAIAPSEQCS
LLLAQTHQKL GQLKEAIIEY QKLNINQSNN KLIPFNLGLC LFEIGDNNQA IGAFQIAIQL
DETFIAAWIN IGTALKREGR NQEALQATQK VLELKPDNPD ALMNLGGIYQ DLGKLDLALA
STLKSLELKP DNPTAHMNLG GIYKDLAKLD LALASTLKSL ELKSDNPNAL INLGGIYKDL
AKLDLALAST LKSLELKPNN PDALMNLGGI YQDLGELDPA LASTLKSLEL KPDNPDALMN
LGGIYQDLAK LDLALASTLK SLELNPDNPT AHMNLGGIYQ DLGKLDLALA STLKSLELKS
DNPSALMNLG GIYKDLGELD LALASTLKSL EFNPDNPSAL MNLGGIYQDL GKLDLALAST
LKSLEFNPDN PDALMNLGGI YKDLGELDLA LACLQEAKKS EKAKEESSIR LAETYYCMGL
YREGVNAING LHSKSALNLL LSLYLCLGEK AKLNRCANNL TSKRWLNQQG IAAIDHANAL
YNQSLDNGLI ESTLDSIFIQ TINKQEFSNS LIEEILMYLS SGSLQSRKQG LLTNGSQTSG
NILDIPKKPF QELKKLLIRK IDNYNQSIKD NTDKDFKANL EKNWYILKGW AIIMEKGGSL
KSHNHERGWL TGTFYLQIPE REGNSEEGAI EFSHQGPKYP KGDSPFERRV IRPEPRDLNI
FSSSLFHRTM PFQSTTQRIC IAFDVTRNEK LWHVSN