Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_19131 |
Symbol | |
ID | 4776461 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1678311 |
End bp | 1681121 |
Gene Length | 2811 bp |
Protein Length | 936 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640087422 |
Product | hypothetical protein |
Protein accession | YP_001017920 |
Protein GI | 124023613 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.105709 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGACG TGGCGGCCAT CAAAGAACTC TCAACCGCTG GCCGCCATCA GGAATGCCTA CAAGCTTGCC AAAATTCTCT AAAGGAAAAG CCTGAAGAAA TATTTGCTTT TAAGTATGCA GGGAAATCAC TACTGGCTCT AGGGCAATTC GAGAAGGCAC AGCAATCCCT AAAGAAAGCT CATCAGCTTG ACAAGACCGA TCCAGAAATC GCCAAAGACA TCGGCAACAC CTTTTTAAAT CTTGGCAACC AAGAGAATGC CCTTCAATGG TACGAGAAAG CGCTAGAAAT CAATAACAAG TATGCTCCTG CTTTCAACAA TATTGCCAAC TTAAAGAGGC AGATCGGAAA TCACCAAGAA GCAATCGATC TATTCAAGCG AGCCATACAA GCAGATCCAA AACTGATCCA GGCATATATC GGAGCAGCCG CAAGCCATCT GGCACTTGGC AACCTGGATC AAGCCGAATC ATTCGCAACA CAAGCATTAA AGATCAACGC AAATGCTCCA GAAGTCAACG AAATCCTTGG CATCGTCTTT CAGAACAAAC AAAATCCCGA CCAAGCAATT GAGCACTATC AAAAAGAATT AAACATTAAT CCACAAGCAC GCAATTCGCT TCTTAATTCA GGGCTGCTAC TCCTACAGAA TAGGCAGCCA ACAGCAGCAC TTGAATCACT GGCAAAAGCA TCAGCTATTG CGCCAAGTGA ACAATGCTCA CTTCTTTTAG CGCAAACCCA TCAAAAACTC GGACAACTCA AAGAAGCCAT TATCGAATAC CAGAAACTGA ATATCAACCA ATCAAATAAT AAGCTCATCC CATTCAACCT TGGACTCTGC TTGTTCGAAA TTGGCGACAA CAATCAAGCC ATAGGAGCAT TCCAGATAGC CATTCAACTA GATGAGACAT TTATTGCCGC ATGGATAAAT ATAGGGACGG CTCTTAAAAG AGAAGGCCGC AATCAAGAAG CCTTACAAGC AACACAAAAA GTTTTAGAGC TCAAGCCTGA TAACCCTGAT GCACTAATGA ACCTGGGCGG AATCTACCAA GATCTCGGCA AGCTCGATCT AGCTCTTGCT TCCACCCTTA AATCTCTAGA GCTCAAGCCT GATAACCCCA CTGCCCACAT GAACCTGGGC GGAATCTACA AAGATCTCGC CAAGCTCGAT CTAGCTCTTG CTTCCACCCT TAAATCCCTA GAGCTCAAGT CTGATAACCC CAATGCCCTA ATCAACCTGG GCGGAATCTA CAAAGATCTC GCCAAGCTCG ATCTAGCTCT TGCTTCCACC CTTAAATCCC TAGAGCTCAA GCCTAATAAC CCCGATGCCC TAATGAACCT GGGCGGAATC TACCAAGATC TCGGTGAGCT CGATCCAGCT CTTGCTTCCA CCCTTAAATC CCTAGAGCTC AAGCCTGATA ACCCCGATGC CCTAATGAAC CTGGGCGGGA TCTACCAAGA TCTCGCCAAG CTCGATCTAG CTCTTGCCTC CACCCTTAAA TCCCTAGAGC TCAATCCTGA CAACCCCACT GCCCACATGA ACCTGGGCGG GATCTACCAA GATCTCGGCA AGCTCGATCT AGCTCTTGCC TCCACCCTTA AATCCCTAGA GCTCAAGTCT GATAACCCCA GTGCCCTAAT GAACCTGGGC GGAATCTACA AAGATCTCGG TGAGCTCGAT CTAGCTCTTG CTTCCACCCT TAAATCCCTA GAGTTCAATC CTGATAACCC CAGTGCCCTA ATGAACCTGG GCGGAATCTA CCAAGATCTC GGCAAGCTCG ATCTAGCTCT TGCTTCCACC CTTAAATCCC TAGAGTTCAA TCCTGATAAC CCCGATGCCC TCATGAACCT GGGCGGGATC TACAAAGACC TCGGCGAGCT CGATCTAGCT CTTGCTTGTC TCCAAGAGGC TAAAAAAAGC GAAAAAGCAA AAGAAGAGTC TTCAATAAGA TTAGCCGAAA CTTATTACTG CATGGGCCTT TATCGTGAAG GAGTAAATGC AATAAATGGT CTGCATAGTA AATCAGCGCT GAATCTTCTT CTTTCCCTCT ACCTCTGCTT AGGAGAAAAG GCAAAACTTA ATCGGTGCGC AAACAATCTA ACAAGCAAGC GTTGGCTCAA TCAGCAAGGC ATCGCAGCGA TAGATCATGC CAATGCTTTG TACAATCAAA GCCTAGACAA CGGGCTTATC GAAAGCACTC TCGATTCCAT CTTTATTCAG ACAATCAATA AGCAGGAATT CTCAAATTCA TTGATCGAAG AAATACTGAT GTACCTTTCA AGCGGGTCAC TTCAATCCAG AAAGCAAGGA CTTCTAACCA ATGGGTCACA AACGTCTGGG AACATACTTG ACATCCCCAA AAAGCCATTC CAAGAGCTAA AAAAACTACT AATAAGAAAG ATCGACAACT ATAATCAATC AATCAAAGAC AACACCGACA AAGATTTCAA GGCTAACTTG GAAAAAAATT GGTACATCCT GAAAGGGTGG GCAATCATCA TGGAAAAAGG CGGAAGCCTT AAATCTCACA ACCACGAGAG AGGATGGCTA ACCGGAACCT TCTATTTGCA AATACCTGAA AGAGAGGGCA ATTCAGAAGA AGGTGCTATT GAGTTTAGCC ATCAAGGCCC AAAGTACCCT AAAGGAGATT CGCCATTCGA AAGAAGAGTA ATTCGACCAG AACCTAGGGA CTTAAATATT TTCTCCTCAT CGCTTTTTCA CAGAACCATG CCCTTCCAAT CTACTACGCA AAGAATTTGC ATAGCATTCG ACGTCACTAG AAATGAAAAG CTGTGGCACG TCTCCAATTA G
|
Protein sequence | MADVAAIKEL STAGRHQECL QACQNSLKEK PEEIFAFKYA GKSLLALGQF EKAQQSLKKA HQLDKTDPEI AKDIGNTFLN LGNQENALQW YEKALEINNK YAPAFNNIAN LKRQIGNHQE AIDLFKRAIQ ADPKLIQAYI GAAASHLALG NLDQAESFAT QALKINANAP EVNEILGIVF QNKQNPDQAI EHYQKELNIN PQARNSLLNS GLLLLQNRQP TAALESLAKA SAIAPSEQCS LLLAQTHQKL GQLKEAIIEY QKLNINQSNN KLIPFNLGLC LFEIGDNNQA IGAFQIAIQL DETFIAAWIN IGTALKREGR NQEALQATQK VLELKPDNPD ALMNLGGIYQ DLGKLDLALA STLKSLELKP DNPTAHMNLG GIYKDLAKLD LALASTLKSL ELKSDNPNAL INLGGIYKDL AKLDLALAST LKSLELKPNN PDALMNLGGI YQDLGELDPA LASTLKSLEL KPDNPDALMN LGGIYQDLAK LDLALASTLK SLELNPDNPT AHMNLGGIYQ DLGKLDLALA STLKSLELKS DNPSALMNLG GIYKDLGELD LALASTLKSL EFNPDNPSAL MNLGGIYQDL GKLDLALAST LKSLEFNPDN PDALMNLGGI YKDLGELDLA LACLQEAKKS EKAKEESSIR LAETYYCMGL YREGVNAING LHSKSALNLL LSLYLCLGEK AKLNRCANNL TSKRWLNQQG IAAIDHANAL YNQSLDNGLI ESTLDSIFIQ TINKQEFSNS LIEEILMYLS SGSLQSRKQG LLTNGSQTSG NILDIPKKPF QELKKLLIRK IDNYNQSIKD NTDKDFKANL EKNWYILKGW AIIMEKGGSL KSHNHERGWL TGTFYLQIPE REGNSEEGAI EFSHQGPKYP KGDSPFERRV IRPEPRDLNI FSSSLFHRTM PFQSTTQRIC IAFDVTRNEK LWHVSN
|
| |