Gene Information |
Locus tag | PHATRDRAFT_44863 |
Symbol | |
ID | 7199795 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 470670 |
End bp | 473540 |
Gene Length | 2871 bp |
Protein Length | 956 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178782 |
Protein GI | 219115974 |
COG category | |
COG ID | |
TIGRFAM ID | |
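The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 470670
END_BP = 473540
GENE_LENGTH_BP = 2871
PROTEIN_LENGTH_AA = 956


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(START_BP, END_BP)
print(length == GENE_LENGTH_BP)                     # coordinates vs. Gene Length
print(protein_length(length) == PROTEIN_LENGTH_AA)  # CDS length vs. Protein Length
```

Under these assumptions both checks agree with the table: 473540 - 470670 + 1 = 2871 bp, and 2871 / 3 - 1 = 956 aa.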
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGAGTCTT TCCCCCTACG TCGACTCGAA GTCTCGCTCC TTCGCGACGA CGACGATAGC GGAGGAAGCC ATACACGACG AGCCTTCTAT TCTGGACGTC CCGAATCCAC CCCGCAAATA CCGCTGTCGG AATATAAGCT CCCCAAGCTC CCCGCAACCT TCTCCCGTCT CATAGAAGGT CCCCCCATAC AATTCGGTAC GGGAAAACTA GCCCGTCTCA CGCATTCCTC CGTCGTGGGA CAGGGCGGAT CGACCGTCGT TCTCGCAACG GTCGCACACC AGCCACCGGA TACACCCAAT GGGGATCAAA ACAACGCCGC CTCCACTTTT CTCACCGTCG ATTACCGCCA GCGATACCAC GGTGTGGGGA AAATCCCGTC CAACGGAGGA CGGCGGGATA ACGCCGTTCT GTCGACCCCT GAAGTTCTCG CGTCCCGTGC CATCGATCGG GCCCTCCGAC CTCTAATCCG CACGGAGGAC GCTATCGATG GACGTCTGCA CGTTACGTGT TCCGTACAAT CCTACGAAGT ATTCGCATCC GAGGTTGATC TAAGGCAGGA TGGACTACTC GACGAGAAAG ACAACGACGC GACTAACGCT TCACACTCGA CCCATCCCGT CGCGATGGCG TTGAATACGG CGGCAGCGGC ACTACTACAG CAAGGGCACT TGACCCAACC CGTCGCCGCT ACGGTCCTGG CCGTAGCCGC GCACGGCGAC GGTGTGGTCT GGCAAGACCC CGTGCCGGAA CAATTGGCCG ACTCGTACGG GGAACTCCTG TACGCGGGTA CCGCGGACGG ACACGTCGTC ATGTTGGAAT GGACTTCGCA CCGACTGGCG ACCGGGTTAT CGGAATCGCA CATGACACAA CTTTTGGAAC TCGCACAAAC CTCCATCCGA CCCGTTCTCG ATCTTTTGAC GGCAAGGTTT GGGGACGTTC CTCACAACAA TAACTCCTCC AGTATCGGCT TATCCGCTAC GGACCGGTGG CTCGAAGAAA ATCAACTGCG ACAGAGTTTG GGATTGGATC CCATTCCGCA AGACGAAGAA GCAATAAATT TGCCTAGCGT GGTCGCGTCC AATACCACTC AGGCGCGGTT GCCCGACATT ATTCAACACG TGCAAACTTG TATAGGTCAC GTGTTGCCTC GCCTTTTTGG ATGGGACGCA ACCTTGCCGG CTGACACCGA AACCGACCGA AACAACGAGA AAAACAGAAC GGTGGTGGAC CAGTCGATGG GAGCGGCTAT TCATCGAGGG GACTTGCCTT CCAAAACCGT CCGCGGTCGT CGAGAACAAA TTGTACAAAC CGAAATAGCT CGTCTCGTGG AATCTTATGT AACCGCCGCC GGCGATGCAT TCGAACCCAG AGAATTGCAG TGGCTTTCGG AACAAACCAC GAAACACTTA CTCCAGAAAG GCTTCGTCCA GTCTGCTCTC CAAGGTGGCG CTCGTGCGGA TGGACGTGGG TTGCCGGGAC ACGGCTGGAA GACGATCCGT CCCCTCAAGG TACAAATGCC AGCACTGCCC GACACCGTGC ATGGTTCGGC CCTCTTTGCT CGAGGCGATA CTCAAGTTCT CTGCACCGCC ACCTTGGGCC CGCCCGGCGA TGGGCAACCC ATGAAAGACC CTTACCTGGC TACGGACAAT CCCAGACGCG TCAAATCGGG AGCCGAAACG ACGGCGGCGG GCTACTACAG TGAGCTCCCG GTGGGATCTT TGCGCTTTCT GCGCACCCAG GAAAGTTTGG TTTCGGACAT GAACTCCCGC AAAGTTAAGG CCGACAAGGA ACGTACCGGA 
GACTCGGGTA GTCTCGCGGA AGTCAAACGA GCCTTTTTAC AGTATGATTT CCCGCCCTAC GCCACCGGGA CAGTTCCCAT CGGAAGTCAG GCACACAATC GGAGAGCCAT CGGACACGGC GCGTTGGCCG AAAAGGCCTT GTTGCCTGTT TTGCCCGACG CGCACGACTT TCCTTACGCC ATTCGGATTT CAAGTGAAGT GACTGACTCG AACGGCTCGA GCAGTATGGC CTCTGTGTGC GGTGGGACCT TGGCTTTGTT GGATGCCGGG GTGCCGATTC AAATGCCCGT CGCCGGTGTG AGCGTGGGAC TGGCGCGAGA CGTGGATGCG GGCGAGGCCG GTGTGCACCG ACTGCTGTTG GACATAACCG GAACGGAGGA CTACTATGGA GGGATGGATT TCAAAATTGC TGGTACACGG AAAGGTATTA CGGCGTTTCA ACTCGACGTT AAGCAACTTT TGCCGCTAGA AATTGTCACC GAAGCTTTGC AACTAGCTTG TCGCGGAAGA AACGTTATTC TGAACGAGAT GCAGGATCAG TGCTCTGACG GTTTGAAGGC TCGGCCGTCG CCCAAGGATA GCGCACCACG AGTGGAAGTC GTCCGCTTTA ATCCACAGCG CAAACGAGAT TTGGTAGGAC CAGGAGGTAT CATTTTGCGG CAACTGGAAG ATCGCTACGG TGTGGCTTTG GACTTGACGC AAGAAGGTCG TTGTCTTTTG TTTGGGGCCG ACCAAGAATT GGTCGATCAA GCCAAACTTA CCGTCATGGA CCTGGTTGCC GACGTGGTAC CCGGAGAAGT CTACGAAGGA ACCATTATTG AAATTAAGGA TTTTGGTGCA ATTGTGGAAC TTTTACGCAA TAAAGAAGGG CTCCTGCACG TGAGTGAATT GACCAACGAA CAAGAAGCTG GCGATCACCC TGGTGGCATT GCCGGATTTG TTCGGCAGTA TCTCAAGGAG GGTCAGAAGG TGGAAGTGCT GTGTACAGAC GTCGATCCCG TACAAGGAAG CATCAGACTT AGTCGCAAAG CAATTTTGGT TAAAGAACAG AAGAAAGCAA TGCGTCGTTG A
Protein sequence | MESFPLRRLE VSLLRDDDDS GGSHTRRAFY SGRPESTPQI PLSEYKLPKL PATFSRLIEG PPIQFGTGKL ARLTHSSVVG QGGSTVVLAT VAHQPPDTPN GDQNNAASTF LTVDYRQRYH GVGKIPSNGG RRDNAVLSTP EVLASRAIDR ALRPLIRTED AIDGRLHVTC SVQSYEVFAS EVDLRQDGLL DEKDNDATNA SHSTHPVAMA LNTAAAALLQ QGHLTQPVAA TVLAVAAHGD GVVWQDPVPE QLADSYGELL YAGTADGHVV MLEWTSHRLA TGLSESHMTQ LLELAQTSIR PVLDLLTARF GDVPHNNNSS SIGLSATDRW LEENQLRQSL GLDPIPQDEE AINLPSVVAS NTTQARLPDI IQHVQTCIGH VLPRLFGWDA TLPADTETDR NNEKNRTVVD QSMGAAIHRG DLPSKTVRGR REQIVQTEIA RLVESYVTAA GDAFEPRELQ WLSEQTTKHL LQKGFVQSAL QGGARADGRG LPGHGWKTIR PLKVQMPALP DTVHGSALFA RGDTQVLCTA TLGPPGDGQP MKDPYLATDN PRRVKSGAET TAAGYYSELP VGSLRFLRTQ ESLVSDMNSR KVKADKERTG DSGSLAEVKR AFLQYDFPPY ATGTVPIGSQ AHNRRAIGHG ALAEKALLPV LPDAHDFPYA IRISSEVTDS NGSSSMASVC GGTLALLDAG VPIQMPVAGV SVGLARDVDA GEAGVHRLLL DITGTEDYYG GMDFKIAGTR KGITAFQLDV KQLLPLEIVT EALQLACRGR NVILNEMQDQ CSDGLKARPS PKDSAPRVEV VRFNPQRKRD LVGPGGIILR QLEDRYGVAL DLTQEGRCLL FGADQELVDQ AKLTVMDLVA DVVPGEVYEG TIIEIKDFGA IVELLRNKEG LLHVSELTNE QEAGDHPGGI AGFVRQYLKE GQKVEVLCTD VDPVQGSIRL SRKAILVKEQ KKAMRR
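The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a listing might be cleaned, translated, and checked for GC content follows. It is an illustration, not the database's own method. The Translation table field in this record is empty, so the standard genetic code (NCBI table 1) is assumed here. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Only the first three blocks of the gene sequence are used as sample input.

```python
from itertools import product

# Standard genetic code (NCBI table 1), ASSUMED because the record leaves
# the Translation table field blank. Codons are enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(blocked: str) -> str:
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
sample = clean("ATGGAGTCTT TCCCCCTACG TCGACTCGAA")
print(translate(sample))  # MESFPLRRLE, the first block of the protein sequence
print(f"{gc_fraction(sample):.1%}")
```

Passing the full gene sequence through `clean` and `gc_fraction` gives a value that can be compared with the GC content field in the table. The 30-base sample is too short to be representative of the whole gene.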