Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49523 |
Symbol | |
ID | 7195855 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | - |
Start bp | 503291 |
End bp | 505362 |
Gene Length | 2072 bp |
Protein Length | 584 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184265 |
Protein GI | 219128111 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.047501 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAACAACAAC ACTTGCGGCT CATAATCCGC TGTACTCATA TTATTCATCT CAAAAACGCC ATGAAGAATA CGTCGGGTTG CTTCCTTGCC AACCCGTCCA GGCGTAGTCG CCTCGTTCGT AGCTACTCGT CGGCGTTACG ATGCGGCGTG CTGGTCTGGC TCGCCGTCGT TCGCGACGCG GTGCTCGCGC AGGAATGTGC CACCGACGGC ACCACGTGCG ATAAGCACTC GCGTTGTCCC GTGTGGAAAG AAGAAGGCGA GTGCATGCGG AACGCCAAGT ACATGAAAGA ACACTGCCCG GTATCCTGTC GGGACATTCA TCAGGAAGCC GTAACTATAG ACTGCATTGA TCTCCACGAA CGCTGTCCCG TCTGGGCTGG CTTGGGAGAA TGCAAGAAGA ACCCCATCGA TATGAACCGG TACTGTTCCA AATCCTGTAA GCAGTGCGAA GACGACAACG ACGGAAAAAA CAACGAAAAG GAGAAACGTG GGGACCCAGA CAACAACAGC AAGACGGCCG ACGACGACGA CGATCCCCAA TGTCAAGATG GGGACAAAAA TTGTCCCTAT TGGGCGAAAA ATGGCGAATG CCAAACCAAC AAGATTTGGA TGACCAGTAA GTTTAGTTCC GAGTACATGT ACTATTCGTA ATAGTACCCT TGTATATGAT AGTCTCACCT CCCAACGAAC ACTTGTACTC GCGCGAAATA ACTCCTTCAC CACCGACTCA CTCCTTGCCT GACGAAAAAT GTTACAGCCA ACTGTCCCAA ATCGTGTCAA ACGTGTGAAG AAATCAAGCC CAAAACACCT CGGCGCGCCT CACAGATGAA ACCGGCCGAA GTCCAAGAGA TTTTGCGAGC GTCGGCCTCC TTTGGCGAGC CACAAACCGC AGAAGGATCC CAAACCTCCG ATACAGTTGA CATTGTTCGA GCTTCGATAG ACTACATGAA TAGTGAGGAC GTCCAGCAAC TACCATCGGC GATTGTGGCG TCCTGTCGCA ATCAGCACCA TCTGTGTTCC TTCTGGGCCT TGATCGGAGA GGTACGGCAC GTTAGCAGCT CGGCGATACG GCTGCCCCGT TACCCTATGA AACTCAACCA CTCTATCCCT TTGTCTTTTT TTTATATATT TTTTATTCAC ACCCCACTGT TTTTTAGTGC GACGCCAACA AATCGTACAT GCGTACCAAC TGTGCCCCCG CCTGTCAAAC CTGCCAACTT ATCGACATCG AAAACCGCTG TCCCCGTCTC GAACACGCCG AACCCGCTCT CGTACCGGGT GATCTTAACA AGCTCTTTGA TCGCATTGTG CGGACGGCTC CCGGCAACCG CACCTTGACC GAGGCCGAAC GACAGGAACT AATTGATCAA AAAATGCATT TGTACACCGC GCACGTGCAT TCTCGTCCCA GTGCGAACCC CGTGGTTGAA GTTAGTACCG TCCTCGACAA ATCGTTGCCA CCATGGGTCA TCACTCTCGA CAACTTCTTG ACGCTCGAGG AATGCACCGA ACTCATCAAC ATTGGACACA AGCACGGCTA CAATCGCTCC AAGGATGTTG GGAAAGTCAA GGTGGATGGC ACCCACGAAG CGGTGCAAAG CACGCGACGT ACTTCCGAAA ACGCCTGGTG TTCCAATCAA AGTGGCTGTC GCGACGAAGC TCTCCCGCAG CTCTTGCACG AGCGCATGGC GACGGTCATG CGCATCCCTG CTCAGAATAG TGAAGATTTT CAGCTTTTAA AGTACGAAAA AGGGCAGTTT TACCGAACGC ACCATGACTT CATTCAGCAC CAGACGAAAC GGCAGTGTGG ACCGCGGATT CTTACTTTCT TTCTGTATTT AAGTGACGTG ACGGCGGGCG GTGGGACCAA CTTTCCTGAT CTCGACATTA CCGTTGAACC CAAAGCCGGT CGCGCATTGC TGTGGCCCAG TGTGTACGAT TCCGATCCCA TGGCCAAGGA CGGACGCATG ATGCATCAGG CGTTGGAGGT GGAAGACGGT GTCAAGTTTG CTGCCAATGG ATGGATTCAC TTGTACGACT ACGTGACGCC CCAAAGCATT GGTTGCACTT GA
|
Protein sequence | MKNTSGCFLA NPSRRSRLVR SYSSALRCGV LVWLAVVRDA VLAQECATDG TTCDKHSRCP VWKEEGECMR NAKYMKEHCP VSCRDIHQEA VTIDCIDLHE RCPVWAGLGE CKKNPIDMNR YCSKSCKQCE DDNDGKNNEK EKRGDPDNNS KTADDDDDPQ CQDGDKNCPY WAKNGECQTN KIWMTTNCPK SCQTCEEIKP KTPRRASQMK PAEVQEILRA SASFGEPQTA EGSQTSDTVD IVRASIDYMN SEDVQQLPSA IVASCRNQHH LCSFWALIGE CDANKSYMRT NCAPACQTCQ LIDIENRCPR LEHAEPALVP GDLNKLFDRI VRTAPGNRTL TEAERQELID QKMHLYTAHV HSRPSANPVV EVSTVLDKSL PPWVITLDNF LTLEECTELI NIGHKHGYNR SKDVGKVKVD GTHEAVQSTR RTSENAWCSN QSGCRDEALP QLLHERMATV MRIPAQNSED FQLLKYEKGQ FYRTHHDFIQ HQTKRQCGPR ILTFFLYLSD VTAGGGTNFP DLDITVEPKA GRALLWPSVY DSDPMAKDGR MMHQALEVED GVKFAANGWI HLYDYVTPQS IGCT
|
| |