Gene Information |
Locus tag | PHATRDRAFT_42481 |
Symbol | |
ID | 7196047 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 180837 |
End bp | 183098 |
Gene Length | 2262 bp |
Protein Length | 332 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176537 |
Protein GI | 219109565 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00503465 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTTTGCAGTC TATTTATATA GTTATTCTTG TTGATATTCA ATGCCATTCT CGCGCATCAG TGTCTCTAGC TCACGCTTTA CAAAAAGAGC CTCTTCCATT CGACCCTCTC GTAGCTTGTC GTTCAGATCT ATACGCAATC CGTCCAGTCG ATCCATCAAT TCCTGCTCGG CCTTGAGGAC GGCCTCGTCC TGGCCAGCGA CTTGGACGAA ATTCAGTCGT GGAAGTTTCA ACCACGGCAG CGGATCAGTT CGGGCGAGGG TACGATCCTG ATCCGCCAAC GGTTGGAATC CCGACTTTGA ATACCCGCGT CCGTCCTTCA AAAAGGCACT CAGGCCCTGC TCGCCTCGCA GTTGCTCGTT GATCAGACCT TGTGCGGCGG CAAACAAACC CAGCGTAAGC ACAATTATCA GCGGAACGAA GGGAAGATCG CCGTCCATAA GAAACCAGTC GTCGAATGGG TTTGCGCACA GCAACATTCG TTCAGAAGAG CGTTTGTATG GCCGTAGCAA CCATGTACGC GGTGGTATGA CATGCGGTCC CACACTGGAC AATGAACGGG TTGGCAGTAA CTGTCGCCCA TCTTTCCCCG AAAGTCTTGT ATGTGGAACC GTGTACACCA GCCAACCGTA ACTTGATTGC AATAGTAAGA AAATAAAACC CAGAGCATTT AGCCTTTGCA TGGTAAACTT TCCGAACTTT CCGAAATGAT CTGCAAAGCT GCTTCGTCGT ACGTGCACGT ATTGTCCGCG CTGAGCGTTC GCGCGCAGAC AGTGCCTCCT CCCAGGCACT TCTGGCTAAC GTAAGCTGCC AATACAATGA AGATCAAAAA TTGCGTCTAG TATTGTTCTA CTGGGGACTG TCAAAATTAT TTTCGTTTCA CACACTTTCC AATTCCTGGA CGTCGTCGAG AGAGGATCAC ACCAACAAAG ATGACGGAAA TTGTAAATTC AATTTTCATT GTATCGCCGA TAGTCGGACG AGCTGCAGAC AGACCTTCAG AAAAACGAGC CCAAAGAAGA TACGCAAATC CCTTTTCGAA CGAAATATGA TTTTCACACG TCAATAGTCA AACGTTGCGT TCGCACTCAG AAGAAGGTGC TGGAACTATG GAACTGGCCA GTAAAGAGAC GAAAGGGATA TTGATCCTGC ATCGGAGGTA TGAGGGCAGT CTGCATTTGC ATATCGTGGG AAAGGCAGAA TTTGCTGACT GCGAACCCGT GGCTTGCATG TATTTTCCAA CTTACTTCTC ACTCGTACGC ATGTGTTTGC TCGTTCTACG CCTCGTAGTG GTGTTAAAAT CAAGACTTCA CAGAAGAATA AGGATCACCC GCCGGTCCTT TTACTCTTGT CCGGATTCCC AGATACCGCC GACACCTGGG ATAGATTCGC GGCTCCTTTC GAATCCAAGT ACCATGTCGT CAAAATGGCA TACCCGGGAA TGGAGGTGCC CATCACCAAG TGGTGGGGAT ACTCGTTCCC AGAAGTTCAG GATGCCCTGC TGAATGTGGT GCAAGGATAC CGCGACATGG GCTGCGAAAA TGTCTACCTG GTGGGACATG ACTGGGGAGC TATCGCGTCC ATCATGTACG CCAACAGGTA TCCTTCGACA ATCACCAAGC TCGTTCTGGA AGATGTGGGA GTTGTCTCGT TGTCTGAAGT TACATTTTCC GAAGCGATGG TTACGCTGAT CTATCAATGG TTCCTCACTT TTTTATTCCT ACTGTCAAGA TTTTTTCCGG GCACTTGGTG GTTTCATTTT TTGGTGAACC ATTTTCCGTG GGCGACACTT GGACCTGATC 
CCAGCCTACG GGTACAAAAA AATCTCACCA AGGTACAACC GTGGCAGTGC CATCCCTATT GGCAGCTGGT GATGTTCGTA CTTCGAGCAA GGAACTTATC CGTGCTGCGA TTTCCTAAGG ACGTGGCTAT CCTCTTTGTG TATGGGAAAC ACAAGAATTG CTGTTTCCAC GGTCAACGAT TTCTAAACAA ACTAGATCAG ATCGCAACTT GTCGTCAGGT TGGCTACGAT ACGGGCCATT GGGTCCACGA AACGGAACAC GAACGGTTTG CAGCTGATGT GCAAGCCTTT TTGGAGTCGT GAACGTCGTG GTCGGTGGTA TCCAGGGCTG TGACTGCTTC ATAGTTGAAA GTGTGTGACG ACAAGACGAA TCCCTCAACA AACTTCATGT TGTTTTTCTG TCACAGGAGC AAGAAAAGAC GTGTATCATA GAAAGAGTAC TGTAAATCAT GTATTGTTTA TT
|
Protein sequence | MICKAASSYV HVLSALSVRA QTVPPPRHFW LTQTLRSHSE EGAGTMELAS KETKGILILH RSGVKIKTSQ KNKDHPPVLL LLSGFPDTAD TWDRFAAPFE SKYHVVKMAY PGMEVPITKW WGYSFPEVQD ALLNVVQGYR DMGCENVYLV GHDWGAIASI MYANRYPSTI TKLVLEDVGV VSLSEVTFSE AMVTLIYQWF LTFLFLLSRF FPGTWWFHFL VNHFPWATLG PDPSLRVQKN LTKVQPWQCH PYWQLVMFVL RARNLSVLRF PKDVAILFVY GKHKNCCFHG QRFLNKLDQI ATCRQVGYDT GHWVHETEHE RFAADVQAFL ES
|
| |
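The derived fields in this record (Gene Length, Protein Length, GC content) can be cross-checked against the coordinates and sequences it lists. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the sequences are stored in space-separated blocks of ten characters as shown above. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def strip_blocks(spaced_sequence: str) -> str:
    """Remove the spaces and line breaks that split a sequence into display blocks."""
    return "".join(spaced_sequence.split())


def residue_count(spaced_sequence: str) -> int:
    """Number of bases or amino acids once block separators are removed."""
    return len(strip_blocks(spaced_sequence))


def gc_fraction(spaced_sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for an empty sequence)."""
    seq = strip_blocks(spaced_sequence).upper()
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Coordinates taken from the record: Start bp 180837, End bp 183098.
print(gene_length(180837, 183098))  # 2262, matching the listed Gene Length
```

Passing the full Gene sequence and Protein sequence fields to `residue_count` and `gc_fraction` would allow the same comparison against the listed Protein Length and GC content values.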