Gene Information |
Locus tag | PHATRDRAFT_43023 |
Symbol | |
ID | 7196829 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1822796 |
End bp | 1824707 |
Gene Length | 1912 bp |
Protein Length | 524 aa |
Translation table | |
GC content | 44% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177380 |
Protein GI | 219111257 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
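The Gene Length value above is consistent with the Start bp and End bp fields if the coordinates are treated as 1-based and inclusive. That convention is an assumption inferred from the numbers, not something the record states. A minimal sketch of the arithmetic follows. The helper names gene_length and gc_percent are hypothetical and are not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string.

    Spaces and line breaks (as in the 10-base blocks of the record)
    are ignored; only A, C, G and T are counted.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Start bp and End bp taken from the record above.
print(gene_length(1822796, 1824707))  # 1912
```

Passing the Gene sequence field to gc_percent gives a value that can be compared with the reported GC content. How the database itself computed or rounded that figure is not stated in the record.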
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000014504 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TACCATGACA TTTCCAGTAG TGCTTATACC ATCGCCTGTT CACGAGGCTT TCGGAAGTAT TCAATGGGAT TCATTTTTAC ATTTAAAGCT TTCTAATCCA GAAAAAGCGC TTTTACAAGC CGGAGAACAG AACGTTATCG AGTAAGGCAT AAAGGGATTT GTCGTACAAT TTGAGAAGAA TATCTGGACG TCTTACACTC TGTTTTTCTG GAACACAGGT ACACGCTGTA CGACAAAGAA GATGCTGCAG CTTATGTGCA GCTCCTGCTT AAAGTTCTCG ATCAAATAAC TAGTCATCGT GGAACATCGT CGCGTCGAAC GTCACTCAAA GTTTCCAAGC TCTCCTTGGC CGAAAGTTTG CCACCCGTTG ACGCGCTGCA GTACCTCGAC TGTGACGGCA TTGGAGTGGC CACACACTAT GTCATCTCGC AACTTTATGA AGTCATCACG ACACTCAAAA GTAACGCTTT CGCTTCACGG ACCAAAAGCA ACAGTGTCGT TGCTAGCAGC AACTTTGTGT CGAAAGCCAC TTTGTCCAGC ATATTCTATC CATCAGGCAT TTTGATTGAC GACTGGCGAC CTCTTTTGCG TATTCTACTG GGTACAAAGA GCGATTCTTA TGTGCACAGT ACGTTACAAC AGATTTAGTG CTTAAAAATT GTTTTTATCC GCTTTAACTC ACCTTGACTT TTTCTCTGAA ATTATAACAC AATCAACGTC AAGGAGGATC AGGTTTCTGT CTGGCGTGTA TCCTCTTGGA AGGATGCACT CTGCAAACAA ACGGCCATCT ATTTTCGCCA ATATCATCAA TTTTGGAATC CTTTGTTTCG TGGATTGTGT CTCGGCTACA GAGTTCTAGC ACACAATCTC TTTCTATGGT TACCTCAAGC TTGACTGTCA TCATCTTGTC AAAAGAAGTT CGGCATCTCT TTGGCAAAGC GGGAGGTGTT GGATATCTGA GTCGGCGCTT ACGCGTTCAT CAAAAATCAA TTGATTCAAA GATTAGTGCT TCCGTGCAAC ATCAATACGA ACTCTCCTAC TGCCTCTGGA TCATGTCGTA CGATTGTGAC ACATCGGTAT CAATGCGAAG CCACTTTCAT AGAGACGGAT CAGTCCAAGC ACTGGTAGAT ATGGTGGCCG CTGCACCTCG CGAGAAGGTT GTGCGGTGTG CACTAGCAAC CCTGCGTAAT TTGGCAACGT GCGCTGCGGA CGAAGCGCCT TTGGAATTGG CAAAGAAAAA CATCAATGGT TCAACATATT TGATTGATAT GATAGGTTGT GGTCTACCCA AGTTGATTGA CCTGATGATG AATTGCCCAA TTGCTGACTT CGAGATTAGC GAAGGTATGC AATGTTCGGT CCCTTCCCCA TTTTCAAAAG CGGAACCGTA TCTAAACAGG TTGATGATTT GTGTCTCCTA TATGACAGAC TTGGACATTC TCCATAAACT TTTGCACGAA ACTTGTCAAG AGCTAACACG TTGGGACGTC TATAAAGTGG AGCTTGATTC CACTAACTTG ACATGGGGGA TAGTCCACAC CGAAAAGTTC TTTCGCGAAA ACGCCCGAAA AATGGAAGGA TCCGATGGAA AGTTTGAAAT GGTGAAGACC CTAATTCAAT TGACTGCATC GGATAGCGAG GATGTCGCCG CAATTGCTTG CTTTGATTTA GGGGAATTTG TTCGTCATTA CCCTAATGGA AGAGACATTG CTCGGCGCCT TGGTGCGCGG GATTTTGTTT TCCCGCTCAT TGAGCACGAA AATCCCAAAC TACAGCATCA AGCATTAACT TGTATCTCAA 
AATTGTTAGT CCAAAATTGG AAGGTGAGCA GTTACATTGA CAGAGATAAA CCTGAATGTG TTCCCCTAAC TCCAGCACCC CTACCTCTTG GATCTTGCAG TCATTAGGAT AA
Protein sequence | MTFPVVLIPS PVHEAFGSIQ WDSFLHLKLS NPEKALLQAG EQNVIEYTLY DKEDAAAYVQ LLLKVLDQIT SHRGTSSRRT SLKVSKLSLA ESLPPVDALQ YLDCDGIGVA THYVISQLYE VITTLKSNAF ASRTKSNSVV ASSNFVSKAT LSSIFYPSGI LIDDWRPLLR ILLGTKSDSY VHRGSGFCLA CILLEGCTLQ TNGHLFSPIS SILESFVSWI VSRLQSSSTQ SLSMVTSSLT VIILSKEVRH LFGKAGGVGY LSRRLRVHQK SIDSKISASV QHQYELSYCL WIMSYDCDTS VSMRSHFHRD GSVQALVDMV AAAPREKVVR CALATLRNLA TCAADEAPLE LAKKNINGST YLIDMIGCGL PKLIDLMMNC PIADFEISED LDILHKLLHE TCQELTRWDV YKVELDSTNL TWGIVHTEKF FRENARKMEG SDGKFEMVKT LIQLTASDSE DVAAIACFDL GEFVRHYPNG RDIARRLGAR DFVFPLIEHE NPKLQHQALT CISKLLVQNW KSLG