Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43024 |
Symbol | |
ID | 7196830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1825350 |
End bp | 1827354 |
Gene Length | 2005 bp |
Protein Length | 576 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176853 |
Protein GI | 219110203 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000321984 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TACGAAGCAA CTTGATTCTT CACATACGAT GGTATCTGAC CGATCTGGAT GCACATGATC TGTAAAGAAT GGAACATTCA TAAAAAGGGA GGCCTTTGGT AGACTTGATT GCTTTCGAGT TTTATTCACC GTCGAAAGAA ATCTTGCTAT CCTGACCATC ATCCCCTCTG GATATGCAGA CTGCATAGAG TTCCGGAATT ATCTTACGCA ACAACCTTCC TCCGGAGCCA AGCTAGTGTC TTTGCTGCTG TTCACTCGAA TCATTTTTGA CATCATGGGA AACGAGGTGT CTTTGCAATG GTTTCGGGGA GATGCGATCA AACTGGAAAC GGAGCTCCGA AAGCCTACTT CAGATACAGA AGACTCATAC CGTTCTTTTG ACGAAAATGA TGACGACGAA GAAGACTCCG ATTATCCGCG ACAAATATCT CTCGATCCTT ACCGATTGAG TCCAATCAAG GACCACGCTC AAACATGTTG GGCGGGATGG ATTTTACCAC CATTGGCACC TCCCTTGCTG CAAAGATGTA AGTCCTTGCC AAAGATGGTT CCCGTTCTCT ATTCGGATGC GAGCCTGGAA AGTATCGAAC CGGTAGACAT GTCGTGGTCG ACCAGTTCTT CAATGTTTCA CAAAACGAGT CAGGCTCAAA ATTTTGCAAA AGAGCATTGC GGTTCTTCCG ACGAAACTTC CACTTCCTCG GGAAGTGAAG CAGAAGAGGG AGCGGAGCCT CTACATCTGC AACTCCATGA CAGCCGAATT TCAGTCTCTA CTGCGTACCC GACACGGTCA CCCGCTTTGG ACCAAGGCTC CTTTTACGAA CGACATGGTT CTCTGGCCAA AGCTCTACAA TCGTACCAGA GTAGTTTGCA AGAGTTGTCA GATTTGAGCC TTCTTGCTGG AACTCATTAC CGTGTGGGGG TAGTGCAATG GAAAAAAGGT GCATATGATG ACAGTTTGCA CAGTTTCAAA CAATCTCTCG CAACCTACCA TCTAAAGGCT TGCGCCGGAT CAGATGAAGA TATTGCACAG GTGTACTTGT CCATTGGACG GGTCTACGCT TCCAAAGGAA AACATCGAAA AGCGAAGAAG TCATTCAAGC TTGCCTTACA ATTGATAGTG CTGGATGAAG TGCTTGATCA TAACTGTGAA GTAAGCAACC GTGTCCAAGT TCTTTACGCA AAGATTTTGT TGGCTTACGG TTCCATCTTG GTTGAAGATT TTGACTACGT GACGGCAACC AACCTCTTTC ACGACGCACT CGCGATTCAG AAGGAAGTGC TGGGTGATAT GCACGTTGAT GTAGCTTCAA CGTTGCTATC GTTTGGTTCC CTCAATGAGA AATTGGAAAA GCTCGAAGAC GCCAACACAT GTTACTGGGA AGCTTTTCAG ATTTACCGAT CCGCGGATTC TTCGGCAGTT GATATGGGTG TCACACTGAC GAGTATCGGT TGGATCTACT ATCAGCAGCA CAACCTGGAA AGTGCGATGC ACGCCTACCA GGATGCTTTG GATTTACTTC TGCCCAAACT TGGCGACGAT CATCGAAACG TAGCTTCGGT ACGGATTCAA ATAGGGATGG TGCATATCCA GAAAAATGAA CTGGATCTGG CCCTGGAGCA GTATAAGGAA GCTCTCCGAG CTCAACGTAT CGCGCTGGGT GACGAGCACA AAGACGTTGC GATTACCTTG AGCTTGATTG GCGCCACATT GGAGAGCCAA GGTCGCTTCA ACAAGGCCAT CGAGTTTGTG GACCGCGCGT TGTGTATTCG TAAAAAAGAT TTTGGTCCTT CACACTTGCA CGTGGGCACA ACGTTGGCCC AACTCGGTGA ACTGTACAAA ATTGCCGATA GACCCGAGGC GGCCGCCCAG TGTCTCAGAG ACGCAGTGAG TGTGTTTCGT GCCAACCAAG TGGATGGCAA GAATCCTTGC CTTGTCCAGT CCAAACGTGC GTTGCGAAAC TTACGCCGCA GCCGCCTTCC TGCTGCTCCC AACGAGACTT TGTAG
|
Protein sequence | MGNEVSLQWF RGDAIKLETE LRKPTSDTED SYRSFDENDD DEEDSDYPRQ ISLDPYRLSP IKDHAQTCWA GWILPPLAPP LLQRCKSLPK MVPVLYSDAS LESIEPVDMS WSTSSSMFHK TSQAQNFAKE HCGSSDETST SSGSEAEEGA EPLHLQLHDS RISVSTAYPT RSPALDQGSF YERHGSLAKA LQSYQSSLQE LSDLSLLAGT HYRVGVVQWK KGAYDDSLHS FKQSLATYHL KACAGSDEDI AQVYLSIGRV YASKGKHRKA KKSFKLALQL IVLDEVLDHN CEVSNRVQVL YAKILLAYGS ILVEDFDYVT ATNLFHDALA IQKEVLGDMH VDVASTLLSF GSLNEKLEKL EDANTCYWEA FQIYRSADSS AVDMGVTLTS IGWIYYQQHN LESAMHAYQD ALDLLLPKLG DDHRNVASVR IQIGMVHIQK NELDLALEQY KEALRAQRIA LGDEHKDVAI TLSLIGATLE SQGRFNKAIE FVDRALCIRK KDFGPSHLHV GTTLAQLGEL YKIADRPEAA AQCLRDAVSV FRANQVDGKN PCLVQSKRAL RNLRRSRLPA APNETL
|
| |