Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47769 |
Symbol | |
ID | 7202929 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 781367 |
End bp | 783193 |
Gene Length | 1827 bp |
Protein Length | 522 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182134 |
Protein GI | 219123648 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAGCTTGCCG AACCAAAGGA AAAACCTGTT GGCAGCGTTA CTCAGACAAG ATGGAATCTA GTGAAAACTT CCCACGTAAC GAAACGATAC TTACGCGGGC AACGATTGTG TTCGCCCTTT GCGCGTCGCT GAACTCCGCC AACTTGGGAT ACGATATTGG CGTGAGCACA GAGGCTGGAC GCCTGATCCA AGATGATTTA CAACTCTCCA GATTTGAACG TGAAATGTTT ACGGGAAGCA TCAACTTTTG GGCAAGTGAG TAGAAAGATA CATGATCCGG CTTTGAGTTG AGGGCAGAGA CAGCGGTTAT CTGTTTGCCG TTTATGCGGC CTCGTATTGT CACAAAAGCT TGTCCAATCG ATCAATACAA CTAACCTGCT CTTTTTAAGT GTTTGGTGCA TTTTTTGCTC ATCATTTTAC TGATACATAC GGCCGAAGGT CAACGTTCAT TCTGGCAGCT GTTGGCTTCA TCGTAGGCGT ACTCCTGCAG TCGTTTTCAA GCACTTTCGA CCTCTTGATG CTGGGTCGAT CGTTTGTCGG TCTTGGGGTA GGAACAGGCC TCGCGGTTGG TAAGTACCGA GCTTATCACT CATCGTCTTG CCAAACGTAA ACAGCTCAAA CCAGCTTACG GATTTCCCAA CAGATCCTCT CTATATTGCC GAAGTCACCC CGCCACACCA CCGCGGGGAA CTTGTGACAT GGTCCGAAAT CGCCAACAAT GTGGGGCTGG TGTTGGGGTT TTCAACTGGT TTTTTTCTAG CATGGTTACC GGATGGTCAG GAATGGCGTC TTATGATTTT GCTTGGTGCT ATTTTGCCAA CTGTCATGAT CGCATTAGTC ATTTTCGTCA TTCCGGAGTC GCCGCGTTGG TTGATCTCTC GGAATCGTGT AGATGAAGCC ACGGAAATTT TGCTACAAAC GTATCCTCCG GGCTCCGATG TGGACTTGGT CGTGGAAGAG ATCAAACAGG CTATTATTCG GGAACGAGTC GCCGAGAATT CCGTGGGTTG GATGGTGTTG CTACACCCGA CACCCGCTAT TCAACGTATG CTTTTGGTTG GAATCGGCAC TGCTGTATCT CAACAAATAG TTGGCATCGA CGCAATTCAG TACTACTTGT TGGATGTTAT CGATGAGTCC GGCATCGAAT CGCGACAAGC GCAAAGTGCA GTACTGGTTA TTCTCGGAAT AGTCAAATTA TCATTTGTCA TTCTCGGCGG GAAGCTTTTT GACACCAAGG GACGACGGCC ACTCTTGTTT ATTTCATTGA TCGGTATGGC TGTTTCTTTG GCTCTAGTAA GTCTCGCCTT CTGGATTGAC ACTGCATGGA GTCAAGGGGT CATCATTTGT GGTCTCGGTC TGTATTTGGC CTTTTTCAGT GTGGGTGTAG GCCCCGGTGC ATGGCTAATC CCGTCCGAAG TGTTCGCTAA CTGTATTCGG GGGAAAGCTA TGAGTGTCGC TGCTTTTTGG AATCGTCTCG GTGCCACGAT CATGGCCAGT ACTTTCCTGT CGATAGCCAA CGGAGTAGGC TGGGCAGGTT TCTTTCTATT GCTGAGTGGT GCGTCTTTGC TAGTCCTGTT TTTTTTGTAC ACATACCTAC CGGAAACCAA AGGCAGGTCT CTGGAAGACA TGTCGGTATA CTTTGCTGAG ATCACCAAAG ACGGTTTCAT TTTAGAAGCC GAGGCAGCGC TATACAAGGA TGACGGAGAT GAGCTTGAAT TGGCCTCGTC ATCATTACAG CACACTTTGC CCCCGTCGTC GTTGGCACAT CGACCTTTCT CTGAAGCCAA TGCTAATCTG CACAGCGAGA AAGACCAGCT TTTGTAA
|
Protein sequence | MESSENFPRN ETILTRATIV FALCASLNSA NLGYDIGVST EAGRLIQDDL QLSRFEREMF TGSINFWAMF GAFFAHHFTD TYGRRSTFIL AAVGFIVGVL LQSFSSTFDL LMLGRSFVGL GVGTGLAVDP LYIAEVTPPH HRGELVTWSE IANNVGLVLG FSTGFFLAWL PDGQEWRLMI LLGAILPTVM IALVIFVIPE SPRWLISRNR VDEATEILLQ TYPPGSDVDL VVEEIKQAII RERVAENSVG WMVLLHPTPA IQRMLLVGIG TAVSQQIVGI DAIQYYLLDV IDESGIESRQ AQSAVLVILG IVKLSFVILG GKLFDTKGRR PLLFISLIGM AVSLALVSLA FWIDTAWSQG VIICGLGLYL AFFSVGVGPG AWLIPSEVFA NCIRGKAMSV AAFWNRLGAT IMASTFLSIA NGVGWAGFFL LLSGASLLVL FFLYTYLPET KGRSLEDMSV YFAEITKDGF ILEAEAALYK DDGDELELAS SSLQHTLPPS SLAHRPFSEA NANLHSEKDQ LL
|
| |