Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_31584 |
Symbol | |
ID | 7195951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 433311 |
End bp | 435167 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176585 |
Protein GI | 219109662 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000066135 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACAC TGATACTTTT GCCTGCCTCC GAGTTTGGTT TTGCGAAAGA TTACGCAGAA GCACCGATTC AGGATGCATC TGCAACGAAC AATCGTGGAG ATTCGCCGCT TGAACAGCTG GAGGAATCGG AGGGAGACCA AGTCTCGGTG CCGTCTTTGT GCCAAGAAGA CGAGGTGGAG CTACAGCCAG CCAGGAACGA AGTTTATGCT GCTGCGGTCG ATGATGGTGA CGATGGCGAT AGCGAGCACT GCGTGATCAG AGATGAGGAC CACCTCGACG TTGCCATGAT GGAGCCGACT ACGCAGTTGC TGGAAGAAGC AAGTTCACCA TTAGCCATGG ACGAGGAGGC TGACGTACCA ATCAGCATCA CGTACTCCAC AGACCCTGGT AGCGACACTG ACTTTGGCGG TTTGTTGAAC CTTGGCAATA CGTGTTACAT GGCTTCGGCA CTACAAATGA TTGCCAGCCT AGAGTCATTC GTGGACGAAT TGAAGACCAA AGTAGAAGAA ATACCGACAG ATTCTCAGTT GCAACGCTGT TTAGTGGACC TATTCGACCA ACTCGCGAGA GGCAAATCAG TTCGGCCTGT CGTATTGAAG GACACGGTCG ATACGCGCTC AAGCTTGTTT GTTGGATATG ATCAACAGGA CGCGCACGAG TTTCTGACGA CCCTTTTGGG CATGCTTGAT GACGAATACA ATATCAAGAC GAAGACGACT CGAAAAGTCA ATGAAAATCT TGTATCATAT GTCAATACAC CAGATGACGC CATGGACGAA GATAATGATG GCGAAATGGA TGAAACAGAG CCAGTGGACA TGAATCTCTC GCTAACCGTA AATGTCCCAA AAGGATCTGC CTTGGTTCCA CATTCCGTGG ATACAATAGA TCCTCCTTCT CTGTTGTACA GCCGCCACGC CTTTTCCGAG TTGGATGTGG ATGAGATTCG CCATTTGTTG CACGGGACTC CTACCAGTCG AGAGGGCATG TTGTTGCCCG CATATACTGC TGCAACTGAA CCACGTTGCA AGCTCGTAGG CGGTCGAATG CATACAACTG ACATCCCTTG GACATCGTAC GAGTCTCATT CCTTTGTTGG TAATCAAATG AAGGACAACG AATGTCTAAA TTCCCATCAT TCACCGCTTC ATGCTTCTAC CGCCGCAACG TCGACAGCTG ACGATGCCGA TCATCTCACC GCGGACGACG ACAAAGTGGT TTCTCCTGTG AATGATTTCT TTACAACTGT GGCTCGTGCG CGGCTTACCT GCGATTCCTG CATGTATACG CGCACTCATC TCGAAACCTT TTTACACTTG TCTCTCGAAA TCGGGAATGA CGTCACCGTT GAGGATAGCT TGCGTCGATT CTTTGCGCCC GAGCCACGCG AACTCAAGTG CGAAAAGTGT TTTTGCGAAA GGGCCACTCA GACTACCGAG ATCGTCAAGC TGCCACGAGC CTTGCTTTTG CATTTTAAAC GCTTTATCGT GGACGTGAGT GATGATTGGG CTTCCGTTTC GTATCGCAAG AATCAGTCAG CCGTAGTCTT CGAAGACACC CTGTCGTTGG ACAAAGACAT GGGTGTGCTT TCGGAATTCT TAGCGACCGA TTATTCATTA CCAACCACAA GCGGAACCTC TTGTAGTTTG GGGAACAGGT ATGGGATTCG CAGCGTGGTA AACCACATTG GGGCGTCGGC GAGTTGTGGG CATTATACGG CGGATGCGTA TCGAAGGAAA GATGAGACGC GGAAGTGGAT GCGCTTCAAT GACGCCTTTG TTTCAAGTAT ATCGGAGAAG CAGGCTTTGT TAGATTCACA AAAGACGGCC TACATGGTAT TGTACGAGTT GGAATAG
|
Protein sequence | MATLILLPAS EFGFAKDYAE APIQDASATN NRGDSPLEQL EESEGDQVSV PSLCQEDEVE LQPARNEVYA AAVDDGDDGD SEHCVIRDED HLDVAMMEPT TQLLEEASSP LAMDEEADVP ISITYSTDPG SDTDFGGLLN LGNTCYMASA LQMIASLESF VDELKTKVEE IPTDSQLQRC LVDLFDQLAR GKSVRPVVLK DTVDTRSSLF VGYDQQDAHE FLTTLLGMLD DEYNIKTKTT RKVNENLVSY VNTPDDAMDE DNDGEMDETE PVDMNLSLTV NVPKGSALVP HSVDTIDPPS LLYSRHAFSE LDVDEIRHLL HGTPTSREGM LLPAYTAATE PRCKLVGGRM HTTDIPWTSY ESHSFVGNQM KDNECLNSHH SPLHASTAAT STADDADHLT ADDDKVVSPV NDFFTTVARA RLTCDSCMYT RTHLETFLHL SLEIGNDVTV EDSLRRFFAP EPRELKCEKC FCERATQTTE IVKLPRALLL HFKRFIVDVS DDWASVSYRK NQSAVVFEDT LSLDKDMGVL SEFLATDYSL PTTSGTSCSL GNRYGIRSVV NHIGASASCG HYTADAYRRK DETRKWMRFN DAFVSSISEK QALLDSQKTA YMVLYELE
|
| |