Gene Information |
Locus tag | PHATRDRAFT_40760 |
Symbol | |
ID | 7198538 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | - |
Start bp | 323974 |
End bp | 325933 |
Gene Length | 1960 bp |
Protein Length | 602 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184779 |
Protein GI | 219129192 |
COG category | |
COG ID | |
TIGRFAM ID | |
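The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it matches the listed gene length), and that the coding sequence is three bases per amino acid plus one stop codon. The helper names `gene_length` and `coding_length` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record above.
start_bp, end_bp = 323974, 325933
protein_aa = 602

span = gene_length(start_bp, end_bp)
cds = coding_length(protein_aa)

print(span)        # 1960
print(cds)         # 1809
print(span - cds)  # 151
```

The 151 bp difference between the genomic span and the coding length is consistent with the record marking the gene as spliced, although this check alone does not locate the intron boundaries.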
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00139009 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTGTGTCA AACTGAACCC AAATGAAGGG CAGATGGGAG TCCTTTCGAA GAGATCCAAC GGCATCAGGA GGACAGTTGT ACGACCGGCG GCGGCCACTT GGAGTCGTTC AGAGTTCATT GTCCTCCTCT GCGTGTGGTG GTGGGGTCCT GGAATTTCAT GGGTGAAAGC ACAGAAAAAC CGGGGATGCG ATAAGATCAT TGGTAGCTTT ACTGCCCAAG AAATGAAAGA TGCGGGTCGG CAATTCCTTG TACCCATTGT AGACAGCACG CTCGTCAACA ACACCGGTAA CGTCCTCTTT TTTACTGCTC CATATGACGT TCAGGTAAGG AAGAAAACGC AATTTTTTGG TACTTTTGTC CAGGCGTACG TACTGAGAAT ACGGGGTAGA TCTAACTGAC CGCATCCTCT GCAACTTTTG AAAAACATGT CGTCAGGTCC TTAAAACCTG CAGCTCTTGC TTCGATACGG ACCGCTGGCT TATTTCCATC GATCGAAGCG TGCCAGGTCA GTATCTGCAA TACTGTAGTT CGACCGGGTA TGGCTACGAG CAAACCCAGT CCTCTCTAGT TTTTTTGCCA ATAGATCCAT TGACGGGCCA AGTTCTACAG CGTGTCAAGT TACGGACATT TGTGTCCTAC GGTGGTATGG GTCACGATTT GGGGACAACG CCAACGGTCG AATGGCCAAA TTCCGTACAA GACTTGATCA ATGCGGTCCA GAATGAGACA GTATCGACCG AATCCTACCA ACAATTCATG TCCAAATACT TGCCCGGAAT GCTGGCAGCC AGTAGTGGTT CCATTGGGTT GTTTCCGGAC TACTTTGGTT ACGGTGAAAG CAACAACATG AACCGTGCCT TGTACTGGCC CCGGATGCAC GAGCAGACTA CCGTGGTGAG CTACTTGAAG TTACAGCGCT ACATAGCGAG TCTGACGGGA GGATGCACCT TACTTGATGA TGCTGTGACC ATTCGCGGGG TTGAAGAGGG CGCCGCTGTT GCCATAGTTG CTTCGGATAG CTTGCGACGT TTTGGCTCTG AATCCTTAAC GGTCTTTGCC GGGGCTGGAC CATTGGACAC AGAAACATTA CTGTATGATA CAATACAAGC CTACGAGGAC GACAATGCGA CTCCCCAACT TGATCTTATC CTTGCATTCA TGGCGTTTTC GTTTTCGGCG GAGATCGCTG GAATGAACAA TACCGGTTTT GCCCAGCCTC TCGCAGCAGA CGAGTATAGC GCAGCGGTGG CGAGTACCGA TCCTGACCAG GTTTTGCTAA ATTGGTTTGC TGCACCTGAT CCGCCCAGCT CAAGTGTCGT GGCATCGCGT GTCCCTATCA ACAGTATCGA CATTGTGCGC CAGTCGGTAC GAGAACTCTT TGTGCAAGCT CGGGCCTCGG GAGAATCGTC CCCCTGTCGA ACCTTGGCTT CTGGGGCTGA AAATGGTAAA CTATGTGAAG CCATTGTCGA GGCCTCATCC TGGCGGCTTT TGGAAGGAGA AACTCGCGAT GTCATCTATC CCTATCAATT GTGTTTCGGT CGAAACGACG AAATCTTTAC GAGCAATCAT TACCCCTCCA GAATATTTGC AAACAGTTTG GTCAATTTTT ATCGTGGACC AATCGGATTT CCCGACTTGG CCCCAGCGGG CGACCACGAT CGCGTTCGTC AACTTTGTTC GCTGGATCCG ATTTTATTTT TTAATTTGGA AGGGCACGCA CCGCAGGACG AAGAAAATCG GCCGAACTAC AGGACGCCGC TTTCGGCCGA AGAAATGCTA GTGTGCGAGT CCGGTCCGAG 
CGGATCCACA ATGGCACCAA TGCTTTCGCA GCCAACAGAT TCTCCCTATC CCACGTCCGA GCCAATCGCA GCGTCGTCTA TCGAGCCAAC GCGGATCCTT ACGTCAAGAA TAGGAGTGTT GGTACTTGTT GCTTATTTGT GTGGGGGTGA ACTGATCTGA
|
Protein sequence | MCVKLNPNEG QMGVLSKRSN GIRRTVVRPA AATWSRSEFI VLLCVWWWGP GISWVKAQKN RGCDKIIGSF TAQEMKDAGR QFLVPIVDST LVNNTGNVLF FTAPYDVQLL LRYGPLAYFH RSKRASSTGY GYEQTQSSLV FLPIDPLTGQ VLQRVKLRTF VSYGGMGHDL GTTPTVEWPN SVQDLINAVQ NETVSTESYQ QFMSKYLPGM LAASSGSIGL FPDYFGYGES NNMNRALYWP RMHEQTTVVS YLKLQRYIAS LTGGCTLLDD AVTIRGVEEG AAVAIVASDS LRRFGSESLT VFAGAGPLDT ETLLYDTIQA YEDDNATPQL DLILAFMAFS FSAEIAGMNN TGFAQPLAAD EYSAAVASTD PDQVLLNWFA APDPPSSSVV ASRVPINSID IVRQSVRELF VQARASGESS PCRTLASGAE NGKLCEAIVE ASSWRLLEGE TRDVIYPYQL CFGRNDEIFT SNHYPSRIFA NSLVNFYRGP IGFPDLAPAG DHDRVRQLCS LDPILFFNLE GHAPQDEENR PNYRTPLSAE EMLVCESGPS GSTMAPMLSQ PTDSPYPTSE PIAASSIEPT RILTSRIGVL VLVAYLCGGE LI