Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49245 |
Symbol | |
ID | 7195542 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | + |
Start bp | 357103 |
End bp | 360733 |
Gene Length | 3631 bp |
Protein Length | 717 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183860 |
Protein GI | 219127267 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGACGA CGACGGATAC CGAGACCGTG GACGACAGCA CTACCCACAA TAACATTCTC GAGCGAGCCT TTCAAACAGC CCGGTATAAC GAAATCGTCA CCGTCACCCT CGAAGATGGC GTGACGGAAG TCTCGCTTGT GCCGTGCTTG GACGAAACCA ATGGTAACGT CCAGTGGAAT CCTTTGCCGC TCCCCGACAC AAACTCATCC TCCGTCGATG ACGATGGCAG TGATAACAAC GAGTCCTTCC GAATAGGCGT GCGGCGACAC CTGCAAACCA AACGGTGGAT ATTCCCCATG CTTAACGATA CGGTACGGAA CGATTTGTAC CAAAAGGCCA TTGACCGGGC CGTAACGTAT CTGTCCGGAT CGCAATCGGA GGACACGATG TGGCACGTTT GGGACGTTGG CACCGGCACG GGGCTGTTGG GGATGATGGC AGCCACGGCA ATCAGGAAAA ATGATCGACC CGATACCGAC GTCCAACGGA GCCGCGATGG GGGCGTCAAA GTGGTCCGTG CGTTCGAAAT GTCCGCCCCC ATGGCCATGG TGGCGCGCCA AACGGTTCGG GACAATCATT TAGCCGATCG CGTACACGTC CACAACGCCC ATTCGGCTCA AATAGCACCA CTCCACGCAA CCCGTGATGC TACTGATCGG ATGCCCGGCG GCGACGATAT CGACGACGTT TCTACGCGCA CTCCGTCCGT ACTATTGTGT GTTTCGGAAT TGCTCGAAGA CGGCTTGCTG GGGGAAGGTT GGCTGCCCGC AATACGGGAC GTCTGGAATC GACATTGGTC ACCACAGTCC CACCACCACT GCCACCATAT GAAGAAAGCA ATTATTATTC CACAACAGGC CCGTGTATAC GCGCAAGCGG TGACGGCGGA CAACGATTGG ATCTCCATGT ACTACCCACC GACCCGTCAA CACGGTAACG CCACGTCCAT GTCATTGACT TTGGATGCAC AGGGCACATC CTTGGTGGAC ATGCCCGTCG TGAGGATTCC CCTACACGCC CGCACCCTGC TGCATACGCC GGTAGACCCC AACGACTGTA CTCGTCGACC ACCCGCTCTT CGAGTGCTCT CGGATCCGTT CAAGGCTCTG GATATTTCGG TACAACGGGA CATAATTCCC GGACCGGAAG GGCAGGCCTG CACTCTTTTG GTACCCGTGA CGCACTCCGG AACCGTGCAC GGGTTTTTGG TGTGGTGGGA ACTCGATTTG TGGACCGCGC ACGACAACGA CGACGATACG TTGACGTATT CTACAAGTCC CCACACCGGG ATGGCTTGGC AGGATCATTG GCATGTTGTT TTGCACGTCT TACGGGATAC TCAAAAAGTG CAAAAGGGCG AGACTATGAC AGTACAGGCT TCGCACGACG ATACGGCAAT TACACTGTTG CCTATAATTT CGGCACCGGC CCCACCACCG TCCAAGCGTA TCCGCACGGA ACCAAACTCC GGCCATCACG CCCTCATCAC ACCTTCACGA GCCTTGCAAC TAAACGATAC TGCCCGGGCC TCCTTTCTCG ACCAGGCCAT CACCCATGCA CTAAGTGTAA AAGGACCGGA CCAACTGGTG TTGGATGTTT CGGACTTTAG TTGGTGCGCC ATTGTGGCGG CTCGTCAGGG CGCCACGCAG GTAGTATCGT TGGAAGCTAG TAGTAGCAAT ACCAGTCTAC CCCACACGAC CGCTCGTGTC GCTCAGCTCG GCAACCAATT GCCGCGTGCC CCCCATGGCC GATTCGAAAT ACTACAGGCC CACGCTGAGC AACTGACGAT CTCGGCCTTG GGTGATGTTC CCGCAGATAT AGTTGTGGCT GAACCCTACT ACGAGCTGTT AGAGAACTGG CATCTGGAGG AAGCTATCAA TTACTACAAC TTAGTTCGAG CCTTGCGGCG CACCAAGCTC ATCACACCCG ACGCGTGTGT CATTCCGTCC ATCTGTCGCG TCATGGGCTG CGCGATCCAG AGTGATCAAC TGCGATCAGC CTACCGAGCC TGTGGCGACG AAAAAGGCAA AATTCACGGT CTCGACCACC AATACGTTAA CGCCATCGGC GCCGATTTCC ACCAGTACAA CTTGAACCTT CCCATGTGGC AATACGAGTA CCAAATGTTG TCCGCACCGA GCGTCCTTGC CACTCTGGAC TACAGGAGTC CCGGTAACGG TCCAGTTCGT GGTGAAGCGC AGATGCCATT CACCGCCAAG GGGCGTTGCG ACGCCTTGCT GATCTGGGTC GAATACTCGG CAGTGGCCGA CTGTACACTT TGCTACACCA CCAACAACCA CTTCCACCAT CAAGCGGTGC GTATGCTGCC CACGTCATGG ATCGTAGACC CGTCGGAAAT GTCCAAAACC GCCTTAATTT GTCAGAGTCA AATCGGTGGT CTCGATCCGT TCAATTGTCA TTCATTTGAG GTACAGATTG CATGAGACAA CGAAACATGC AACCTACATA TTCAACTTCG ACACTCATCG TTCGCTACCT CCACCGGGCA ACTTTCCTTC AACTCGTTCA AGGATATTTC CACGTCGACG TATTCCAAAT TCAGGGCACG GTGCAGAATC GATTTGTACA CCTGAATTTC TGGACACAAT TCACGGCAAA GAAGAGTCGT CGCCGTTTCC GATAACATAC GATCGGCATC GGATTTTTTT CCAGCAGACG AATTGCGCGT GGGAAAGGCA TCCGTGACCG TTTCGTTACC ACCAATTGCG CGTTCGGTTG AATTCCAATC GGATTCCAAA TGCTCGGTCC GGATGATTAA CAGTTTGGCG TCTGACGGCA CTTGATTCAT AAAGTATCCA AAGTTGTAGT AGTTGTGTCG GACCATGGGT CGTATTCCAC GAATGGCCGC GTGCGCCCGA TCTTTACAAA CTTTCGATGC GATGCCGTCG TCCGCCAAGC CCTGTTCGGC CAAATCATTG AGGGTGGGAA AGGGACAATC TAGAAAGAGC GCTTTGCGTT CTTCGTACAT GTAACTGTTC GGATCCAGAT GGATACCCGG ACGTTCGTAG GTAAACCAGG ATTGCATCCT CGCCAGAGGA TTGCGGACGA CCATGAGGTA GTACGCCATG TCGTCGTAAC AGTCGTTGAT GTAATTGTGC AGGACGTTTG TAACGGAGCG CGGCAATCGT CCCCCCGGAT TCGCGATGCT CTCATTCCCT GTTGGTTGCT GGCAGTCGTA CCGGAAACCG AGTAGACAGC TCAGGGTACT TCCTGCGGTC TTGCCCACGT GGACGAAACA TACGCGTTCG TCGGGTGGAC CTCGCGCTTG GAAAGCGGCA ATTTTCGTGG CCCAAACGGG GGGCGGCAAA CCCGATCGAA TCTTGTAGGC CGATTCCGTG GCAGGATCAG CAGCGTCGCG GTACTTGACC GTAACCTTTT TTTCGACTAT CGGCCATACG TCTTGCCAAT CTCTCGTGGG AGTTGGTCTT TCGATCGTGC GAGCTCTCGT TCCAAGAGTC GCACCAGTCG TCAACGAGGA CGATACCAGA CCCAAGACTT GGACTTCCAG CTCCTCCAGA CTCCACCAAA GCCTGCAGGA CGACAGCAGA CACATTAGTA TTCCCATGAT GCACAGGGTC CCAGTGCGGT TTAGAGGGAC CATGGTGTAT T
|
Protein sequence | MATTTDTETV DDSTTHNNIL ERAFQTARYN EIVTVTLEDG VTEVSLVPCL DETNGNVQWN PLPLPDTNSS SVDDDGSDNN ESFRIGVRRH LQTKRWIFPM LNDTVRNDLY QKAIDRAVTY LSGSQSEDTM WHVWDVGTGT GLLGMMAATA IRKNDRPDTD VQRSRDGGVK VVRAFEMSAP MAMVARQTVR DNHLADRVHV HNAHSAQIAP LHATRDATDR MPGGDDIDDV STRTPSVLLC VSELLEDGLL GEGWLPAIRD VWNRHWSPQS HHHCHHMKKA IIIPQQARVY AQAVTADNDW ISMYYPPTRQ HGNATSMSLT LDAQGTSLVD MPVVRIPLHA RTLLHTPVDP NDCTRRPPAL RVLSDPFKAL DISVQRDIIP GPEGQACTLL VPVTHSGTVH GFLVWWELDL WTAHDNDDDT LTYSTSPHTG MAWQDHWHVV LHVLRDTQKV QKGETMTVQA SHDDTAITLL PIISAPAPPP SKRIRTEPNS GHHALITPSR ALQLNDTARA SFLDQAITHA LSVKGPDQLV LDVSDFSWCA IVAARQGATQ VVSLEASSSN TSLPHTTARV AQLGNQLPRA PHGRFEILQA HAEQLTISAL GDVPADIVVA EPYYELLENW HLEEAINYYN LVRALRRTKL ITPDACVIPS ICRVMGCAIQ SDQLRSAYRA CGDEKGKIHG LDHQYVNAIG ADFHQYNLNL PMWQYEYQML SAPSESR
|
| |