Gene Information |
Locus tag | PHATRDRAFT_37556 |
Symbol | |
ID | 7202416 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 400149 |
End bp | 402348 |
Gene Length | 2200 bp |
Protein Length | 703 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181720 |
Protein GI | 219122786 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.388143 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCCTT GGGAGCATTC TCCGCGACAG AAACCTTCCG CTCACTCTTT CTCCCACACC CACGGTACTT GCACTGCGAG TGCTCGCATG GCTACCGCTT CCATCCTCAC GCGATCCACC AGTCCACCCG TCGTGGCTAC TTCTACGCCC GACGAACTCG ACGACGAAGA CCGACCATGC CGTGCGTGTC GTCGCACCGG TACGTTACGG ACTGACTGGC AGCAGGGTGA TCGGGTTTGC ACCCAATGCG GCGTCGTCGA CCAGGGTCAC GTCATCGACA CGCGTCCGGA ATGGCGAGAG TTCCACGACG ACGCCGATCT CGCCAAAGGC CGTCCTTCGC AAGCGCGCTC GGGTTTGACG GTGGTGGACG AATCGCGGTA CCTTGGGGGA CTCCAACCCA CCATGCTCTC CACGAACGTC TACGGCACGG CCTCGTCCAC GTCGCAGCAA ACACGGAAAC GCCTCTTGGT GGCCGCGCAC AAAATGGACC GACTCATGGA ACGATCGCAC GCGCGCGCGT TGGAGGCGGT CCGTGTCTCG CGCGCGGCGA ACCGGAAACG TCGACGAGTG GAAGGACACC CGGTACCGGG AGCGGTGGAT CCGGACGAAG ACGACGCGGA TATCGACGCC ACGATGCGAC CAGAATACGA AGATTTCGTA CAGTTGGAAG AGCAGGAAGC TCAACGGTTG CAAATTGCGT CGTACGGCGA CAAATGGAGC CTGGAACGGG CCATCCGGTT GTACGGATCC GCCTTGGAGC AACAGAACTT GTCGACCGAC GACACAATAG ATGACCGGGG CCATCTGGAT GACGGATTAA AACGTGCGTC TCGAGATCTC TATCAAGCGT ACACATTTCT GTCCACAGCG GTGCAAACGC TGGAGCTCAC GGATCGGGTG CAGCATGAAG TGGTCGGACT GCTGGTTCGG TACGCAAAGT GTCGCGACGG CCTCCAAGTT CGAGGTGTGT CCTCCACGCT ACAAAAGCGC CCCTCCACCA AAGCTACCTC TCCCAACGAG ACGCAGCGGG CGCGTCGGAG TTTACGCGAA TACAATCAGG CCAAGCAAAC CGGTGCGCTT CTGGCCGCGC TCCTCTTTTA TACCGCCCGC AACCTCGGCT GGCCCCGCAC CCTCGTCCAA GTCTGTCACG CGATTCCTTT TCCATCCCAG TCCTTGCCTC ATTTGGATCT CAGATGCGAG GACGGGGAGT TTATTAAGCG AAAACACTGT TCCAAAGCCA TGACGGAAGT AAAACAGGTT TTCCCAGATA TTTGCCGGGT GACGGCCACG TTGCATGCGG TATCGAACGT TTCGTCTGCC AGTAGCAGCA GTACCAATAG CAACAGTAAC AACTACAACA AGAGGAGCGA AATTCCGCAA CCACAGAGAC TTCAAGACCA TGTTTCGGTA ATCAATTTTG TCGATCACGC CATTCGCAAA TTACGTCTAC CACCTGTTGC CGAAGCATGC GTGCGGATCT TGGTATTGCG GTATTGCCAC GGAACAAAAG ACTCTGCTTT GCGATTAGGT GCCATAACGG CTTCCTCAGT GTATTTTGTT ACACAGACGG GAGACATTAT GCAGCGATTG GCCAAGCAGG CTGTGAGCGG TAGCAAACCG TCCTTGGCAT CGAAGCACGA CAAAGTGACG AGAACGAATT CCTCGCCTAC AGGTTTCCAC AGGACAAAGT TGGAACTCGA CGGTTTCACT GCCGCACGAG ATCCGTCGGC TTCCGCAAAC GTCAAGCATG AAGATTTGTT CAGCGCGGAA GCAGTGCAGG AATTTGCCTC GGAGCAAAAG 
GTGTACGAAA TGCGACGGGT GTGGGACGCT TGGTCGGAAC AAACAACCTG GATGCGCAGC TTGGGTGAGA TTGAACGAGC AATGGGAGTT TCGAGACCAA CGCTTGTGGA AGTCTTCAAG AAAGAGATTT TTCCGAAGCG AGTTGAGCTC TTGCAGGCTC TTCAAGATTC TGTCGAGACG AGTGATACAG AGCAGAAGAC TGTTTTGTCC GAGACACCAT TGGCCTCGGT GTTAGTTCCA CACATTGCTG CTGCGGCGCC CTTGCTCAAA GCTTCTAAAT TGTAAAAGTA TGTAATGTAA ACTTTTGTTA CAATCGAGCG GGATCAGTTA CTACATTGAA AAGCAAGAGA AAACAGCGCA GAGGTGGATG GATGTCTCTG TTACTGTCAG CGCTTCTTAA
|
Protein sequence | MSPWEHSPRQ KPSAHSFSHT HGTCTASARM ATASILTRST SPPVVATSTP DELDDEDRPC RACRRTGTLR TDWQQGDRVC TQCGVVDQGH VIDTRPEWRE FHDDADLAKG RPSQARSGLT VVDESRYLGG LQPTMLSTNV YGTASSTSQQ TRKRLLVAAH KMDRLMERSH ARALEAVRVS RAANRKRRRV EGHPVPGAVD PDEDDADIDA TMRPEYEDFV QLEEQEAQRL QIASYGDKWS LERAIRLYGS ALEQQNLSTD DTIDDRGHLD DGLKRASRDL YQAYTFLSTA VQTLELTDRV QHEVVGLLVR YAKCRDGLQV RGVSSTLQKR PSTKATSPNE TQRARRSLRE YNQAKQTGAL LAALLFYTAR NLGWPRTLVQ VCHAIPFPSQ SLPHLDLRCE DGEFIKRKHC SKAMTEVKQV FPDICRVTAT LHAVSNVSSA SSSSTNSNSN NYNKRSEIPQ PQRLQDHVSV INFVDHAIRK LRLPPVAEAC VRILVLRYCH GTKDSALRLG AITASSVYFV TQTGDIMQRL AKQAVSGSKP SLASKHDKVT RTNSSPTGFH RTKLELDGFT AARDPSASAN VKHEDLFSAE AVQEFASEQK VYEMRRVWDA WSEQTTWMRS LGEIERAMGV SRPTLVEVFK KEIFPKRVEL LQALQDSVET SDTEQKTVLS ETPLASVGIS YYIEKQEKTA QRWMDVSVTV SAS
|
| |
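The derived fields in this record can be sanity-checked with a short script. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based inclusive coordinates and that GC content is the percentage of G and C bases over the sequence with the spacing between 10-base blocks removed. The function names gene_length and gc_percent are hypothetical helpers, not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    # The record prints the sequence in space-separated 10-base blocks,
    # so all whitespace is stripped before counting.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the record above.
print(gene_length(400149, 402348))  # 2200

# Only the first 10-base block of the gene sequence, as a small demo.
print(gc_percent("ATGTCCCCTT"))  # 50.0
```

With the coordinates listed above, the inclusive-coordinate assumption reproduces the stated Gene Length of 2200 bp. To compare against the stated GC content, pass the full Gene sequence field to gc_percent.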