Gene Information |
Locus tag | PHATRDRAFT_45234 |
Symbol | |
ID | 7200109 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 561953 |
End bp | 564909 |
Gene Length | 2957 bp |
Protein Length | 935 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179240 |
Protein GI | 219116891 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
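The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch and not part of the source record. The function and variable names are illustrative, and it assumes 1-based inclusive coordinates and that the annotated span includes a terminal stop codon; neither assumption is stated explicitly in the record.

```python
# Values copied from the Gene Information table.
START_BP = 561953
END_BP = 564909
PROTEIN_LENGTH_AA = 935


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein, optionally plus one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_length = span_length(START_BP, END_BP)
cds_length = coding_length(PROTEIN_LENGTH_AA)
# Bases inside the gene span that are not accounted for by the coding sequence.
non_coding = gene_length - cds_length

print(gene_length, cds_length, non_coding)
```

Under these assumptions the span works out to 2957 bp, which matches the listed gene length, and 149 bp are not accounted for by the 935-residue protein plus a stop codon. That is consistent with the gene being marked as spliced.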
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATCTC ATGAGTCTGC AATTGTAGTG ATTGACGATT CGGAGAACGA CGATGGACCT GATAGTCCCA CTTCCAACAC GTCACCAGAA GTGCTTTCAT TCGAGCAACA AGAGGCTATT CACTTGTTTT CCTCCATCAC CGACATCTCC GACACTAAAG CGGTTAAGGA TAGCCTCGAA ATGTTCGATT ACGATGTGGA GAGAACGATT AACAAATGGA TCGAACAGCA GCATTCTTAC CAAAGATCGC CCATCGCCGA CAACTCTGCA GCTTTTCGAC TAATCAACCG AAATGCTTCA CGAATTAACT CGCCGGTAAA ACGAAACTTG TTTACGGGTA TGCCGAAAAA GTTGGATTTA GATACGAAGG TGTCTGTGCA AAAAAGTGCT ATAAAGAGCA TACCTTCATC GTCTTCCTCC AAAGTTCTCT CTCTGTCATG GCACGATTGT GTGGAGCGAG TTCTCCAATC AAAGGAACCC TTTTTTATTG ATGCTGACTT TCCGCCAACG AGCAAATCGC TGGATGGTCG CCAACGCCGA GCCAGTGAAA ACAAGGGGCA ACGAACTTTG TGCGCATGCG GCGTTCCTGC AGCTGCCAAA GTTGTCCAAT CAGATGGACC GAACTACGGC CGTTTTTATC TGTCCTGTGG CAAGCAAACA CAACGGCGAG CGCCGGTATT GGTAGTTCGT AAACAAGACG ATGCACAAAA TAGTGCTGAT TGTCAAGACG AAGCCAAACT TCTGAAAGAT CCACCAGTTA CTGTCAACAA TCCATACGCA AAATCTAAGC CATCAACACC ATCGAAACAT TGTCGCACTA ACCCATCATC GCCAAAGTCT CCGCCGCAAC GGCGGTCGTG CACTTTTTTC AAATGGGATC CAGACGGTTC CATCGGAGCA TCGGGCTACG CTACAAGATA CTCGTTGTTT GTCTGGCAAC ATTTTGGATT GGAGAACAAC TGTTGCTTGT ACCGCACGTC AATCGATCCT TCGCAAGTAC GTCAAGGCGC GGTAGGAAAC TGCTGGTTCC TATCAGCACT GGCAGTCGTA GCGGAAAAGT CGTATTTGGT TCGCCAACTG TTGCCACATG ACAAATTGAA TCCCCAAGGT TGTTACGAAG TCAACCTTTG CTTGGACGGC GCTTGGACAC CAGTTCGGGT AGATTCGACT TTGCCTGTTG TACTGCAAGA TGTGAACAAA ACGACAGGAG GATCTTTGTT ACAATCACTC CGTCATGGTG TTCCGCTAAA TAGCTGTAAA GAATTGGTGG CTACCCCCGC CTTTTGTTCG GCACCAGACC TCCAATTGTG GCCAGCTTTG GTCGAAAAAG CCTACGCAAA AGCTCACGGC TCTTACGCAC AGCTTTCCGG TGGTTTTATT GCGGAAGGAT TGACTGATTT GACGGGTGCT CCAACAGAAA CTATAATCTT TTCGGATTTA ATAGATTTAG ATGAGTTGTG GGCGCGCTTG CTATCTTTTC ACCAAGCTGG TTTTCTCGTC GGTGTCGCCA CTTCTCGAGG GGGTGAAGGC CTTGTTGGTG GCCATGCGTA TAGCTTATTG GATGTAATTG AGATCAACAA CTCACTGATT GGTGAACAAA AGAAAGTGAC TGATTACTTT TCGAGCCCTT CTAAGAAGCA TCGGAAACTT ACTGGCCCAA ACTATGACAC TTGTGCACCT ATTAGACTTG TGCGGATTCG AAACCCGTAA GTTTTCTTTG GTGATATTGG ATTGACTCGA ACAACACCTT AACATTTCCT TTTGGCGTGC GTTTCAGTTG GGGAAAACGG GAGTGGAAAG 
GTGACTGGAG CGTTGATAGT GAACGCTGGA CTCGAGCGCT GCGGAAAAAG ATTGGATCTG ATGCGTTTGC TAGGGGAGAC GGCACATTTT TTATGTCGTT TGAAGATATG TTGCAACGAT TTCATCACAT GGACATTGCC AAAACTCGAG AGGTTCGTGT TCGCAGTGAG TCGCCCTCGT CCTGTCAATT CGTTGCTTTC TGACACCTTG CTGAAATCCA ATTCGCCTAG GGCTGGAAGC ACTCGTGCTC TGATGGTATT TTCCAAAGGA ATGGCGATCC GATCGCATCT TCTAAATACA CTTATGAGAT TATTCCGTCC TGTCGTACTT GGGCATTTGT TTCGTTGGTC CAGAAGAAGA AACGCGCAAA CAGTAACTCT AAGTATTGGT ATTGCGACCC TTCGATGCTA ATTTTAAAGC GGAGGTCGGA TACTGAGGAG TGGACCTGCG AAGCTTCAGT ACTTACGGGT ATTGGGAGAA TGAGCGATTG TGAAATCTTC CTTGACCCTG ATTTCTCGTA CATGTGTGTC TTAGTATCAT GTATCGGTTG CATGGATACA CCGGAATCCT TTGAGTTCCG GCTGTCGACT TACAGTTCGG AAGAAGTAAC CGTGCGGCCC GTTCTGAATG AAAGAATTCT CTGCTTAATG ACCCTTCGAC TTCTTCACAA ATTGCTGCTC AATCGAGGAC ATAAACTCCT GTACCCGGTC GCGCCATTTG GTGTTTTAAG CTGCATCCAT GGTAGCGGCT GCCTATACTT CGTTGCGGTA AACGGTGCTT GCGATGAATT TTTGTCTATT CGATTGACGC TTGATATTCA AGAGGGGATG ATGCTCGTTT ACGGAAAAAG TGGGGATTCA TTTGATATTC CTCCGAGACG CCAGCAAATC TTGGCAATCG TTTCAAGAAA TGGGAAACGT TGTACGTGCA CTCATTTAAG CTTTCGTTAT TTGAGTAGTA CCATTAAGTC CAGCAAAGAT GGATCTCATG TTTCAAAGTT GCAGTACTCG GGCATGAGAG GTAGCGTTGA GCTTGGTCTC GCGGCAGACT TGCTTACGAG CAGTTCCGAC TCCTCACGGG TCTGTATTAG GGGCGGCGAT TCACTCGAGA TTTACCAGTG GATTCCGCAA GTGGGCTCCT GTTTTGATCT AGTCTAG
|
Protein sequence | MASHESAIVV IDDSENDDGP DSPTSNTSPE VLSFEQQEAI HLFSSITDIS DTKAVKDSLE MFDYDVERTI NKWIEQQHSY QRSPIADNSA AFRLINRNAS RINSPVKRNL FTGMPKKLDL DTKVSVQKSA IKSIPSSSSS KVLSLSWHDC VERVLQSKEP FFIDADFPPT SKSLDGRQRR ASENKGQRTL CACGVPAAAK VVQSDGPNYG RFYLSCGKQT QRRAPVLVVR KQDDAQNSAD CQDEAKLLKD PPVTVNNPYA KSKPSTPSKH CRTNPSSPKS PPQRRSCTFF KWDPDGSIGA SGYATRYSLF VWQHFGLENN CCLYRTSIDP SQVRQGAVGN CWFLSALAVV AEKSYLVRQL LPHDKLNPQG CYEVNLCLDG AWTPVRVDST LPVVLQDVNK TTGGSLLQSL RHGVPLNSCK ELVATPAFCS APDLQLWPAL VEKAYAKAHG SYAQLSGGFI AEGLTDLTGA PTETIIFSDL IDLDELWARL LSFHQAGFLV GVATSRGGEG LVGGHAYSLL DVIEINNSLI GEQKKVTDYF SSPSKKHRKL TGPNYDTCAP IRLVRIRNPW GKREWKGDWS VDSERWTRAL RKKIGSDAFA RGDGTFFMSF EDMLQRFHHM DIAKTREGWK HSCSDGIFQR NGDPIASSKY TYEIIPSCRT WAFVSLVQKK KRANSNSKYW YCDPSMLILK RRSDTEEWTC EASVLTGIGR MSDCEIFLDP DFSYMCVLVS CIGCMDTPES FEFRLSTYSS EEVTVRPVLN ERILCLMTLR LLHKLLLNRG HKLLYPVAPF GVLSCIHGSG CLYFVAVNGA CDEFLSIRLT LDIQEGMMLV YGKSGDSFDI PPRRQQILAI VSRNGKRCTC THLSFRYLSS TIKSSKDGSH VSKLQYSGMR GSVELGLAAD LLTSSSDSSR VCIRGGDSLE IYQWIPQVGS CFDLV
|
| |
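The sequences above are displayed in space-separated blocks of ten residues. Below is a minimal sketch of how such a block-formatted nucleotide string could be normalized and its GC fraction computed. The function names are hypothetical, and the record does not state how its 47% GC figure was derived or rounded, so this is an illustration rather than the database's method.

```python
def normalize_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a nucleotide string, ignoring whitespace."""
    seq = normalize_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# The first two ten-base blocks of the gene sequence shown in the record.
first_blocks = "ATGGCATCTC ATGAGTCTGC"
print(normalize_sequence(first_blocks))
print(gc_fraction(first_blocks))
```

Passing the full gene sequence string to `gc_fraction` in the same way would give a value that can be compared with the GC content field.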