Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47709 |
Symbol | |
ID | 7202891 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 595389 |
End bp | 599300 |
Gene Length | 3912 bp |
Protein Length | 1196 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182096 |
Protein GI | 219123571 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAACCAGCTG CTTACGAACT ACTGAATTTC ATAGCCAAAG GTGTTCGTAC CAGGATGACC CCAACAAAAA AATCCTGCGG GTCCCACAAA GTAACAACTC TTCGTCGGAC AATTCGAGAC AGAGCCAACA AAGATGGACA CCAGTAGCTT TAGTGACCCC CTCGACGACG ATTCGATTTC TGTAGACTAC TCTTCGACAG GAACTAACAC CGAAAGCGAC CGTCATGCCA AGTCTGACGA ACGCGACGAA ACCAAGGAAG TGCAAAGGTT GGCTTCTTCA GAGACGGCTC ATATTCGTTT CTGGCGACTT GTGGTTGTGG CTCTACTGCT CATATCAGGA ACAGTGCTTT CTACGTTCAC ATACATTTTC TTAAAGGGTC AAGAGAAAGA CGATTATGGA GATGCGGTAA GATCTATCGG TAGCAGCTGT TGACAGTCCA CTCAAACTTC CTGTTGGTCT CACCATGCTT TCCTATTGCT GTCAATCAAA GTACTACCTC TTTGTGAACT CTGTTCGTGA CATCACCCAA TTTCGAGTAG TAAATATGTT TGATGCTATC CAAGGATTAG GAGAAACACT CACAGCGCAT GCCGTCAATC GGAACTTGAC ATTTCCGTTT GTCACTTTAC CCATGTTCGA AGTCGCCGGA CAACACGCGC GAACTCAGTC GCGGAATGAG CTTCTTTCCT TTGCGCCTTT TGTCGGGGGA GACGAAAAAG AAGCATGGGA GCGATATGCA TTGGAGAACC AGGGATGGAT TGAACAGGGC CGTGAGATTA GGCTGGAGAG CGATCAAAAT GCTCAGGTAA CGTCGTTCGT TGAAGGGTCT ATTCCTACCA ATATAGTCGA GTTTACAGCA TCTGGAGACG TAGGGCTAGC TCCTCCAGGA CGTGATTCCT ATTCGCCCGT TTGGCAGATG TCTCCTGTCC CATTCTCAAC CGTGTCCTTG AATTTTAACC TACAAACATT TGCTCCCGCC AAGCTCGTGA TGGACGCTGT GGAAATCTTG AAAGGTACGT TGAGACTCAA ACAGCAGCTT TCGTCGATTT TTTTGTGATC GTCTCACACC TCCCCACCCG AACCCGCGCC ATAGATTCTG TTCTAAGCGT GGTAGCGAAC AATTCACAAA ACTCGAGATT GGTGTTGACC GAAAAGGACC ACACTGCTTT CCACGAACAG TTTGTTACTA GTAAAGGCGA TGGCAGCCTT CAAGTACAAC CCCACGCATC CATGTTCACT CCAGTATACG AGGAACTAGG GAACCGTGAT TCAAAAATTG TTGGGGTCAT TTCCAGCGTA ATCGCTTTTG ATGCATTCAT GGCTGACCTG CTACCCGATA AAGTGAGGGG TATATACTCA GTCATGAAAA ATACATGTGG ACAACAGTGA GTGCCGAAAG TACAATATTT GCTGCATTTA TCATTTGTCC GCAATTGTTT GTGTGCTTAC ACAGTGCGTA TGACAGATTC TCGTACGAGC TGACGGGTAA TCGCGCAATA TATCTTGGCA ATGGTGATCA GCATGAAACC GCCTTCAACA AATACGAAGT TGTGATTCCC TTCGATGCCT ATCGCGATCC CGATCTTGCC GCCATCACGG AGAAGCATTG TCGGTATTCC CTACACATGT ATCCCAGCCA GCAATTCGCT CAGGGATACG AGTCCAACCT TCCCATTGTT TTCACGTCGC TCGTAGCAGC CACCTTTTTT CTTATGGCCT TGACATTTCT GGTTTACGAC CGCTTTGTCT ACCGCCGGAA CATCAAAGTG GTAGATGCTG CTGCTCGATC TAATGCCATA GTGTCATCAT TATTTCCCTC CAACGTCCGT GAGCGTTTGT TCGAAGACGC AAAGGCAAGA AGCGATGTCA ACCAGGCTGC TCACTCACGT CTCAAAACAT TTCTGCACGG CGGCGACATT CCTGAAACCG CCATTAGTGA TGCGAACGAG ATTAACGGGA ATTTTTTCAA GAGCAAGCCT ATTGCGGAGC TCTTCCCGCA AACAACTATC ATGTTTGGTG ACATCAGTGG TTTCACGGCA TGGAGCTCCT CGCGAGAGCC CACACAGGTT TTCCAGCTCC TCGAGACTTT GTACCACTCC TTTGACGAGA TTGCCAAGAA ACGCCGCGTC TTCAAGGTAG AGACGGTCGG CGACTGCTAT GTTGCGGTTG CTGGACTACC TGACCCCCGC AAAGACCATG CAGTAGTAAT GGCGCGCTTT GCCAAAGACT GCATGCACCG AATGCACTCA ATGACGAGAA AGCTGGAAGT CTCTCTGGGC CCAGACACGA CAGATTTGTC TCTTCGCATC GGATTGCACA GTGGACCCGT GACTGCCGGC GTGTTACGAG GGGAGCGGTC CCGCTTCCAG CTCTTTGGGG ACACTATGAA TACGGCCGCA AGAATGGAAA GCAATGGCAT TCGCGGTCGC ATTCAAATCT CACAAGAGAC GGCCGATCTA ATCGTAGCTG CTGGTAAGGC GCATTGGTTC GTTGCGCGCG AGGATACAAT CGTAGCTAAA GGTAAAGGTG AGCTCAAGAC GTTTTGGCTT GCACTCGGAG ATGGGGGAAA GACAAAATCG ACGACAGATA CAACGCACAG CAGTGATGAT GTTTCCTCGC CCAATCACAA TAGTTCCATG ATTCTGGATG GGCTGATGGT GACGAGCGCC GAAAGCGATC AGCAAGTCAA CACCCTCATG TCCAACAAGA CTTCGCGTCT GATCGACTGG AATGTGGATG TCCTGTCACG GTTGATCAAG CAAATAATTG CGCAACGCAA GGCTTCCAGC TCGCCGACGA AGGACTCGAC CAAAAAGCAC TTCTGTCCCG GCGACAACAG AGGAGCCGCG ACCACGGTGC TGGACGAAGT GACAGAGATT TTGGCCCTTC CAGAGTTCGA CGCGGAGGCT GCCAGGCGTC AGCAGGATCC ACAGAATATT GAGCTGAACG CGAACATTAC CACGCAACTC CAGCAGTATG TGTCCAACGT TTCTGCCATG TACCAAAACA ACCCCTTTCA TAACTTTGAG CACGCATCGC ACGTGACAAT GTCGGTAGTG AAGCTACTGT CCCGAATTGT GGCACCAGTG GACGTGGTTG TGTCGGACGG GAAGAACCAA AAGCGCTCCA TCGCTTCCAA GCTGCATGAC CATACGTATG GCATCACATC GGATCCATTG ACCCAGTTTG CGTGCGTCTT TTCGGCACTC ATCCACGACG TCGACCACAG TGGAGTTCCC AATGTGCAGC TGGTAAAAGA GCACAGCAAA ATCGCCAAAT TCTATCAGGG GAGGAGTGTC GCCGAGCAGA ATTCTGTAGA TCTGGCCTGG GAACTACTAC TGGACGAGAG TTTTGATGAC CTGCGAGCCG CTATCTTCGC CACCGACGGG GAAAAAGCCC GGTTCCGACA GCTGGTGGTC AACTCCGTCA TGGCTACTGA CATCATGGAC CCGGATTTCA AGGCCATCCG AAACGCTCGC TGGGAGAAGG CCTTTACAAA GTGTCCCAAC CTGGAGGAAG ATGCGAAACA AGCGACGAAC CGCAAGGCGA CGATTGTGAT CGAGCACATG ATTCAGGCCT CCGATGTCGC GCACACGATG CAGCACTGGC ACATTTACCG CAAATGGAAC GAGCGTCTTT TCATGGAGAT GTACCAGGCC TTTAAAAGCG GTCGAGCTGA AAAGAGCCCA GAAACGTTTT GGTACAAAGG CGAGCTCGGA TTTTTCGATT TCTATATCAT TCCGCTGGCT ATGAAGTTGA AGGATTGCGG TGTGTTCGGA GTGTCGAGTG ACGAGTACCT GAATTACGCT ACGCGCAACC GCAAGGAATG GGAAGATCGC GGTCAGGAAG TGGTTCGTGA AATGATGGAA AAAATCGGAT TTGGTTCAAT GGATTCTGAA ACTATGAAAT GA
|
Protein sequence | MDTSSFSDPL DDDSISVDYS STGTNTESDR HAKSDERDET KEVQRLASSE TAHIRFWRLV VVALLLISGT VLSTFTYIFL KGQEKDDYGD AYYLFVNSVR DITQFRVVNM FDAIQGLGET LTAHAVNRNL TFPFVTLPMF EVAGQHARTQ SRNELLSFAP FVGGDEKEAW ERYALENQGW IEQGREIRLE SDQNAQVTSF VEGSIPTNIV EFTASGDVGL APPGRDSYSP VWQMSPVPFS TVSLNFNLQT FAPAKLVMDA VEILKAFVDF FVIVSHLPTR TRAIDSVLSV VANNSQNSRL VLTEKDHTAF HEQFVTSKGD GSLQVQPHAS MFTPVYEELG NRDSKIVGVI SSVIAFDAFM ADLLPDKVRG IYSVMKNTCG QQFSYELTGN RAIYLGNGDQ HETAFNKYEV VIPFDAYRDP DLAAITEKHC RYSLHMYPSQ QFAQGYESNL PIVFTSLVAA TFFLMALTFL VYDRFVYRRN IKVVDAAARS NAIVSSLFPS NVRERLFEDA KARSDVNQAA HSRLKTFLHG GDIPETAISD ANEINGNFFK SKPIAELFPQ TTIMFGDISG FTAWSSSREP TQVFQLLETL YHSFDEIAKK RRVFKVETVG DCYVAVAGLP DPRKDHAVVM ARFAKDCMHR MHSMTRKLEV SLGPDTTDLS LRIGLHSGPV TAGVLRGERS RFQLFGDTMN TAARMESNGI RGRIQISQET ADLIVAAGKA HWFVAREDTI VAKGKGELKT FWLALGDGGK TKSTTDTTHS SDDVSSPNHN SSMILDGLMV TSAESDQQVN TLMSNKTSRL IDWNVDVLSR LIKQIIAQRK ASSSPTKDST KKHFCPGDNR GAATTVLDEV TEILALPEFD AEAARRQQDP QNIELNANIT TQLQQYVSNV SAMYQNNPFH NFEHASHVTM SVVKLLSRIV APVDVVVSDG KNQKRSIASK LHDHTYGITS DPLTQFACVF SALIHDVDHS GVPNVQLVKE HSKIAKFYQG RSVAEQNSVD LAWELLLDES FDDLRAAIFA TDGEKARFRQ LVVNSVMATD IMDPDFKAIR NARWEKAFTK CPNLEEDAKQ ATNRKATIVI EHMIQASDVA HTMQHWHIYR KWNERLFMEM YQAFKSGRAE KSPETFWYKG ELGFFDFYII PLAMKLKDCG VFGVSSDEYL NYATRNRKEW EDRGQEVVRE MMEKIGFGSM DSETMK
|
| |