Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43695 |
Symbol | |
ID | 7197238 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 1209503 |
End bp | 1212840 |
Gene Length | 3338 bp |
Protein Length | 1060 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177781 |
Protein GI | 219112059 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.044454 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTATCAA ACCGACGGAA GAGAAAGATT CTCGGATGCT TTCTCCTCTT TGCGACTCCC ATTGACGGCT TGTTGGGAGT GACTTCAGCT CCGTCCTTGA CGCGGAAAAA GAGGCGTCAC TCAAAGTGGA GCTTCCTAAC TTCGCCGTCC TCAACAACGA GAGCAACTTT ACGATCGGGG AAAAGTAGCT GGCTAGAAGC TGCTCCTCAT CGGCAGGTTA ATACGAATGC AAGTTCACCT CCAACTGAGT CATTATCATC ATCATCACCC GCGACTTCAT TTATCAACAC GAACTCTCCA ACTTCATCGG CAAGTAGCGA TGCCCGTCAT ACGGATTCGC CATTATCCCA AATCTATGAG GACGACGAAC ATCCACGGGC ACCTCGACCG CCGCCGTCAT CTTTTTCGCG TAACGAAGAC TGGCTAGAAT CCGTTACGGG CGAATTACTC GATCTGGATG TGTACCCCCT GGGCAAACTC ACTGACGACG ATGTCGAATC GATTGCCGGT CTCATGGCAG CTTGGGCTCG GCGAAAGTCC GTCACCGCTG CACTTACAGT CGAGAGGTTA CTAAAACGAG TTGTCGACGA TTTGAAAGAG GGGAATCAGC GCGTGCATGT TACGACCCGA ATGTATTCCT GTGCGATTGA CGCCTGGGCG AAGAGTGGCG TCGAAGGCTC TTGCGAGCGT GCCGCCCAAA TTCACGATAC ATTGGTGCAG CACTATCAAA GTACCAACGA TCCACTTTTG GCACCATCTG TTATGTCCTT CAACACAGTC GTCAATGCCT GGGCGAAATC AAATCACGAC GATGCACCCG CTAAAGCTGA AGCTGTGTTG GAAGAAATGA TACAGGCATA CCGAAACGGC AACGAAGCGT TAAAACCGGA TGCGGTTACA TTTTCAACGA TATTGGATGC TTACTCGAAA TCCAATAAAC CCAACGCTGT AGCACGCTGC TACGAATTGT TTCAGGTTAT GGACGAGCTG GACGTCAAGC GGAACGTATA CACCTTTTCT GCGCTACAAA ATGTTGTGGC AAGATCTCGG ATTCCGAACG CGGCGGAGCA AACCATGAAT ATTCTACAGC AGATGCTCAA ATTGTACGAA AATGGAGACG TCTTTGCCAA ACCCAATACT TTAAATTACA ACGCCGTTCT CAACGCATGT TCGCGAACCC CGAGCAAAGC CAGCGCTCAA CTCGCCGACG ACTTGCTGCA TAGCATGGAG TTGCCCTTGA TACAAGGCGG TTATGATGTC GAACCCGATC GTCTATCCTA CGCCATGTGC ATACTAGCCT GTGCTCGCTG TCCTGACGAA GCATTCGGCG TGCCGAAGGC CGAGGCCAAT TTACGGCGCA TGGAAAGTCG GGCCATTATG GAAGCCGCCA AACGGCAACA GATCTCCAGC GCGGCGCCAC CTACTGTTAC ACTCGACATT GAATGTTTTA ACGTTGTGCT CACCGCCTTG TCCAGGCGTA AAAACATACC GCCGACACGG ACCTTAGAAA TTGTGAAACG CATGGAGGAA TACGCCGAGC AAGGACAGGA GCATTTGCGT CCGAACGTGC GTTCCTGGAA TGCCGTTTTG AACGCGTACG CCCGGGCCAT TGCGGTTTCA CCTCACTCAA CGGCTTCCAA CTCGTATGCT CAAATGGCCG CTGAGTTTCT CCAGCACATG CGCCTTGATC TGGGTATTCG GCCCGACGCG TTTTCGTTCG CCGCATTGCT TAGCGCGTTC CAAAAATGGG ACCATCCCGA AGCAGTCGCC CAGGCCGATG CTTTGGTTCG AGAAATGGAA TCCTTGTTCG AACAAAATGA GATTGACGCT CCGCCTGATG TGTATCAGTA CGTCGAAAGA TATAGAAAAT CTGTGTGCTT CTTTTCTCTA TGTTGACACA AGCTTTGTTC CTTTTCACTT CAAAAACCGC AGCTATACAA TTCTCTGTGG AGCCTGGGCT AGGTCGGGCC AAAAGATTGC CCCTCAACGC TGTTTGCAGA TTCTAGCACA CATGGTGGAA CGGCATCGCC TAGGCTATCC CAACGTGAAA CCAAACGGTT AGTATACTCT GTGTTGCCAT CATAGCTTTC CTCATGTGAC TCATAATGTG TTTTTGCGCT ACTGTAGTCC GAACATACAA TGCCGTGTTG GATTGTCTTG CGCGCGCTGG CGCGGAAGAC CGTGCTGAGC AATTGCTGTT TCATATGTTA AAATTGTATC GAGACGGAGA TCATGATGCC GAGCCCGATG CCTTCACGTT CAATTGCATC ATTCACGCGT TCTCTAAATC TCGACGAAAG GGCGCCGGTC GGAGGGCCGA GTCAATTCTG GACCGTTTTT TGGAATACCA CGAAGAAGAA AATCAATCGA TCCGTCCCGA CACGCGATCA TTTACCCACA TTATTGCTCA TTATGGTCGA AGCCGTGAGT TGGATGCCCC ATATCGAGCC GAGTACGTTT TAAATCGCAT GGTATCATTG TGTAAAGACG GCAACAAAGA TTTAGCCCCG AACCTGTTTG CCATCAAAAC TGTTGTGGAC AGTTACTCTC ACGCGAAACA TCCCGACGCT GGTCGTAACG CCGAGCGCTT TTTAAATCTG ATTCGAGAAT TGAGGGAAAA GCATGGCATT ACTAGACTTG AGCGTGATAC TTCATTTATG AACAGCGTGC TGTTTGCGTG GTCGAGCTGT GGCAGCGAAG ATTCCGGTCA TCGCGCGGAA GGTCATCTGT TGGAAATGGA AGACAGTTTT GACCAAGGAA CCATCTCTTT TCGACCGGAC TCGCGAAGCT ACGAAATGGT GCTGTCGGCG TGGGCAAAGT CAGAAAGCAG CGACAAAGCG AAGCGTGCGT TGCTGACTCT ACGTCGTATG CAAGAGCAGC AACGGACCGG CAATCCGTTT GTCCGAATTG ATGAAGCCGC TTATTCTTTT GTCATTAATG CTTGCGCCTT CAGTAATGCC GGCGAGGATC TCGAAGCTGA AGCATTCACA ATCGCAGTAA AGCTGTTAGA CGAAATGCTG GAGTCTAAAA GCGTTCATCC CAGCTCACTT ACATACGGAT GGTTTATTCA GGCATGTGGA CGCCTTCGTG TGGCGCACGC ATTAAAAAGT GTCCAAATAG GAAGGGCATT TCATCTTTGT TGCGAAAATG GTTTAGTAAA CGACTTTGTT TTGCATCGGT TAAAGGGAGC GGCGCCAGAT CCAGTCTTCA AGGAGTTGCT GGCTCCTGTT TTGAGCAACT TACCTCCTCG TTTTCCGAAG GGAAGGCTCG CAGTAAATAA CCTCCCATCA GATTGGACTT GCAACGTCCA CGGGAACAGA AAAATAAGAC GGCAATAG
|
Protein sequence | MVSNRRKRKI LGCFLLFATP IDGLLGVTSA PSLTRKKRRH SKWSFLTSPS STTRATLRSG KSSWLEAAPH RQVNTNASSP PTESLSSSSP ATSFINTNSP TSSASSDARH TDSPLSQIYE DDEHPRAPRP PPSSFSRNED WLESVTGELL DLDVYPLGKL TDDDVESIAG LMAAWARRKS VTAALTVERL LKRVVDDLKE GNQRVHVTTR MYSCAIDAWA KSGVEGSCER AAQIHDTLVQ HYQSTNDPLL APSVMSFNTV VNAWAKSNHD DAPAKAEAVL EEMIQAYRNG NEALKPDAVT FSTILDAYSK SNKPNAVARC YELFQVMDEL DVKRNVYTFS ALQNVVARSR IPNAAEQTMN ILQQMLKLYE NGDVFAKPNT LNYNAVLNAC SRTPSKASAQ LADDLLHSME LPLIQGGYDV EPDRLSYAMC ILACARCPDE AFGVPKAEAN LRRMESRAIM EAAKRQQISS AAPPTVTLDI ECFNVVLTAL SRRKNIPPTR TLEIVKRMEE YAEQGQEHLR PNVRSWNAVL NAYARAIAVS PHSTASNSYA QMAAEFLQHM RLDLGIRPDA FSFAALLSAF QKWDHPEAVA QADALVREME SLFEQNEIDA PPDVYHYTIL CGAWARSGQK IAPQRCLQIL AHMVERHRLG YPNVKPNVRT YNAVLDCLAR AGAEDRAEQL LFHMLKLYRD GDHDAEPDAF TFNCIIHAFS KSRRKGAGRR AESILDRFLE YHEEENQSIR PDTRSFTHII AHYGRSRELD APYRAEYVLN RMVSLCKDGN KDLAPNLFAI KTVVDSYSHA KHPDAGRNAE RFLNLIRELR EKHGITRLER DTSFMNSVLF AWSSCGSEDS GHRAEGHLLE MEDSFDQGTI SFRPDSRSYE MVLSAWAKSE SSDKAKRALL TLRRMQEQQR TGNPFVRIDE AAYSFVINAC AFSNAGEDLE AEAFTIAVKL LDEMLESKSV HPSSLTYGWF IQACGRLRVA HALKSVQIGR AFHLCCENGL VNDFVLHRLK GAAPDPVFKE LLAPVLSNLP PRFPKGRLAV NNLPSDWTCN VHGNRKIRRQ
|
| |