Gene PHATRDRAFT_43695 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43695 
Symbol 
ID7197238 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp1209503 
End bp1212840 
Gene Length3338 bp 
Protein Length1060 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177781 
Protein GI219112059 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.044454 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTATCAA ACCGACGGAA GAGAAAGATT CTCGGATGCT TTCTCCTCTT TGCGACTCCC 
ATTGACGGCT TGTTGGGAGT GACTTCAGCT CCGTCCTTGA CGCGGAAAAA GAGGCGTCAC
TCAAAGTGGA GCTTCCTAAC TTCGCCGTCC TCAACAACGA GAGCAACTTT ACGATCGGGG
AAAAGTAGCT GGCTAGAAGC TGCTCCTCAT CGGCAGGTTA ATACGAATGC AAGTTCACCT
CCAACTGAGT CATTATCATC ATCATCACCC GCGACTTCAT TTATCAACAC GAACTCTCCA
ACTTCATCGG CAAGTAGCGA TGCCCGTCAT ACGGATTCGC CATTATCCCA AATCTATGAG
GACGACGAAC ATCCACGGGC ACCTCGACCG CCGCCGTCAT CTTTTTCGCG TAACGAAGAC
TGGCTAGAAT CCGTTACGGG CGAATTACTC GATCTGGATG TGTACCCCCT GGGCAAACTC
ACTGACGACG ATGTCGAATC GATTGCCGGT CTCATGGCAG CTTGGGCTCG GCGAAAGTCC
GTCACCGCTG CACTTACAGT CGAGAGGTTA CTAAAACGAG TTGTCGACGA TTTGAAAGAG
GGGAATCAGC GCGTGCATGT TACGACCCGA ATGTATTCCT GTGCGATTGA CGCCTGGGCG
AAGAGTGGCG TCGAAGGCTC TTGCGAGCGT GCCGCCCAAA TTCACGATAC ATTGGTGCAG
CACTATCAAA GTACCAACGA TCCACTTTTG GCACCATCTG TTATGTCCTT CAACACAGTC
GTCAATGCCT GGGCGAAATC AAATCACGAC GATGCACCCG CTAAAGCTGA AGCTGTGTTG
GAAGAAATGA TACAGGCATA CCGAAACGGC AACGAAGCGT TAAAACCGGA TGCGGTTACA
TTTTCAACGA TATTGGATGC TTACTCGAAA TCCAATAAAC CCAACGCTGT AGCACGCTGC
TACGAATTGT TTCAGGTTAT GGACGAGCTG GACGTCAAGC GGAACGTATA CACCTTTTCT
GCGCTACAAA ATGTTGTGGC AAGATCTCGG ATTCCGAACG CGGCGGAGCA AACCATGAAT
ATTCTACAGC AGATGCTCAA ATTGTACGAA AATGGAGACG TCTTTGCCAA ACCCAATACT
TTAAATTACA ACGCCGTTCT CAACGCATGT TCGCGAACCC CGAGCAAAGC CAGCGCTCAA
CTCGCCGACG ACTTGCTGCA TAGCATGGAG TTGCCCTTGA TACAAGGCGG TTATGATGTC
GAACCCGATC GTCTATCCTA CGCCATGTGC ATACTAGCCT GTGCTCGCTG TCCTGACGAA
GCATTCGGCG TGCCGAAGGC CGAGGCCAAT TTACGGCGCA TGGAAAGTCG GGCCATTATG
GAAGCCGCCA AACGGCAACA GATCTCCAGC GCGGCGCCAC CTACTGTTAC ACTCGACATT
GAATGTTTTA ACGTTGTGCT CACCGCCTTG TCCAGGCGTA AAAACATACC GCCGACACGG
ACCTTAGAAA TTGTGAAACG CATGGAGGAA TACGCCGAGC AAGGACAGGA GCATTTGCGT
CCGAACGTGC GTTCCTGGAA TGCCGTTTTG AACGCGTACG CCCGGGCCAT TGCGGTTTCA
CCTCACTCAA CGGCTTCCAA CTCGTATGCT CAAATGGCCG CTGAGTTTCT CCAGCACATG
CGCCTTGATC TGGGTATTCG GCCCGACGCG TTTTCGTTCG CCGCATTGCT TAGCGCGTTC
CAAAAATGGG ACCATCCCGA AGCAGTCGCC CAGGCCGATG CTTTGGTTCG AGAAATGGAA
TCCTTGTTCG AACAAAATGA GATTGACGCT CCGCCTGATG TGTATCAGTA CGTCGAAAGA
TATAGAAAAT CTGTGTGCTT CTTTTCTCTA TGTTGACACA AGCTTTGTTC CTTTTCACTT
CAAAAACCGC AGCTATACAA TTCTCTGTGG AGCCTGGGCT AGGTCGGGCC AAAAGATTGC
CCCTCAACGC TGTTTGCAGA TTCTAGCACA CATGGTGGAA CGGCATCGCC TAGGCTATCC
CAACGTGAAA CCAAACGGTT AGTATACTCT GTGTTGCCAT CATAGCTTTC CTCATGTGAC
TCATAATGTG TTTTTGCGCT ACTGTAGTCC GAACATACAA TGCCGTGTTG GATTGTCTTG
CGCGCGCTGG CGCGGAAGAC CGTGCTGAGC AATTGCTGTT TCATATGTTA AAATTGTATC
GAGACGGAGA TCATGATGCC GAGCCCGATG CCTTCACGTT CAATTGCATC ATTCACGCGT
TCTCTAAATC TCGACGAAAG GGCGCCGGTC GGAGGGCCGA GTCAATTCTG GACCGTTTTT
TGGAATACCA CGAAGAAGAA AATCAATCGA TCCGTCCCGA CACGCGATCA TTTACCCACA
TTATTGCTCA TTATGGTCGA AGCCGTGAGT TGGATGCCCC ATATCGAGCC GAGTACGTTT
TAAATCGCAT GGTATCATTG TGTAAAGACG GCAACAAAGA TTTAGCCCCG AACCTGTTTG
CCATCAAAAC TGTTGTGGAC AGTTACTCTC ACGCGAAACA TCCCGACGCT GGTCGTAACG
CCGAGCGCTT TTTAAATCTG ATTCGAGAAT TGAGGGAAAA GCATGGCATT ACTAGACTTG
AGCGTGATAC TTCATTTATG AACAGCGTGC TGTTTGCGTG GTCGAGCTGT GGCAGCGAAG
ATTCCGGTCA TCGCGCGGAA GGTCATCTGT TGGAAATGGA AGACAGTTTT GACCAAGGAA
CCATCTCTTT TCGACCGGAC TCGCGAAGCT ACGAAATGGT GCTGTCGGCG TGGGCAAAGT
CAGAAAGCAG CGACAAAGCG AAGCGTGCGT TGCTGACTCT ACGTCGTATG CAAGAGCAGC
AACGGACCGG CAATCCGTTT GTCCGAATTG ATGAAGCCGC TTATTCTTTT GTCATTAATG
CTTGCGCCTT CAGTAATGCC GGCGAGGATC TCGAAGCTGA AGCATTCACA ATCGCAGTAA
AGCTGTTAGA CGAAATGCTG GAGTCTAAAA GCGTTCATCC CAGCTCACTT ACATACGGAT
GGTTTATTCA GGCATGTGGA CGCCTTCGTG TGGCGCACGC ATTAAAAAGT GTCCAAATAG
GAAGGGCATT TCATCTTTGT TGCGAAAATG GTTTAGTAAA CGACTTTGTT TTGCATCGGT
TAAAGGGAGC GGCGCCAGAT CCAGTCTTCA AGGAGTTGCT GGCTCCTGTT TTGAGCAACT
TACCTCCTCG TTTTCCGAAG GGAAGGCTCG CAGTAAATAA CCTCCCATCA GATTGGACTT
GCAACGTCCA CGGGAACAGA AAAATAAGAC GGCAATAG
 
Protein sequence
MVSNRRKRKI LGCFLLFATP IDGLLGVTSA PSLTRKKRRH SKWSFLTSPS STTRATLRSG 
KSSWLEAAPH RQVNTNASSP PTESLSSSSP ATSFINTNSP TSSASSDARH TDSPLSQIYE
DDEHPRAPRP PPSSFSRNED WLESVTGELL DLDVYPLGKL TDDDVESIAG LMAAWARRKS
VTAALTVERL LKRVVDDLKE GNQRVHVTTR MYSCAIDAWA KSGVEGSCER AAQIHDTLVQ
HYQSTNDPLL APSVMSFNTV VNAWAKSNHD DAPAKAEAVL EEMIQAYRNG NEALKPDAVT
FSTILDAYSK SNKPNAVARC YELFQVMDEL DVKRNVYTFS ALQNVVARSR IPNAAEQTMN
ILQQMLKLYE NGDVFAKPNT LNYNAVLNAC SRTPSKASAQ LADDLLHSME LPLIQGGYDV
EPDRLSYAMC ILACARCPDE AFGVPKAEAN LRRMESRAIM EAAKRQQISS AAPPTVTLDI
ECFNVVLTAL SRRKNIPPTR TLEIVKRMEE YAEQGQEHLR PNVRSWNAVL NAYARAIAVS
PHSTASNSYA QMAAEFLQHM RLDLGIRPDA FSFAALLSAF QKWDHPEAVA QADALVREME
SLFEQNEIDA PPDVYHYTIL CGAWARSGQK IAPQRCLQIL AHMVERHRLG YPNVKPNVRT
YNAVLDCLAR AGAEDRAEQL LFHMLKLYRD GDHDAEPDAF TFNCIIHAFS KSRRKGAGRR
AESILDRFLE YHEEENQSIR PDTRSFTHII AHYGRSRELD APYRAEYVLN RMVSLCKDGN
KDLAPNLFAI KTVVDSYSHA KHPDAGRNAE RFLNLIRELR EKHGITRLER DTSFMNSVLF
AWSSCGSEDS GHRAEGHLLE MEDSFDQGTI SFRPDSRSYE MVLSAWAKSE SSDKAKRALL
TLRRMQEQQR TGNPFVRIDE AAYSFVINAC AFSNAGEDLE AEAFTIAVKL LDEMLESKSV
HPSSLTYGWF IQACGRLRVA HALKSVQIGR AFHLCCENGL VNDFVLHRLK GAAPDPVFKE
LLAPVLSNLP PRFPKGRLAV NNLPSDWTCN VHGNRKIRRQ