Gene PHATRDRAFT_43037 details


Gene Information

Locus tag: PHATRDRAFT_43037
Symbol:
ID: 7196840
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 1868054
End bp: 1872104
Gene Length: 4051 bp
Protein Length: 1122 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002176858
Protein GI: 219110213
COG category:
COG ID:
TIGRFAM ID:
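
The Gene Length above agrees with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. That reading is an assumption, since the record does not state its coordinate convention. A minimal Python sketch of the check, with the hypothetical helper name span_length:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates.

    The coordinate convention is an assumption; the record does not state it.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Start bp and End bp as listed in the record.
print(span_length(1868054, 1872104))  # 4051, matching "Gene Length"
```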


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000542513
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGGAGGCGGA CTAACAGTAA CCGAGCTAGA GCAGATTGCT CACTCCGTAC ATCTCTCTCT 
ATATATTCAT TCCTACCTAC CTACCAACAC TAGATACATA TTGTATCTAG AGGTAGATTT
CACCACAGTT CCGTCGGAAG CGAGAAGGAA ATCGACTTTG GGAGTCGAGC ACGATACTCT
AGCGGTAGTA GAAGAGTATT GGAGGAACGG CACGACGAGA AAGAAGTAAA GGCATTGCGT
GGATTGTGAC TGCGACTGCT ACTTCTCTAC AGCAACTACT ACTACTGCTG GAGAAGAAAG
ACGTACTTAT CCGTTTGCGG GGATTGGTCA AGTTTCGGAC TGTAGTTTGG GACTCGTAAA
ACACCATGCG GTCTGGGCTA CTCACATTGT TCGGTTGCGC GGCAGTGTCC GACCATGTCT
CAGCGTTTGT CCCGCAATCA CGTCTCCTGC CCGTCAGCTA CAGCTCTCAC GGAGTGACTC
CACTACCGCG TAGCTTGGCC TTACAGGCAA CGGCCGACAA CGGCGGTGGC TCGGGACTGG
GCAACGCCAT TCGCCGGATG ATGGAAACAC CAATTCCGGG CACGGACAAG GTCGCCGCCA
CCGCGTCAAC GCCAACCACG CCCGTAGTCG TTCCCGGCGT ACCCGTGAAC GTTCCCGATT
GGTCCCAGAC GCTGAACGAC GCCGTAACAA CCGCAGCAAC GACGGCCACA AAATCGTCGC
CGACACCCAC CAGCATCCCC GCGGACGCCG TCGGTACCAA TCTCAACGTG TGGAAATCCT
ACCTGGCGCC CAATACGGCG GATCTGGCCT TGCCGGACAG GGCAGCCTTG CAAGCGGGCG
CGGATCAAAT CGTGACATCG GTACAGTCCA TCCCAGTGGA CAAGTTGAAT CAAGCCGCGT
CGGATATTGC CAACGTCTTG TCTACCGAAG GTTGGTCCAC TGCGGATATC ACCCAAGCTC
TCAACATTGA CGAACTAGGA GTGTGGTACG CCGGAGCGCT CGGGGCGGGC ATTCTGTTGG
CGGCGGGGCG CAACTCGACT TTCACCAAGG CAACAGCTCC CCCTCCTCCG CCCCCACCAC
CCAAACGCAC GTTTCCCGAT CCACCCAAAC TACCCCCCTT GCCCAATCTC CCGCTACCGG
AAAACGTACC GGTCCCCGTA GTCGCCACCG CGGGTGGCAT TGGGGCACTC ACCTTCGCCT
CACTTTTGGG GTTTGGGGAT TCCATCAAGA ACGCCATCAA GTTTGCCCTC GTGCCGAACG
CGGCCATGAC TGCCAGGAGT GCAGCGGCCG TGGCTGCTTC CACCAACACC CAGACAGCAT
CGGTCCCACT GGCTTCCAAT CCACCGCCGC CACCCCCTCC CACCGTCACA CCGGAACCCA
TTAACCAAGC GATCGAAGCG GCTTCCACAA CAATCGGGTC CAGTGCATCT GCGGGATTGT
CCTCGTTGAA AACCTACCTC GTACCGGATC CCAAGCTCTT GGAACTACCC GACAAGGCCG
GTCTTCAAGC GACGGCGGGG AAATTCATCG ACGCCGCCGT CGCCATACCC CAATCAATTC
CCGTCGACAA GTTTGGGACG GCCGCCGTCA CACTCGCACG GGTGTTGCGA ACGGAAGGAT
GGTCCGCTGC GGACGTCACT AATGCGCTCA ACGTGGAAGA ACTGGGGGTG TGGTACGCCG
GTGCACTCGG CGCCGGCGTA CTCTTGGCCA GTCGCAACGT CACGAACATT GGAACGTCCG
CATCGGGACC GAAAAAACCT GCGTTTCGTC CCACCCGCGT CGTGGCTCCC GAACCCGTGC
CGACACCGTT GGAGTCCCTG CAGAGTCAAG TGCGGGAAAT CAGCAAAGTC GAAATTTCAC
CGGCCGCCAA AGTCACAATT GGTACAGTCG GCGCATTGAC GTTTGCTTCT CTTGTTGGTA
TGGGAGATTC CGTCCGGTCC GCCATCAAGT TTGCCTTGGT GCCGAACGCC AAGAAACCGG
CCGTATCGGT TCTAGCGCAA TCTGTCACGC AGGAGCCTCC AGAACCACCC AGCCTGCCGG
ATCCGCCGTC CTTTTATGCA CCAGCAGTGT CTTCGGAGAT CACTGATAAG CCGGTCACGG
AACAATTGGC TGGGAGCCTG GTGGAGTCCA CCAAAGGGGT CATGTCCGAC ACGGTGGACT
CGTCCGTTGC TTCGTCCAGC TCCGTCCAAG CGGCCATCGC TTCAGTACAG TCGAACTATC
AGAACGCGCT CAATTCGATC AAAGAGGCCC CAGGGACTTC GGGCCAAGCC TCGAAACTTT
CCGAATTTCT GAAAGAAAAA GTACCATCCT TCAGCACAGA TTTCGGCAAA CTGGATCTGC
CCAAGCCTGA TTTTAATGTC GATCTGAGTG GTTTCGACCT TACAACGGAA CGCATCAAGG
CTTCATTAGA ATCTATTCCA GTCGACAAGT TGTCCAGCAC GTTTGACAAT CTGGGGAAAA
GCTTGCAAAA TGGCGGCTTC ACCGCGCAGA GTATTCTGGA GTCAATGAGC TCCGCCGAGA
AGGGTTGGTA CTTGGCAGCT GGTAGCGTCG TGTTGGCAGC GGTCGGGGCC GGGATTCGCA
ACACATATGA AGACCAGCTC GAAACCACCA CGTTAGAGAC CAAGGAAAAG CAAGCCAATG
CGAAAGAGCC TGCCAAGATT TCTGAAGCTG ATAGTGTTGA AGACGACATG GTCTCTCAAA
TCAAGGAACT GAGTCAGATG ACAACGGCAT TATCGAACGA GCTCAAGCAG ATTAAGACTC
AAAAGTCAAA GAAGGACTAC GATGTGGCCA CAATGCAGAG CGATGTGCGC GAACTGCAGA
ACGCAATGGA CGCACAAAAG AAGTCTGAAA AGGCGTTGAA ATTGCAGCTT GCGCAGACGG
AAAAAAAGCT GGCGGCGGAG ACCGCCCAGC TACAGCAAAA GCTAGAAGAA GCAAACAAAA
AGTTCAAAGA CGAGAAGGAT GCAATGAAGA AAACAAACAA AAAGCTTCAA AAAGACTTGG
ATGCAGCGCT GGCGTCAGTT GCGGCCTTAG AAGCGGAAAA GGTTGTGTAT TTGTTTTGTT
TGTTTGTTCG TTGTATCAAT ATTACTTTGT TAGACGTGCT CTAACCCATG ATTATTTGAT
CAACGCAATG CAGGCTGAAC TACAAACACA GCTCAACGCA CTAGGTATTG AAGAAGTGGA
GGCACAGCTC GCAGAGTTGG GGCTCAATGA AGTGGCGAAA AAGCCCAATC GAAAACAAGA
ACCCGAACCT GTCGTTGAGA TAGAACCAGA ATCGAATTCA GAAGCCCAGG CAGCGACAAC
CACAACCAAA AGCAGCTCAT CACGTCCACA GGACACTTTC TTTGCTAATT TTACCGAAAT
TCCCTTGGAC GAGCTTCCAG CAGTAGCTTC GGAGTCGACA GAGAAGAACA AAAGCTCTAA
AACGAGAGCC GCTTCTAAGC AACCGCCTTC AAAGAAAACA AACATAAAGA AAAATTCGCC
GAAGAAATCC GCCACAAAGG GTGCCATTAA GAAGCAAGTT GAGCAGAAGC CGGAAGAGAA
AAAGGCTGAA ACGAAAAAGA CAGAAGCGAA AAAGGTTGAA ACAAAAAAGA CAGAAGCGGA
AAAGTCTGAA ATGAAAAAGA CAGAAACAAA AAAGAAAGAA GCGGCTGACT CTGTGGTGTC
AGGCGGATCG GAGAACTGGA ACAGTTTGTC AGAATCAACG CTGAAACGGA AAACGGTGAA
GGAATTGACT TCGTATTTGG AAGAAAAGGT ACGTCAATAA TTGTTTTATT GCCTCTACCT
TGTTTAATAG CCAGTTTGAT TTACCTTGCA TCCATTTGTT CTCTTCAGGG ACTTACAACG
ACCGGAGGAG ATGGTAAAAC ACTGAAGAAA GCGGATCTGG TAGCCGTCGT TCTGTCGCAA
TCGTAGGAAT GTTGTAATGG CTTCGTTTTT TTTGGCTATA TACTGGATCG ATGCAATTAA
AATCGCTACC TTAAAACACG ACCCAGAAGA TTACTGCGTT GCACCACCCG TTCTTAATAT
AGCGTATGTT TGTAAGGAAA CGACATGCTG G
 
Protein sequence
MRSGLLTLFG CAAVSDHVSA FVPQSRLLPV SYSSHGVTPL PRSLALQATA DNGGGSGLGN 
AIRRMMETPI PGTDKVAATA STPTTPVVVP GVPVNVPDWS QTLNDAVTTA ATTATKSSPT
PTSIPADAVG TNLNVWKSYL APNTADLALP DRAALQAGAD QIVTSVQSIP VDKLNQAASD
IANVLSTEGW STADITQALN IDELGVWYAG ALGAGILLAA GRNSTFTKAT APPPPPPPPK
RTFPDPPKLP PLPNLPLPEN VPVPVVATAG GIGALTFASL LGFGDSIKNA IKFALVPNAA
MTARSAAAVA ASTNTQTASV PLASNPPPPP PPTVTPEPIN QAIEAASTTI GSSASAGLSS
LKTYLVPDPK LLELPDKAGL QATAGKFIDA AVAIPQSIPV DKFGTAAVTL ARVLRTEGWS
AADVTNALNV EELGVWYAGA LGAGVLLASR NVTNIGTSAS GPKKPAFRPT RVVAPEPVPT
PLESLQSQVR EISKVEISPA AKVTIGTVGA LTFASLVGMG DSVRSAIKFA LVPNAKKPAV
SVLAQSVTQE PPEPPSLPDP PSFYAPAVSS EITDKPVTEQ LAGSLVESTK GVMSDTVDSS
VASSSSVQAA IASVQSNYQN ALNSIKEAPG TSGQASKLSE FLKEKVPSFS TDFGKLDLPK
PDFNVDLSGF DLTTERIKAS LESIPVDKLS STFDNLGKSL QNGGFTAQSI LESMSSAEKG
WYLAAGSVVL AAVGAGIRNT YEDQLETTTL ETKEKQANAK EPAKISEADS VEDDMVSQIK
ELSQMTTALS NELKQIKTQK SKKDYDVATM QSDVRELQNA MDAQKKSEKA LKLQLAQTEK
KLAAETAQLQ QKLEEANKKF KDEKDAMKKT NKKLQKDLDA ALASVAALEA EKAELQTQLN
ALGIEEVEAQ LAELGLNEVA KKPNRKQEPE PVVEIEPESN SEAQAATTTT KSSSSRPQDT
FFANFTEIPL DELPAVASES TEKNKSSKTR AASKQPPSKK TNIKKNSPKK SATKGAIKKQ
VEQKPEEKKA ETKKTEAKKV ETKKTEAEKS EMKKTETKKK EAADSVVSGG SENWNSLSES
TLKRKTVKEL TSYLEEKGLT TTGGDGKTLK KADLVAVVLS QS
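
The sequences above are printed in blocks of 10 characters, 60 per line, so the spaces and line breaks have to be removed before computing a length or a GC fraction. The following Python sketch shows one way to do that. The helper names clean_sequence and gc_fraction are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than on the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, as printed in the record.
first_blocks = "CGGAGGCGGA CTAACAGTAA"
seq = clean_sequence(first_blocks)
print(len(seq), gc_fraction(seq))  # 20 0.55
```

Applied to the complete gene and protein blocks, the cleaned lengths can be compared with the Gene Length and Protein Length fields, and the GC fraction with the GC content field.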