Gene PHATRDRAFT_21664 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_21664 
Symbol 
ID7202266 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp727789 
End bp732187 
Gene Length4399 bp 
Protein Length1186 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181792 
Protein GI219122937 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACGCC CCATTATCCA AGATGTTTCG TTGTGCTTGC AGCCGGGCAA AAATTATCTC 
GTATTGGGGC CTCCCGCTTC GGGCAAGTCA ACCTTGCTCA AAGCGATCGC GGGGCAGCTG
AAATCATCGT CGACGGAAAA ACTGGAAGGC CAGATACTGT ATAATGGACG GGAGTTGGAG
GTATGTAGTG CGGTTCGCCT GTCATGTACA TTCCCTTACA GGGTGTGGTT GCCTGTGTCG
GTCTTTGGTT CGCACCTGTC TCTCACTTCA TTCGGCTACG GATCCCGACT GTAGGTGGGT
CGACGGCAGC AGTGGTACAT TGAAAATGCC TTTGCTTACA TTGATCAACT AGACAAGCAC
GCACCACGGT TGACGGTCGA CGAGACGTTC GAATTTTCTT TTCAATGTAA AACCGGAGGC
ACATTTCAGC AAGCTCAGGA TCCACGCGTT TTGCAGGACC CCAAAGTTAT GACAGCGATA
CAAGAAGCCG ACCGGAGCAG ACTCGGTGTG AATATGGTCT TGGCGAGTCT AGGGTTGACG
GAAGTTCGCG ATACGTTTGT GGGGAACACT GCCGTTCGTG GGGTTAGTGG AGGCCAACGG
CGGCGAGTGA CCGTGGGGGA AATGATTACG TCTCGTCAAC CTGTCCTTTG CGGTGACGAA
ATTTCGACTG GTTTGGATGC TGCGTCCACC TTTGACATGG TGCAAGTACT CACTCACTTT
GGAAAACTAG CGCAAATGAC ACGAGTCTTT GCTCTGCTGC AGCCGAGTCC CGAGACTTTC
AGTCTTTTCG ACGAGATCAT ACTCGTGTCG GAAGGCTTGA TTTTGTATGC TGGACCAATC
GACGAGGTAG AGGATTACTT CGCTGAGCTT GGCTATCGAT CTCCACAGTT CATGGATGTC
GCTGACTTTC TACAAACGGT TTCTACCGAG GACGGTAAGA AACTGTATCA CCCTGTCGAC
GATAGCAAAC GGACCGAACC GCCTACTGTC GCAGATCTTG CCAATTGTTT TAAAACCAGT
CAGCAAGGGA AAAAAATTCG CGATCGACTG GACGAACCCC CTCAGTATGT TTGGAAACAA
GACGATCGAA TCTCACAGCA TGGAAGCATT GTCTCGCAGC TTACCTTGTT GAAGCAAGTG
AAGAAAAAGT ACGCCAACTC CTTCTTTCGA AATACGTGGT TGAATCTGAA ACGATTCTTG
TTGTTGTGGA CGAGGGATAA AAGAGTGATT TTCGCCAGTG CAGTCAAGAA CATATTGATG
GGTGTCAGTG TTGGCGGAGT ATTCCGCGAC GTCGATGACG AAGTCTCTAT TTTAGGGGCT
CTTTTTCAAT CAGGTCTTTT TATCATGCTC GGGGCAATGC AAAGTGCATC TGGGCTAGTA
AACGACCGCG TTATCTTTTA TAAGCAAATG GACGCCAATT TCTTCTCGTC GTGGCCCTAC
ACGCTGGGAA GAACTTTGGC AGGATTCCCA CAGGTACGTG TCTGACCGAT GGCATGCTTC
TGTTTGCATC GCACGTTCTT ATACGGCTCT CTCTTGTAGA CCATCATGGA TGTCTTCACG
TTTGGGACAA TTCTTTACTT TATGGTTGGA CTTAGCGATC GAGCTGTGAC CGAATATTTC
TTGTTTATTG CAATTTTAAT GACTTTTGCA ATGATGATGA ATATGCAGCT AGCAGTGTTT
GCATCGTTCG CTCCAGACTC TCAGCTGCAA GTCTACAGTG CTTGTACACT ACTGCTGCTA
ATTCTGTTCG GTGGTTACAT TGTGGCGCCT GATGCCATCC CCTCGTTTTA TCTTTGGATA
TATTGGTGGA ATCCTTTTGC TTGGGCTTAT CGTGCTTTGG TGATCAATGA GTTTCGTAGT
TCACGATGGG ATGATCCAGA CGCGACGCTT GCAGGGATTG GTTTCGTGTA CGGTATAGAT
TCTAGGCCAT TTGAACAGGA CTGGCTGGGG TATTGCTTCC TTTATATGAC CATTTACTTT
TTCGGTTGCG TAGTTCTGAC GGCTGTGAGT CTTGGCTACG TGAGACAGAT CCCTGAGCCG
ACACCGCCAG ACGTGAACAT AACAAGGCTT GTCTCGGATC CTGTATCAGA GCGTCGGAGG
GTCAATGTAC CCTTTAAACC TGTGACATTG TCTTTTGCAG ACGTTTGCTA CGAAGTCAAA
GCTTCAACAA AAAATGAAAC TCTAAAACTT TTGAATGGTG TCAATGGAAT TTTCCGATCA
GGACGCATGG TACGAGTTTT TGATAGCTCT CTGTTCCAGT TGTTGTGGTA GTGCTGTTCT
CATGCATCCT CTCCGTTGTC TTGCTGTTAG TGCGCATTGA TGGGATCGAG TGGAGCAGGC
AAAACGACAT TGCTGGTAAG TACAGTGCAT CTATTGTTCC AAAGCATTAA CATTCACATG
CTCACAATTT TCCGGTCATA GGATGTGATT GCTCTAAGAA AAAGGACTGG ATCAGTGACG
GGTGACGTTC GGTTGAATGG ATGGTCACAG GACAAAATCT CGTTTTGTCG TTGCTCTGGA
TACGTTGAGC AATTTGACGT CCAGTCACCG GAGCTGACGG TTCGGGAGAC AATTCTGTTC
TCTGCTCGGC TCCGCCTCGA TCGCGATGTC GTCACAAGCG AAGAGGACCG GGAGGCTTTC
GTCGACCAAG TCATTGACGA TATGGAACTT CTTCCTTTGG CTGATTCGTT AGTTGGTAGT
GACGAGGGAA TCGGTCTAAG TTTTGAGCAA AAGAAGAGGT TATCAATTGC GGTTGAACTC
GCGGCTTCAC CGTCTGTCGT CTTCCTGGAT GAGGTAAGGC TTTCGCTGTC TGTAATCAAA
ACGATGTATT GTGCATTTTT GCTCTAACAT ATTGATTGCT CTAACTATTG ATTACTATGG
CTGTTCGCAG CCTACGAGTG GTTTAGACGC CCGAAGCGCT CTACTTGTGG TAAGGGCGCT
ACGCAATATT TCAGACAAAG GGCAAACCAT CGTCGCAACT ATTCATCAAC CATCGTCAGC
GATTTTTGAG ATGTTTGTAA GTAACTTCAG CGCCAGGAAA ATATGCAGGT TCGATAAGAC
CTCACATTCT TCTTTGGTGG CAGGACGAAT TGTTGTTGTT GAAACGAGGT GGGCAGGTTG
TTTTTCAAGG AGACTTGGGA AAAGATTGCT CGCGTTTAGT GAACTATTTT GAAAATTTGG
GGGCAACAAA GATCGAACTC GGGGAAAATC CTGCGAACTG GATGCTTCGG GTAATTACAT
CGGAAGACAT GGGTGATCTT GCGCAAAAGT ACGTCGAGTC AAAGGAGTAC GCACTCCTGC
GTAAAGATCT GGATGAAATC AAGGCTGTCC AAGATCCCGA GTTAAAAATT GAGTACAAAG
ATGAATTTGC TGCCAGCAAG GCTGTACGAC AGCTACTTGT CAACGGACGC CTACGCTTGA
TCTATTGGCG GTCACCAGCA TACAATCTAT CTCGCTTGAT GGTATCTATG GTGATTGCCT
TTGTTCTAGG ATCGGTCTTT ATTCTTGTCC GGCATCCAGA AATCTACACC GAAGTGGAGA
TGCGCTCCCG CCTGTCCGTA ATCTTTCTAA CGTTCATTAT CACCGGTATC ATGGCCATTC
TTTCGGTAAT CCCCGTCATG ACCAAGATTC GGGAGATGTT TTATCGCCAC CAAGATTCAG
GAATGTACGA TAGTGCCGCC ATTGGTTGGG CCCTCGGTTC GGCTGAGAAG CTTTTCATTG
TTCTGGCCAC CACCATCTTT ACGGTTGTCT TTTTGAGCGT AGCGGGTATG ACCAAGTCGT
TGCGTGGATT GTTTGGGTTT TGGGTACGTC TGCCCAAAGT GACTTGCTAC CAGGGACTTG
AAAACGAACT CGTGACACCT CATATCGTTT CTTGCTTTTT GCTTGTAGGG ATTCTTCACG
TTCAACTTTG CGATATACTC CTACTTTGGA CAGGCTTTCG TTTGTTTGGT TGAGAATCCT
GCAACGGCAT TAATTTTGTC GAGTGTCTTT ATCGGCCTCA ATAACTTCTT TGCCGGTTTA
ATTGTGCGTC CGCAACTGTT GGTTGGTTCG TTTTTTGCCT TTCCATTTTA CATCACGCCC
GGTCAATACG TCTACGAAGG TATGGTGACC AGTTTGTACA AGGGCAGTCC CAAAATTGTA
ACGGCCGATG TGGGCGGAGG CTTTTTCGAA TACTTGGTGG ACACGGGCGT GTGTGTTCCG
CAACAGCCAG AGCCGTGTCA GGGGACCGTG TCCGACTTTA TCGACGTCTT TTTCGGCGGC
GTCTTTACGG ACGATCATAT TTCTCGCAAC GCACTGATTC TGGGCGGTAT ATTGATCTTG
ACACGAGTCT TGACCTTTGC CGGTCTCAAG TACATTCGTT ATAATTAGCT TACATAACCG
AAAATACGTG AACACAGTG
 
Protein sequence
MQRPIIQDVS LCLQPGKNYL VLGPPASGKS TLLKAIAGQL KSSSTEKLEG QILYNGRELE 
QWYIENAFAY IDQLDKHAPR LTVDETFEFS FQCKTGGTFQ QAQDPRVLQD PKVMTAIQEA
DRSRLGVNMV LASLGLTEVR DTFVGNTAVR GVSGGQRRRV TVGEMITSRQ PVLCGDEIST
GLDAASTFDM VQVLTHFGKL AQMTRVFALL QPSPETFSLF DEIILVSEGL ILYAGPIDEV
EDYFAELGYR SPQFMDVADF LQTVSTEDGK KLYHPHGSIV SQLTLLKQVK KKYANSFFRN
TWLNLKRFLL LWTRDKRVIF ASAVKNILMG VSVGGVFRDV DDEVSILGAL FQSGLFIMLG
AMQSASGLVN DRVIFYKQMD ANFFSSWPYT LGRTLAGFPQ TIMDVFTFGT ILYFMVGLSD
RAVTEYFLFI AILMTFAMMM NMQLAVFASF APDSQLQVYS ACTLLLLILF GGYIVAPDAI
PSFYLWIYWW NPFAWAYRAL VINEFRSSRW DDPDATLAGI GFVYGIDSRP FEQDWLGYCF
LYMTIYFFGC VVLTAVSLGY RRRVNVPFKP VTLSFADVCY EVKASTKNET LKLLNGVNGI
FRSGRMCALM GSSGAGKTTL LDVIALRKRT GSVTGDVRLN GWSQDKISFC RCSGYVEQFD
VQSPELTVRE TILFSARLRL DRDVVTSEED REAFVDQVID DMELLPLADS LVGSDEGIGL
SFEQKKRLSI AVELAASPSV VFLDEPTSGL DARSALLVVR ALRNISDKGQ TIVATIHQPS
SAIFEMFDEL LLLKRGGQVV FQGDLGKDCS RLVNYFENLG ATKIELGENP ANWMLRVITS
EDMGDLAQKY VESKEYALLR KDLDEIKAVQ DPELKIEYKD EFAASKAVRQ LLVNGRLRLI
YWRSPAYNLS RLMVSMVIAF VLGSVFILVR HPEIYTEVEM RSRLSVIFLT FIITGIMAIL
SVIPVMTKIR EMFYRHQDSG MYDSAAIGWA LGSAEKLFIV LATTIFTVVF LSVAGMTKSL
RGLFGFWGFF TFNFAIYSYF GQAFVCLVEN PATALILSSV FIGLNNFFAG LIVRPQLLVG
SFFAFPFYIT PGQYVYEGMV TSLYKGSPKI VTADVGGGFF EYLVDTGVCV PQQPEPCQGT
VSDFIDVFFG GVFTDDHISR NALILGGILI LTRVLTFAGL KYIRYN