Gene PHATR_18524 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_18524 
Symbol 
ID7204357 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp413705 
End bp417174 
Gene Length3470 bp 
Protein Length766 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186340 
Protein GI219113513 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CACAACATTT CGTTAAATTT GCGAGTAACA ATTCTCGGAA CCACCGAGGA AAATAGTCGC 
ACGAAGCCAA AATGGGCAAG GGCGACGGCA AAAAGGAGAA AAAGGGTTCC CCATCGGAAA
CGATCGGTTG GGGCGCGACG CCCAAATTTG CCGTCTTGCA GTATCCGGGC GAAACGCAAC
CCGTCGGAAC GTTGTCGACC GCTGCTTCCC ACGAAACCTC CAACGATGCC ACCACCAGCC
TCACACCGGG ACGTCGAGTC CAAGCCGGTC GGGGTCCCAA AATGGCCAGT ACGGGCTTGG
CAATTGGCCG AGACTCCTTC ACGGTCCGGG TCCGCGATCC GGCCTGGTTG ACCTCCCGTG
CCGCCGTGTA CGACCGCATC CGCGCCTCGC GAGCGGCAGA ACTCGCGTTG AAAACGCCGA
CACCCATCCG GGTCGTCATG CCGGATGGCA AGGTACTGGA GCAGGATCAG GAAGGCCAGA
ACTTTAGGGC CTGGCAGACG ACGCCCTGGG ATGTAGCACG GGTGATTGCG CAGGGATTGG
CCGACGCCGC CACGGTGGCT AGGGTGACCT ACGCGGACTT TGTAGCCGAC TACGATAAGG
CTCAGGACGG TATGGAAGTG GAGGATACAC TGTCGGCGGC TATGGCCGAC GGGGGAGTCG
AGTCCGACGC ACAGGAAAAT CAGTTGCTCT GGGATATGAC CAGACCCTTG GTAGGGAACG
TGGCCAAACT GGAATTGCTC AAGTTTGAGG ATGATCAAGA CGCCAAAACG GTCTTCTGGC
ATTCATCGGC TCATATGATG GGAGAGGCAT TGGAACACCT TTACGGATGC AAGTTGACCA
TCGGACCGCC GTTGGCGGGA GGATTCTATT ACGATTCCTA CATGGGCAAG GATGCCTTTC
GAGAAGAAGA TTGTACGTTG TGTGTTGCAG TGTCAATTGC TCCAACCACA GAGTAGACAC
TCTTGCAGTA CTACCAAAAA GCGATTCTGT TTTCTGTCGT GCTGACCATT CTCTTGTCTC
TTGTCTGGTA GACTCCCCCG TGGAAGGGGA AGTAGGCAAA ATTATCAAGC AAAAGCAAAA
GTTCGAGCGC CTAGTCATTA CCAAGGAGGA AGGCTTGGAG TTGTTCGCCG ACAATCCCTT
CAAGGTCAAT ATTCTTACTA CCAAGGTCCC CGACGGATCC CGCACCACCG TCTACAAATG
TGGTGATCTA ATAGATCTGT GTCGGGGTCC ACACCTGTCC CATACCGGCA AGGTCAAGGC
CTTTGCCGCG ACACGGCATT CGGCCACCAA TTGGCTGGGA GATACCAACA ACGATACCCT
CCAGCGCATG TATGGAATTT CGTTCCCCGA CAAGAAAATG CTCAAGGTTT GGAAGGAAAA
TCAAGAAAAG GTACGTGAGC TGCCTGCCTC AGTTCTGCGT TGGTGGAGAA ATTATAATAC
GACAAGCAGT GAAGACAATA CACGAATCTC ATTCTCGGAG TAAACAATGT GACCGACAAC
TTTCCAATCC GCTGATTCTT CTGAGCTTGT CACAATTAAC GTTGATGGGC AGACGCATTT
TAATTGTACG CTAACTCGTA TTCTCTGTGT TTCTTGTCAG GCCAAAGAAC GCGATCATCG
TCGTATCGCG GCCAAGCAGG ATCTCATAAT GTTCCATGAC TTATCTGCAG GGAGCGCCTT
TTGGCTGCCG CACGGAGCCC GCATTTACAA CAAGCTCATT GATTTTATTA AATCACACTA
CTGGAACCGC GGCTACGACG AAATCATCAC ACCCAACATT TACAATCTTG ATTTGTGGCA
CAAGTCCGGC CACGCCCTCC ACTACAAGGA CGCTATGTTC TGTTTTGACG TGGAAGGTCA
AGAATGGGCT ATGAAACCCA TGAATTGTCC TGGTCACTGT CTCATGTTCG CCAATTCAAT
TCGTTCCTAC CGTGATTTGC CGCTGCGTTT TGCCGATTTC GGAGTGTTGC ATCGCAACGA
ACTTTCCGGT GCTCTCACGG GCTTGACGCG CGTCCGACGC TTTCAGCAGG ATGATGCCCA
TATTTACTGC CGCGAGGATC AAATTGAAAA AGAAGTCGTG GATGCGCTTA ATTTTATGAA
GGATGTTTAC GATACATTCG GAATGACGTA CAAACTGGAA CTGTCCACAC GTCCTCAAAA
GGCGCTCGGG GACGTTGCGC TTTGGGAGCG CGCGGAGGAA GCCCTGGCGA ACGCCATGGA
TATGTTCGCT GGCAAGGGTG GCTGGAGAGT AAATCCGGGG GACGGCGCCT TTTATGGACC
CAAAATCGAC ATCAAGGTTA TGGACGCCAT GGATCGTGTG CACCAGTGTG CTACTGTACA
GCTGGATTTC CAACTACCCA TTCGATTTGA CCTCCAATAC ACCACGGCGA GCAAGGAAGA
AGGCCAGCAG TTTGCTCGGC CAGTGATGAT CCATCGTGCC ATGCTTGGTA GTGTGGAGCG
CATGTTTGCC GTCTTGTGCG AACACTATGG AGGAAAGTGG CCATTCTGGC TGAGTCCTCG
ACAAGTGATG CTCGTGCCGG TTCATGCCGA ATTTTTCGAT TACAGCGAAG ATATTCGTGC
GAAACTCCAT GCCGAAGGTT TCTATGTGGA TGTCGATACC TCGAAGAATA CATTTCAAAA
GAAGGTTCGC AATGCACAGG TAGCGCAGTA TAACTTTCAG TTTGTCGTAG GCAAGGCCGA
GGTCGCCAAC GGCTCGGTCA ACATCCGCAA TCGGGAAAAT CAGGTCGAGG GTGAGAAAAA
GATTGATGAG ATGATCGCGA TGCTCAAGCA ATTGAGAGAG GAACACAAGT AGAAATTCGG
TTTCGTTAAG TTCCGTGAAA CTAAGTATTT ATCAACATTC CCGTAACATC AGACACCGTC
TTGTTCCTAG TTCTTCGTGC GCGCCCACAG CGCTTTGCTG CCGCTTCGAA CATGGCTCGG
TTGTGCATGT CGGTCAATTC TACAGGTGGC GTGGCACCGC GCTTCATAAT GAGCGCAGCA
CGAGGAAGAG CGAGTCTTTC AGTTCTCTTT TCGTCAAACT GCTTCGAGGC ATCCAGTACT
GCCGTGCGCA TCTCGCGGTG TTTTCTGTAC AACGCACGGT TTATAACCAA GGGAGACATG
TTCATATCTT TGATGACTTC TTCGTGCAAT CCGTCGACAT GTCCAGGGAG CATCACTGCC
CGAGAAGAAT CTCGAAGATC CCCGGAAGCA TTTCTTTCAT TCTCCACCCA TAAAGGAGAG
CCTGGCTCGT GTGAATGAGA ACGCTTCTTT TTATAGATCA AATGAGAGTC GGACTGTTGA
TCTAGAAGTT CGAGATCGGA AGCACTGGCC CCGATGCCAT TCGTTTTCTC CTCTCGAACA
GGTGTGAGAC CTTCCGTTGA TAGGCCACTC ACCAAATGCC CCGGGGCTGA AAGAAGATTT
TTGTCTGTTG AATTTATTCC GACACTCACC TCCGCGTTGC GGAGGGCACT
 
Protein sequence
MGKGDGKKEK KGSPSETIGW GATPKFAVLQ YPGETQPVGT LSTAASHETS NDATTSLTPG 
RRVQAGRGPK MASTGLAIGR DSFTVRVRDP AWLTSRAAVY DRIRASRAAE LALKTPTPIR
VVMPDGKVLE QDQEGQNFRA WQTTPWDVAR VIAQGLADAA TVARENQLLW DMTRPLVGNV
AKLELLKFED DQDAKTVFWH SSAHMMGEAL EHLYGCKLTI GPPLAGGFYY DSYMGKDAFR
EEDYSPVEGE VGKIIKQKQK FERLVITKEE GLELFADNPF KVNILTTKVP DGSRTTVYKC
GDLIDLCRGP HLSHTGKVKA FAATRHSATN WLGDTNNDTL QRMYGISFPD KKMLKVWKEN
QEKAKERDHR RIAAKQDLIM FHDLSAGSAF WLPHGARIYN KLIDFIKSHY WNRGYDEIIT
PNIYNLDLWH KSGHALHYKD AMFCFDVEGQ EWAMKPMNCP GHCLMFANSI RSYRDLPLRF
ADFGVLHRNE LSGALTGLTR VRRFQQDDAH IYCREDQIEK EVVDALNFMK DVYDTFGMTY
KLELSTRPQK ALGDVALWER AEEALANAMD MFAGKGGWRV NPGDGAFYGP KIDIKVMDAM
DRVHQCATVQ LDFQLPIRFD LQYTTASKEE GQQFARPVMI HRAMLGSVER MFAVLCEHYG
GKWPFWLSPR QVMLVPVHAE FFDYSEDIRA KLHAEGFYVD VDTSKNTFQK KVRNAQVAQY
NFQFVVGKAE VANGSVNIRN RENQVEGEKK IDEMIAMLKQ LREEHK