Gene PHATR_44021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_44021 
Symbol 
ID7204216 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp716332 
End bp718703 
Gene Length2372 bp 
Protein Length728 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186403 
Protein GI219113639 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGAAAGCCAA TCGTCTCCCT GCCGGTGAGC GAAAAATCTC AACTATAGAT TCGGTGTACT 
GGTGCAGCTT GCATTGAGAA ATCTAGTTTA CTTCGGACGA GGGTTCTCTC AGATTTTGGA
CAGAATGAGC GGCCGTGCCG TGGAAGGACG CGACGTCAAT TCGATTCGCG AGGATGACGA
TGAACATAGC CAGTGTTTTG CCCTAGATCT GCGCAATCAA TCCAGAGGAC CAATGGCCAT
GGACCAATGC ACCGAAACGA GCAACAACTG CGTAAGCGAG CCTCCCCTAT CTCAAGCCGA
AATGTCGGGA AGTGCGCCTT TCGCCAATCA ATTGTCCTGT CTGAGCAAGC AAGCAAGCCC
AATGAGGTCC GAAGGGTGGA ATGAGCTCCA GCCCGCCGGA GAGGAGGCCG CAATGGAAAG
GCGACTGCAA AGAACGAAGG TGGAAAAGCC ATCAGAACCT TTTGAGTTCA GGGATACCAA
TCAATATCCT GCACAAAAAA TTATCTCGCC AATCGATTCC AAGCAAGCAA TCGTGGTGGA
CAAGGGCAGT TTGGCTCAGG GAAGACCCGG TACATCGGAC ACAAAAACAA TATCATTTCC
TCCAAGAGAG AGTGTCGATA AATCAAACCG CTTGCTGCAC GTTGATGTAA GCTGCAAACG
AGAGACTATA CAGACTCAAT CCAACGAAGA TTCGGAAAGT GAAGGAGCCT CCGCAGATAT
TGGCGACTAT TCCGAATTTA GCTATAGTGA CGAAGAAAGG CCTTTGACGG TTCAAGAAAA
CCAGCTTCTC TACGGATCAA CAGGTGAATA CAGTTTATAC AGCTTGGAAC AGATGGAAAA
CCGTTTCGAG GATCGCGATT TCCATGGTGC TCACCAAAAA GTAGTTCGAC ACAGCCATAC
ACATTCGCCG TTCGCCAACT TGGGCAAGAC CTCAACTCTC AAGGCCGCGC GCGCCACCTT
CTTACCTCAG ACTATTGTTC TGGAAACTGA CGCTAAAAAC TATCCCACTT ATCTGTGTCC
AATATGTGGC ACACGACAAA GGGAGTTCTT CAGTGTTTCC GACGCACCGC GACAGCTTGA
AGGTCCCTCC GGCTATTTGG CTCTGTACTT TTCAATTTAT GTCATCTCGT CGCTTTTCAT
CTTTGGTTTG GAAGAGGGAT GGCGACCATT GGATTGCATT TATTTTGCAG TTGTTACACT
CACAACCGCT GGACTGGGAG ATTTCGTCCC CACGTCAGAT GTGAACAAAA TTATTTGTTC
TATCTTCATT TACTTCGGTG TGGCCTGCAT TGGATTATTG CTCGGATCTT ACATCGCTGG
CATGCTAGAT GATAGTGCAT CACGGGAAGC CAAAAGAAAT CAGCTGAGTT CGTGTCCAAA
TTGTGCCCGT ATAAAGACTC TCCAGGATGC AACGTCCAAT ACGCCAGAGC ATACGGTGCC
TGACACAACA CCAGCTATTC CTAGGCGAAG TAATCGTCGA AGTTGTGCAT CCGAGCGTAT
ACTTGACAAA GCTCAAAACC AGGATCACAG TGCTGTGTAT ACGTCACATC ACTGCCATCG
AACGCACACA GGTCAGCGGC AAAGCTCGTT TGAAAGTCAC TCATCCCCTT TCCACAACAA
AGGGGTCGAG GTTACAACTT CTTCTTCGCA AGGTGCATCG ACGGTTGAAA ATATAGCGTC
TTTGGGTGAG TCGCAGGTTT CAAAAGGTAG CGAGGCAGGA GCAGCATCTA GGAAACTAGG
AAACATGCGC TCCGCAGCTA TTCTCCACTA TCTTCCTCAT CCCCCCCCCC CCCCCCCCAA
CAAATCTTTT AGGATCGCCA GTCACAAGAG GGATCCTTGG CCGACAAAGG CATACCCGTC
ACGATTCGTT TGACATAACA GGAAATGCCA GAATGTACAG CGCAGCAGCT GGAATGGGAC
GAACACGCAA ATTTAGCGAA GATGTAGGAG TAATGATGAT TCCATCGATC AGACAAGCTC
CCCCCACTAT TCAGGAAGGT GCCCAGCTCG AGACTCCGCC GTTGGGTACT GATGCAAACG
GGTGTATGGA GCCACCATAC ACAAGGTCTC GCGGGTTCAC TTCTGAATCG GATAGTTCCA
AAGAAGATGA CATGAGCGAT AGCGACGACG ACGACTCTTT CGCAGCCTCC ACTCATACCT
CGTCAGGTTC TTCCGAAGCT GTAGACGATG GAATGTTCAA ATTGCAAGTG GCCAAGTATG
TGTTTCTGAC GTTGAAGCAG GCGCTGGTCA ATTCAATGGT CATCATAGGG GTTGGCTGCC
TCGGGTTTCG CTTTATCGAG GGCTTTTCAC TCGTAGACAG CTGGTATTTT ACAACGGTTT
TTCTCACAAC CGTCGGATAT GGTGGGTGCT GA
 
Protein sequence
MSGRAVEGRD VNSIREDDDE HSQCFALDLR NQSRGPMAMD QCTETSNNCV SEPPLSQAEM 
SGSAPFANQL SCLSKQASPM RSEGWNELQP AGEEAAMERR LQRTKVEKPS EPFEFRDTNQ
YPAQKIISPI DSKQAIVVDK GSLAQGRPGT SDTKTISFPP RESVDKSNRL LHVDVSCKRE
TIQTQSNEDS ESEGASADIG DYSEFSYSDE ERPLTVQENQ LLYGSTGEYS LYSLEQMENR
FEDRDFHGAH QKVVRHSHTH SPFANLGKTS TLKAARATFL PQTIVLETDA KNYPTYLCPI
CGTRQREFFS VSDAPRQLEG PSGYLALYFS IYVISSLFIF GLEEGWRPLD CIYFAVVTLT
TAGLGDFVPT SDVNKIICSI FIYFGVACIG LLLGSYIAGM LDDSASREAK RNQLSSCPNC
ARIKTLQDAT SNTPEHTVPD TTPAIPRRSN RRSCASERIL DKAQNQDHSA VYTSHHCHRT
HTGQRQSSFE SHSSPFHNKG VEVTTSSSQG ASTVENIASL GESQLFSTIF LIPPPPPPTN
LLGSPVTRGI LGRQRHTRHD SFDITGNARM YSAAAGMGRT RKFSEDVGVM MIPSIRQAPP
TIQEGAQLET PPLGTDANGC MEPPYTRSRG FTSESDSSKE DDMSDSDDDD SFAASTHTSS
GSSEAVDDGM FKLQVAKYVF LTLKQALVNS MVIIGVGCLG FRFIEGFSLV DSWYFTTVFL
TTVGYGGC