Gene PHATRDRAFT_20893 details

Gene Information

Locus tag: PHATRDRAFT_20893
Symbol:
ID: 7201853
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011678
Strand:
Start bp: 771598
End bp: 772830
Gene Length: 1233 bp
Protein Length: 331 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002181070
Protein GI: 219120673
COG category:
COG ID:
TIGRFAM ID:
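
The gene length listed above agrees with the start and end coordinates when both ends are counted: 772830 - 771598 + 1 = 1233 bp. A minimal Python sketch of that check, assuming 1-based inclusive coordinates; the function name `span_length` is hypothetical and not part of the source database:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints belong to the feature, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = span_length(771598, 772830)
print(gene_length)  # 1233
```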


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.775902
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TGAGATCGAT ATCGACGAAC AAACCTATAC TGCGACTTTT TCGCAATGCC GCACTATGCA 
ATCCAGAGAA GGTTTTAATG TCCCAACACG GTGACGGGCT TTTCAATACA AAAATTCCCA
TATTGCATCG TTGTTTTGCT TCTTCTCCAA GCAAGCCCCA TGCCTCCATC TTCTCTACCG
ACACCGTTCG TCTTTCTAAA GTGTTATCCC TGCATACAGA AAACCTGGTG ATCAGCCGTC
GAGAAGCGGA AAGGATGATA CGATCAGGAG ATGTGACTCT GGCAGGAAAT GTGTTAACAT
CGCCCATGAT GATGCTGAAA GAGGAGGACT TGAATGATGG AGCTTTGAAG GTGAACGGAA
AGGTGGTAAA CCTTCGATCA AGTAAGGGTC CTCGTTCCGT TGCAGGAGAA GAAAATTCGG
TTCACAAAAC CCGCGTTTGG ATCGCGCATA AACTTCCGGG AGAAATAGTC GCTGACAACG
ATCCTTACGA TCGTCCCTCT TTATTGCAGC GATTGATACG AGGTGGTGTC GGCAAAGTTG
GCAAGACTCG ACTTCATCTT AAATCCGTGG GACGCCTCGA CATGAACACA GAAGGACTGA
TTTTGGTGAC TAATGATGGA AAGTATGCTA GAGAGATGGA GCTGCCGTCA AATAAATTGC
ATCGAACGTA TCGCGTTCGA GTACATGGCT TGCTTACGGA CCACAAGTTG GCCCGAATAC
GGAAAGGGGT AACTGTGGAA GGTATAAGGT ATCCTCCAAT GAGGATCATA CCCGAAAGCA
CTCGGCAATC GCAATCAACA AATAAGTGGC TAAAAGTGAC TTGCACAGAG GGAAAGAATC
GCCAAATTCG AAACGTTTTC AAGTATTTAG GATGTAAGTA CCGACCTTGA GCCGCCGCAC
ACCTCAGACT CCTTTTACTT CACTTGTCTT CCTCACCGAA ATTTTTTTTA GTAACGGTCA
CACGATTGAT TCGCATTTCC TATGGCGATT ATCGGCTGCA AACTATTCCG CCTGGTATGG
CAATCGAAGT TCCAGCCAAG CATATCGAGA ATCAAAAACA TCGGGGTCGA TTGATTCGAC
CGACTAAGCC CGCACCGAAA CGGAGTGACG AGGCCGAATC GTCGAAGGCG GTGCAGTGGG
TCCGACATTA GATTTCGATC TTGAATAGTT GTTTTACATA GAAGTATAAG GGATAAGAGG
TCACTTGTAA AATCTAGCAC AAGGCTTCGC GGG
 
Protein sequence
MSQHGDGLFN TKIPILHRCF ASSPSKPHAS IFSTDTVRLS KVLSLHTENL VISRREAERM 
IRSGDVTLAG NVLTSPMMML KEEDLNDGAL KVNGKVVNLR SSKGPRSVAG EENSVHKTRV
WIAHKLPGEI VADNDPYDRP SLLQRLIRGG VGKVGKTRLH LKSVGRLDMN TEGLILVTND
GKYAREMELP SNKLHRTYRV RVHGLLTDHK LARIRKGVTV EGIRYPPMRI IPESTRQSQS
TNKWLKVTCT EGKNRQIRNV FKYLGLTVTR LIRISYGDYR LQTIPPGMAI EVPAKHIENQ
KHRGRLIRPT KPAPKRSDEA ESSKAVQWVR H
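
The sequences above are displayed in space-separated blocks of 10 characters, so the spacing has to be removed before they can be measured or compared with the listed lengths. The following is a minimal Python sketch of one way to do that and to compute a GC fraction. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the database's own method for the reported GC content is not stated in the record.

```python
def join_blocks(blocked: str) -> str:
    """Remove the spaces and line breaks used to display a sequence in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First display line of the gene sequence above, used only as a small example.
first_line = "TGAGATCGAT ATCGACGAAC AAACCTATAC TGCGACTTTT TCGCAATGCC GCACTATGCA"
dna = join_blocks(first_line)
print(len(dna))  # 60
print(round(gc_fraction(dna), 3))
```

Applied to the complete gene and protein sequences, `join_blocks` should yield strings of 1233 and 331 characters, matching the Gene Length and Protein Length fields.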