Gene PHATRDRAFT_21821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_21821 
Symbol 
ID7202864 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp443997 
End bp445371 
Gene Length1375 bp 
Protein Length311 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182069 
Protein GI219123516 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.563947 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTTGCTGCAA CATGCAATCC TCGATCTCGG TAGGTTTCTA CGTAATTTTG TACGGGACGT 
CCCGAGGGAA ATGACCATGG CCAAATGATG TTTTCTAACA CTGTTTTGGA CTGTTTTCCT
TGTTCAGAAA TACCGCATTA CTAAGCGGGT AGGCGGAGGA TCGTTCGGAG ATATTTATCT
CGGAGTTGGT GCCAATGGGG AGAAGGTACG TCTCCCGTCG CTCTTTTCGA TCTAGTACGA
AAGTACCCGC AGCCTGTCCA CGCGATTCCT CGGAATATAT GGGAGAGTAC TTACTGCACG
GATGTATTCC GGGACAGTGC ATTTTCTAAC ACTGTTGTAA AATCGTTTTA GGTTGCTGTG
AAGTTCGAAA AGCACGGCGC TCGATGCCCT CAGCTTCGTC ACGAATACAA AGTTTATCGC
GAGTTGCAAA ATGCACCCGG CTTTGCTAAA GTTCACTATT TTGGTACACA GGATTCTTAT
AATCTCATGG TTATGGATCT ACTGGGTCCT TCCCTGGAGG ATCAGTTTAA CAAGTGCGGC
CGAAGATTTA CTCTCAAGAC TGTTCTCATG GTTGCCGATC AGATGTTGGA GCGTGTGGAG
TTGATGCATT CACGTCACTT GATCCACCGT GACATTAAGC CAGCGAATTT CGTTACCGAT
GCGGGGCGTG GTAACGGAAA CTTTATATAT TGTATCGATT TCGGTCTTTC GAAGCGCTAC
CGCCATCCTC GGACGCTTCA GCACATCCCG CAGCGTGAAG GCAGATCCCT CACAGGAACG
CCTCGGTACG CTTCGATTAA CAACCATTTA GGCGTGGAAC AATCTCGTCG GGATGACTTA
GAGAGTATCG GGTATGTACT TGTATACTTC CTGAAGGGCG GTCTTCCATG GCAGGGACTG
AAGGCCAAGT CCGCGACGAA AAAGTACAAG CTTATCATGG AAAAGAAGCA GTCCATCACT
ATTCCGGCGT TGTGCCAAGG ATGTCCCAGC CAGTTTGCTG AATACTTGGC TTACTGCCGA
TCGCTCAAAT TTGAGGCCAA GCCGAACATC GCATACTTAC GTGGTATGTT CCGTGACTTG
TTCCGCTCGC AAGGATATAC GAACAACCAC AGTAGTCTGG ATTGGGACTG GAATCGCGTG
GAAGGAGGCG CAGCTGCCGG CGATCGACCG GACGACAAAG CTGGGTACTG AACGGAAGCG
CAATCATTGA CTGTGAGTAT TGAGAAACGA GGATAGCTGA GTACTCGTAT CAGATATTCA
CGAAGCACCG TAAACCCTCT TCGCCCTCCT TTGGATTGTA TCACGTATTG ATATCAATCT
TTAAATGTTT TAACGACACG CCAACTGTCA GTTAGCTTTT GATATTATCA AATTT
 
Protein sequence
MQSSISKYRI TKRVGGGSFG DIYLGVGANG EKVAVKFEKH GARCPQLRHE YKVYRELQNA 
PGFAKVHYFG TQDSYNLMVM DLLGPSLEDQ FNKCGRRFTL KTVLMVADQM LERVELMHSR
HLIHRDIKPA NFVTDAGRGN GNFIYCIDFG LSKRYRHPRT LQHIPQREGR SLTGTPRYAS
INNHLGVEQS RRDDLESIGY VLVYFLKGGL PWQGLKAKSA TKKYKLIMEK KQSITIPALC
QGCPSQFAEY LAYCRSLKFE AKPNIAYLRG MFRDLFRSQG YTNNHSSLDW DWNRVEGGAA
AGDRPDDKAG Y