Gene PHATRDRAFT_21961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_21961 
Symbol 
ID7203069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp189348 
End bp190470 
Gene Length1123 bp 
Protein Length320 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182172 
Protein GI219123730 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTACATTGAC AAGCTGCATT CGCTTGAGTT GCTAATCGTG GAAGAAATCG AATGTTGCGC 
ACCGGATTTG TCGAGCTCGC AAGGAACAGG AAAAAAGTCA GACAGTTTGC AATCTTGGTT
GGGGGTGAAC AATTTAAATG AACTACGTCG AAAAGTTGAT GATATGTCTG CTACAGAACT
CAAACAGCTT ATATTATCAA AGGTGAAGAA CTCTGCGCCT CCGTGTTCTT CGGCGTCGCC
GCTACTCATC AAAACGGAGC AAAGCGATAC CTCGAAGCTA GTGGAAGATT TGTTAAATGA
GCTTTCGACA GCCCCGCCTT CCAAAGAGTG GGAGATTGAT TTGTACGAAG TGCGGTTTTT
GCGACGGATA GGACAAGGCA ACGCGGGTAC TACATACTTG GCTGACTGGA GTAACCTGAA
AGTTGCCGTC AAAGTTGCTT CTATTTCCGA GATGGGTTTG GATGGTTGGC GCAAGGAAGT
ACAATCCCTA CAGAAACTTC ATCATCCCAA CATTATTCGC TTACTTGGGT CGGTCTACCA
CCCAAATCCA TTAACATTTT GTTTGGTGCT AGAGTACTGT GATGCGGGTG ATCTATCGAC
TGCGATTCAA AAGGTAACTC CCCGTAACTT TGTTTTTCAC GTTGCGCAAA GTATTGCGAG
GGGCATGTGC TATCTCCACA ATCGGGGGAT TATTCATCGC GATATCAAAC CAGCGAATGT
GCTCTTGAGC GGCAAAGTTT CTTCCGGTCA ATTTGACGTC AAGGTAACAG ACTTTGGGGT
AGCGACGGAC ACCAATTCGG TAGAAGACCG AACCGCGGAG ACAGGAACTT ATCGTTGGAT
GGCTCCAGAA GTGATTCGTC ACGAAGCCTA TAGTCAGACT GCCGACGTCT ACTCCTTCTC
TATACTTATG TGGCAGCTCT TGACTCGCGA AGATCCTTTC GAAGGGAAAT CTCAGATTGA
AGCGGCAGCG GCCGTTGCCA TGGAATCTGC CCGCCCTCCG TTTCACGCCG AAACGCCTGA
TTCGATAGTG CGGCTGATTC AAGCCTGCTG GAGCGATGAT CCACGGAAAC GCTTACCGTT
CGACAAAATT TCCAAGACTC TGGCTAGTAT TGAATCTACA CAG
 
Protein sequence
MSATELKQLI LSKVKNSAPP CSSASPLLIK TEQSDTSKLV EDLLNELSTA PPSKEWEIDL 
YEVRFLRRIG QGNAGTTYLA DWSNLKVAVK VASISEMGLD GWRKEVQSLQ KLHHPNIIRL
LGSVYHPNPL TFCLVLEYCD AGDLSTAIQK VTPRNFVFHV AQSIARGMCY LHNRGIIHRD
IKPANVLLSG KVSSGQFDVK VTDFGVATDT NSVEDRTAET GTYRWMAPEV IRHEAYSQTA
DVYSFSILMW QLLTREDPFE GKSQIEAAAA VAMESARPPF HAETPDSIVR LIQACWSDDP
RKRLPFDKIS KTLASIESTQ