Gene PHATRDRAFT_21922 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_21922 
Symbol 
ID7203044 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp83087 
End bp84429 
Gene Length1343 bp 
Protein Length306 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182319 
Protein GI219124036 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTTTTCTCGG CGGTACTGTA TTCGGCGTCT AGTGCCATTA TTCATTACCC TACATGTCTT 
CCCTATATGC AGTCTCGATG CCTTTCTGGT CCCGGACGTA AGACTCAGAA GGACTCAGGG
TCTGAATCAA CGCCAATCTT ATATACGCGC CGAGAAGGAA AAGAAATTGA CGGCGGACAC
GGATCTATTG AAATCCCTCG ATGATTCTTT TTCCTACGAC GGCCGCCTGG AAGGATCTTC
CTTTGCAGAC TTTCGCTGTG GCTTTGTCAC GGTTATGGGG GCGCCGAATA TGGGCAAGTC
AACGCTGCTG AACGCACTTC TGGAAGAGGA CCTGTGCATT GCGACGGCCC GTCCCCAAAC
GACTCGTCAC GCTATTTTGG GTCTCATGTC TACCGATAAA TGCCAGGTCT GTTTAGTTGA
CACGCCCGGG GTGATTGAAG ACCCCGCCTA CGAGCTGCAG GAAGGTATGA TGGAAGCCGT
TACAGGTGCT GTGGCGACTT CTGACGTTCT TTTGGTCGTT ACGGACGTCT TTTCTACACC
TATACCCGAT GACGAATTGT TTCTCAAAGT TCAGAGAACA CGAAAACCGG TACTAGTAGC
GATCAATAAA ATCGACTTGG CAAAAAAAGT AAACAAAGCA GCGGAGGAGA ATCGAGACAA
GACGGTGACG GTCGAAGAAG CCGTAGCGTT CTGGCGAGCC CAGTTGCCGA ATGCCCTCTG
CATTCTTCCG CTATCGGCTT CGCAAGGAAT CAACAATGTT GGTGTGGTGG CGATGAGAAG
GATTCTCACG GGTGGCCCGG ACGTGCCGTC GGTGATCCGA GCAATGGGGA GGCCCATTCC
AGGAATGTTT CTGGGGGACA CCCAATTCGT AACGGACGAC GCGTGTCGAG AACTCTTACC
GATTAGTCCC CCGCTGTACG ATCCGGAAAC ACTAACGGAT CGGCCGGAAC GCTTCATTGC
GTCGGAAATT GTTCGGTCCG CTCTCTTCCA GGTACTGAAG AAAGAGTTGC CGTACTGCTG
CGAGGTGAGA ATTCGAGAGT TCAAGGAACC AAAAGAGGAG GGTGAAGTAA TACGGATTGC
GGCGGACGTT CTAGTAGAAC GCGACTCTCA AAAGGTAATT GTTATTGGTA AGAATGGCGC
TCAGGTGAAA GAGATTGGCG TGATCGCGCG GGAGAAGCTG GAAGCCTTTT TTCGGCACCA
AATTTTTTTG AACTTGTCGG TGAAAGTCGA CAAAGACTGG CGAAAGAATA CTCGCAAGCT
TACTGAGTAT GGATACATGA AACCCAAAAG GTAAAATTGG GGGATAATGC CAAAGACTCA
ACTAGCAACA ATTTTCTGTT CCG
 
Protein sequence
MGAPNMGKST LLNALLEEDL CIATARPQTT RHAILGLMST DKCQVCLVDT PGVIEDPAYE 
LQEGMMEAVT GAVATSDVLL VVTDVFSTPI PDDELFLKVQ RTRKPVLVAI NKIDLAKKVN
KAAEENRDKT VTVEEAVAFW RAQLPNALCI LPLSASQGIN NVGVVAMRRI LTELLPISPP
LYDPETLTDR PERFIASEIV RSALFQVLKK ELPYCCEVRI REFKEPKEEG EVIRIAADVL
VERDSQKVIV IGKNGAQVKE IGVIAREKLE AFFRHQIFLN LSVKVDKDWR KNTRKLTEYG
YMKPKR