Gene PHATRDRAFT_19805 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_19805 
Symbol 
ID7200026 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp860378 
End bp861578 
Gene Length1201 bp 
Protein Length279 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179523 
Protein GI219117457 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000518484 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ACGCTTGCCA TTGACCATTC CTGTTGATTT CATTCCGTAA ACGTTTTACC ATGTGCTTGT 
GTTGTTTTAC AATCTCCACC GCCGAAGTCG GCGTCATTGA ACGCTGGGGC AAATACAGTC
GCCTCGTACA GCCCGGACTC AACGTGATTT GCTGTCCCAT GGAGTCGCTC GTGGGCAAAC
TCAGTTTCCG CGTGCAGCAG CTCAACGTAC GCGTCGAAAC CAAGACTCTC GATAACGTCT
TTATCACTTC CGTAGTATCC GTACAGTACC AAGTCCTCCG CGACAAGGTG TACGAGGCCT
TCTACGCGCT TTCCAACCCC GCCAGACAAA TCACCGCACA CGTCTACGAC GTGATGCGTT
CGCAACTACC CACACTCGAA CTGGACGCCG TCTTTGAAGC CAAGGAAGAC CTCGCGCTAG
CCGTCAAAAA CGCACTTTCC GAAATCATGA CTACGTACGG ATATCAGATT GTGCAAACTC
TCATTACCGA TTTGGATCCG GATCAGGTAC GTCCCGTGTA ATGTTGGGGC GTCTCTCTAT
GTGTTAGTGT GTGTGTGTGC ACGATACTTT GCGCGCCGTG GTTCGGAAGG AGGGTAGTTC
ACGTCGCGCA CTTAATCTTT GGTCAACATT GCGAATGTGC TCATACTCTT TTTTTTCGTT
CGCCTTGCTC CACAGCGCGT GAAAAACGCC ATGAACGAAA TCAACAGTTC CAAGCGACTC
AAATACGCCG TGGCGGAGCG TGCCGAAGGA GACAAGATCC TCAAGGTCAA GGGCGCCGAA
GCCGAGGCCG AAGCCAAGTA CCTTAGTGGT GTGGGTGTCG CCAAGCAGCG CAAAGCCATT
GTCGATGGCC TGCGTACGTC AATCGTCGAC TTTTCCGATC ACGTGGAAGG ATCCAGTACC
AAGGAAGTTA TGGATTTGTT GCTCTTGACA CAGTACTTTG ACATGATTCG CGACGTGGGA
GCCGAGAGCC ACTGTAAGAC AACCTTTGTC CCATCGTCTC GGGGTGCACC CGACGACATG
CGCAACGCAC TCCTGCAATC GGCCGCCGGA AGACTCTAAA AAGTTTGCGT CCGAACGATG
ACTACCTATT TCTGTCGTTC TCCTCGTTTT ATTGCCTTTT GCTCCTTGTT TTTCTTCGCG
GCAATCCGCG ATTCTAATCT GTTTCGTGTC TCAATAATCG TACTACGTTT CTTTTATACA
T
 
Protein sequence
MCLCCFTIST AEVGVIERWG KYSRLVQPGL NVICCPMESL VGKLSFRVQQ LNVRVETKTL 
DNVFITSVVS VQYQVLRDKV YEAFYALSNP ARQITAHVYD VMRSQLPTLE LDAVFEAKED
LALAVKNALS EIMTTYGYQI VQTLITDLDP DQRVKNAMNE INSSKRLKYA VAERAEGDKI
LKVKGAEAEA EAKYLSGVGV AKQRKAIVDG LRTSIVDFSD HVEGSSTKEV MDLLLLTQYF
DMIRDVGAES HCKTTFVPSS RGAPDDMRNA LLQSAAGRL