Gene PHATRDRAFT_37072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37072 
Symbol 
ID7202091 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp106060 
End bp107480 
Gene Length1421 bp 
Protein Length381 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181300 
Protein GI219121911 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.730274 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCCCAA CGAAAGGACT TCCGCCGTAC GCAAGTGTGC TGGTGAAGCA TGGATCGGTA 
GACAGTAGAG ACAACGACGA AGAAAAGGTA AGAAGCGGTT AAATCTGAAT CCAGAATGTA
CACATGGTTC AGTTGTCAGT AAGGTGCGAC GAAACTCGAA CTCGCAGTAA AGTATACTGG
AGGAGCGAAA CATGGTCTCT TGCCGAATAC TCATTGATGC TATTTTTTGT TAGAGATCAA
AGCTCTTGCG TCATCTTGGA AATGGATGTG AGGCTATAAA TAGAGCCCCG GTTCAACCCG
GACAGAATAT TTCCTCCTAC TCAAGCTTTC CTGGTGCTCC AATGCCGACT CCTCTTCTGC
AAACGAGCAA TTTTCCATCG GGGTTTGGCA GTGAACTTCC TTTGCCCTCC GTTTTGCGAG
CAAACGCTTC TCTCCACTCT CTTGGCTTTG GCTCCCCAAC CAACGCATCC ATGACAAGCA
CGGGTCATCC TTTGATTGGC CAATCCATGT TCTCGACGCC AATACATGGG TGCTTCTCCA
CAAGTGATAG AGGAAACGAC TCCTCGAGCA CATTAGCTAT TCTGTTGCCC GTTTCAGAAT
CAATGTCCGC GTCGAGTCTT GGAATCGACA GCACACTTCT CCAGTGCGCT GACTATGCGC
GCAGCGCTGA TGACTTCCAC CTAGAACATC AAATACAGTC AATTACAGCT GCTTCTTTGT
TAGATTTAAC ACCCTCTCTT TCTCACCAAG AATTTCAAAC CTTCGTCGAG ATCTCCGAGA
ATCATAGCGC TAGTCGGCCC ACGTCAAATA GCAGACTCTT GAGCAAACGA GATTCGATCT
TGTCTACAGA TATCCAGAAA GAGAAGCGAC AGAGGCCATT TACATGTGCC GACGCGATTG
CCCATGCTGC TACCGAGAAC ACGGGAATCG CAATCCGTAG TCGACGATTG TTTGAGCAGG
TTCCCAAAAC AGTAAAGCCT TGCAAGTGCA AGAACACGCA CTGTCTGAAG CTGTATTGCA
CCTGTTTTCA GAAGGGATCA TTTTGCGATC CAGACATTTG CAAATGCATC GATTGCTACA
ATTTGAGGGA ATTCAACGAG ACCGGGGGCA AGCGACAGGA AGCTGTTTCT GAAATCTTGT
TACGGCGCAT TGACGCCTTT GAATCCCGTC CAAAGAAAAA GACTGGTGAA GGATGTGCTT
GCAAAAAGAA TCGGTAAGTC AGAAGGGATA CACCTAAATG CTGTAGCCTG TACAACTCTC
AAATCTTGCC ATATATGAGC AGATGTCTTC AAAAGTACTG CGACTGCTTT GCCACAAAGT
CGGACTGCAC TGAACGCTGC AGGTGTAGCG CCGCATGTGG CAATAATCGG TTCCCGGCGA
TTGAAGACAA TCTATCAAAC GAAGAACCTC CCAGTCCATA A
 
Protein sequence
MGPTKGLPPY ASVLVKHGSV DSRDNDEEKR SKLLRHLGNG CEAINRAPVQ PGQNISSYSS 
FPGAPMPTPL LQTSNFPSGF GSELPLPSVL RANASLHSLG FGSPTNASMT STGHPLIGQS
MFSTPIHGCF STSDRGNDSS STLAILLPVS ESMSASSLGI DSTLLQCADY ARSADDFHLE
HQIQSITAAS LLDLTPSLSH QEFQTFVEIS ENHSASRPTS NSRLLSKRDS ILSTDIQKEK
RQRPFTCADA IAHAATENTG IAIRSRRLFE QVPKTVKPCK CKNTHCLKLY CTCFQKGSFC
DPDICKCIDC YNLREFNETG GKRQEAVSEI LLRRIDAFES RPKKKTGEGC ACKKNRCSAA
CGNNRFPAIE DNLSNEEPPS P