Gene PHATRDRAFT_2037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_2037 
Symbol 
ID7202519 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp414078 
End bp415190 
Gene Length1113 bp 
Protein Length347 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181552 
Protein GI219122438 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.417362 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATTATCATTG GTGTGCTTAT CTCTCTTTCC GCCCTCTTTT CGGGCTTGAC GCTGGGACTT 
ATGAGTCTAG ACAAGACGGG CCTCGAGATC GTTATGCACG GTGACGACGT CACTAACGCT
CGCTACGCAT CCGATATTTT TCCCGTACGA GAAAATGGCA ATCTATTGTT GTGTACACTG
CTACTAGGAA ACGTGGCCGT CAACGCGCTC TTGTCCATTA TGATGGGTGA CATTGCTGGC
GGTCTGATAG GTTTCTTATC CAGTACATTT TTGATCGTCA TTTTTGGAGA GATCATCCCA
CAAGCCGCCT GCAGCCGCTA TGCGCTGCTC ATTGGTAGCA AAACTGTCCC GCTGGTTCGT
GTGATTCTTG TACTCTTCTA TCCAATTGCG GCCCCATTGG CTTATATGTT GGACAAGCTT
CTGGGGGCCG AATTGGCCAC AATCTATTCC AGCGCCGAAC TTATGAAATT GCTACAGATT
CACGTAGAAA ACGAAGCCAT GGATCAGGAT ACCGCAGTTG CCATGAGGGG CGCCCTCAAA
TACAAGGATA CGACCGTCAA AGAAGTCATG ACGCCACTCA GCAATACCTT CATGTTGTCG
GTTGACGAAA AACTCAGCTT TGAAACCATT GCGAAAATTT TTAAAACAGG ATATTCTCGA
ATTCCAGTCT ACGAGATTTC AACGGTATGT TATTTTACTG GACCATGCGA GCTGCCGAAT
ACGAGTGACG CTGGGCGTGT AAATGCTAAC TGCTAGCGTC TATCTCTTGG CTTCCAGAAC
AACGTTATTG GCCTATTATT TGTGAAAGAC TTGATCTTCA TCGACCCGGA AGACGAAACA
AGGGTGGCCG ACTTTGTCCA AATTTTTGGA CGAGGTGTAC ACGTTGTGTG GCCTGATGAC
AAGCTTGGCG ATGTCTTGCG CGAGCTCAAG CTAGGCAAAT CTCACATGGC TTTAGTCCGA
GACGTGAACA ACAACGATGC AAGTGTAGAT CCATTTTACG AGATCAAGGG CATTATTACT
TTGGAAGACA TTGTTGAGGA GATTTTAGGT GATGAGATTG TGGACGAGAC AGATGCCTTT
GTCGATGGAT CGCACGCCGT AAAAGTCGAC CGA
 
Protein sequence
IIIGVLISLS ALFSGLTLGL MSLDKTGLEI VMHGDDVTNA RYASDIFPVR ENGNLLLCTL 
LLGNVAVNAL LSIMMGDIAG GLIGFLSSTF LIVIFGEIIP QAACSRYALL IGSKTVPLVR
VILVLFYPIA APLAYMLDKL LGAELATIYS SAELMKLLQI HVENEAMDQD TAVAMRGALK
YKDTTVKEVM TPLSNTFMLS VDEKLSFETI AKIFKTGYSR IPVYEISTRL SLGFQNNVIG
LLFVKDLIFI DPEDETRVAD FVQIFGRGVH VVWPDDKLGD VLRELKLGKS HMALVRDVNN
NDASVDPFYE IKGIITLEDI VEEILGDEIV DETDAFVDGS HAVKVDR