Gene PHATR_33631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_33631 
Symbol 
ID7204072 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp1418941 
End bp1420301 
Gene Length1361 bp 
Protein Length400 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186249 
Protein GI219113331 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTAGCG AACGAAGTTC CCCTTCCGGC TCTAAGGAAT CAACATCTCA AGATGTTGCC 
GATGTAGCGG TGATTGGAGC CGGCTGGTGG TCACAAGGAT GGCATATACC ACAGTTGAGT
CGGAACACTC GGGCCAATCT CGTAGGAATT GTCGATTCTG CCTCTCAGCC CCGATCGAAT
CTCAATCCTC ATCTCGAGTC GCTCGAAGCG CTGGCACGAA AGTACGACAC TGCCGTATTT
TCCTCCGTGT CGGACCTTCT GGCGAACACA CCCACTTTGG ACGGTGTAAT TGTGGCAACG
CCGCATTCGA CTCACTACAA CATTGGTAAA GAAATCTACG ACGCCAACAG GAACAGGGAA
AAACCAATCC ATATTCTCAT GGAAAAGCCC ATGACAACCA ACATAGAGGA AGCCTACCAG
CTGCACCAAC TGGTGGCGTC CCGTCCGGAG GTTTCCTTCC TAATCAACCA TTCGGCCAAC
TACCGTTCGC AAACAAAGGC AGCACATCAA GCTGTCCCCC AACTAGGGAG TTTGCGGCAT
GGCTCGATCT TCATGGCGTC TGCTCTGAGT TGGATCTTTG TACGTCCATG GAATGTTGGC
TAGCGTTGTG ATGCGACTCG AGGGCTTGAA TGTGAGCCAT ATCTTCTCGG GGTTTGCATT
GTCATTGCGA GCTTCTGTTA CTTTGGTCAC TCACTGTCCT ATGTTTTATT CCTTCTTTGG
CTCGTTCACG CATCTAGGAA GACCCTTCGA ACACTGGTTG GAACGTTCCG GGGCCTGACA
TGCTGGGGAA CGGCTTTGCG TGGGGTCAAT CGTCGCACGT CTTGGCTTGG GTGTTCTACG
TATGCCCGCA ACTTAGCCCT ATCGAGGTAT ACTGCCGCAT GACGCATTCT GCCGCCACTG
GAGCTGATGT TGCACTTTCT GGAACGATCA TTTGCCGAGA TTGCCACTCC GACAATGAAG
TCATCCTTTC CGTTGCCGGT ACCAGTTTAC TTCCGGGCAA TGAGCACTCG GACCCTCCGG
TTGGCAAACA AATTCAATTC AAACTCTACG GCGAAGACGG TGCTATTATC TACTGCGGCG
ATACACGGGA TGAGAACACC GGAAATCTGG AACTGCGCCG CGTGTCGATG GATGGTGTCG
TGGAGTTGCC TGTTGGTCCG GGATTTGCCT TTGAGAATTT AGAAACAGAA CATGATGGGC
CTGAGTCATT GCAGGCTTTC TTGGATGCTT GTGTAGGAAA ATCAACGCAT GTGGGGGCCG
ACAGTTTGGT TGGACTCAAA ACGGTTCAAG TATTGGATGC CATGTACCGC AGCCACGCTT
CCGGTCAATC CGAACCTGTA CGGCATGCGG ATGCGGAATA A
 
Protein sequence
MRSERSSPSG SKESTSQDVA DVAVIGAGWW SQGWHIPQLS RNTRANLVGI VDSASQPRSN 
LNPHLESLEA LARKYDTAVF SSVSDLLANT PTLDGVIVAT PHSTHYNIGK EIYDANRNRE
KPIHILMEKP MTTNIEEAYQ LHQLVASRPE VSFLINHSAN YRSQTKAAHQ AVPQLGSLRH
GSIFMASALS WIFEDPSNTG WNVPGPDMLG NGFAWGQSSH VLAWVFYVCP QLSPIEVYCR
MTHSAATGAD VALSGTIICR DCHSDNEVIL SVAGTSLLPG NEHSDPPVGK QIQFKLYGED
GAIIYCGDTR DENTGNLELR RVSMDGVVEL PVGPGFAFEN LETEHDGPES LQAFLDACVG
KSTHVGADSL VGLKTVQVLD AMYRSHASGQ SEPVRHADAE