Gene PHATRDRAFT_28482 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_28482 
Symbol 
ID7202166 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp368268 
End bp370470 
Gene Length2203 bp 
Protein Length602 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181194 
Protein GI219121689 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.962764 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GCAAACACCC ACTACAAACA CTCGAGACAA TCCGGAGATA CTTCCATATA CTCTAGAAGC 
AATAGAGCAA GACTTACAGT AAAGCAATCA TCACATAGAC TACATCTGCG TCGCGACCGA
AAAGACGGAC ATTGCACGTC GTCGAAAAGG GTTATTGACG TCGTTTGGGG GAAAGTGATC
GCCTTTCGAT TCTTATACCA CTGGGGATCA TGGAAGATAT CATGTCCATA ACTCCGCTGG
GGTCGGGCCA AGAAGTCGGA CGTTCCTGCC ACCTGCTTGA ATTTCGCGGA ATGACAATTC
TCTTGGACTG CGGCATTCAT CCTGGGTACG ATGGGTTGAA TGGACTCCCT TATCTCGATC
GAATTGAGCC TGATCAGGTT GACGTCTTGC TCATCACCCA CTTTCATCTA GATCACGTCG
CTTCGTTACC CTATTTGACC GAACGAACTT CCTTTAAAGG CCGCATCTTT ATGACTCATC
CTACCAAGGC GGTGACGCGA TTGCTTCTTG GGGATTATCT CCGACTCTTG CAAATGAAAA
ACGCCAAACC CGAAGACGTA CTGTACACGG AAGCCGATCT TCAGTCTTGC ATCGATAAGA
TCGAGCTCAT GGATTTTCAT ACCACAGTCA CGGTCGGCGG TTTGTCATTT TACGCGTTGA
ATGCCGGCCA CGTGCTCGGT GCTTGCATGT TCTTTCTCAG TCTCGGTGGA CGCAAAATAT
TGTACACGGG CGACTACTCC ATGGAAGATG ATCGCCACTT GATGGCGGCC GAAATTCCTG
CCGAGTCCCC CGACGTGCTG ATTGTAGAGG CCACGTACGG TGTGCAAGTA CACGCGAGTC
GCGCCGAGCG CGAAGCCCGC TTTACCGGAA CCATCGAACG AGTCATATCG CGCGGCGGCA
GATGCCTCAT ACCAGTCTTT GCACTGGGGC GAGCCCAGGA ATTGTTGCTC ATTCTGGACG
AGTACTGGCA GGCAAATCCG CATTTGCAAA ACATTCCCAT TTGGTACGCT AGTAAGCTGG
CTTCTCGGGC ACTACGTGTC TATCAAACGT ACGCGAATAT GATGAACGCA CGGATACGCT
CCCAAATGGA CGTGTCCAAT CCTTTCCGAT TTCGTTTCAT TCAAAATCTC AAATCCATTG
ACGTCAATTC GTTTGACGAC TCCGGTCCTT CGGTCGTTTT TGCCTCTCCA GGGATGTTGC
AGTCTGGAGT TTCGCGGCAG CTATTTGATC GCTGGGCGTC GGATCACAAG AATGGTGTGT
TGATTGCCGG TTACGCTGTT GAGCACACAC TTGCCAAAGA AATCATGGCG CAACCCAAAG
AAGTTGTTAC ACTGGAAGGT CGTCGACAAC CGTTAAATGC CCTAGTGGAC TACGTCAGTT
TCTCGGCGCA CGTTGATTTT GTACAAAACC GCTCTTTTAT CAATCAAGTG GCGCCGAAAC
ACATTATTCT CGTACATGGA CAGAAGGATG AAATGGGACG ACTCAAGAGT GCTTTACTGC
TACAGTATAA GCAGTTTCCG GAGGTAAGCG CTTTCCGTAA AGAAGAACTT TGTCGTATTT
GCGTTGTTTA GCCACTCACA ATCTCTGATT TCGGTTGTTG CCGGCAAATC GTGTCTTAGA
ATAAACGTCC AACGATTACC ATGCCACCGA ATTTACAGGA AGTCAAACTT AAATTTGCAC
GTCGGCGATC GGCCAAGGTC ATGGGATCGT TAGCAGATCG ACAAAAAGAG CCTAAAGAAG
GGGAAGAAGT CCGGGGTATT CTTGTCACGC ATAACTTTCA TTCGAAATTG GTTGCTCCCG
AAGACTTAGC CACCTACACT CCCTTGCGAG TCGGCTCGAT CGCCAGTAAG CTGCACGTTC
CATTTGTTGG ATCTCTTGCG ACTTTGCGAT TATTTTTGAC GGAAATGTTT GCTGGGGTAT
CGGAAAGTAC GGAAGAATCA GAGGACTCGA CGCGGACCAT TTTCCAACTT GTGAACGAGG
TGTGTAAACT GAGCGTTCGT TGAAAACGAT TAATGCGCGG TCCAAGTCGT CATATATCTC
AGTCTCTCAC ACTTTTCAAT GCACTCTATT TCCATCATAA TCGCAGGTTA AAGTCACGTT
AGGTGCGAAC AAGGGAGTAG CGATTGTCGA ATGGATGGCA AGTCCCCAAG GTGATATTTT
GGCTGATGCT GTTGTCGCGC TGTTAATGCA CGCGCAGAGC AGT
 
Protein sequence
MEDIMSITPL GSGQEVGRSC HLLEFRGMTI LLDCGIHPGY DGLNGLPYLD RIEPDQVDVL 
LITHFHLDHV ASLPYLTERT SFKGRIFMTH PTKAVTRLLL GDYLRLLQMK NAKPEDVLYT
EADLQSCIDK IELMDFHTTV TVGGLSFYAL NAGHVLGACM FFLSLGGRKI LYTGDYSMED
DRHLMAAEIP AESPDVLIVE ATYGVQVHAS RAEREARFTG TIERVISRGG RCLIPVFALG
RAQELLLILD EYWQANPHLQ NIPIWYASKL ASRALRVYQT YANMMNARIR SQMDVSNPFR
FRFIQNLKSI DVNSFDDSGP SVVFASPGML QSGVSRQLFD RWASDHKNGV LIAGYAVEHT
LAKEIMAQPK EVVTLEGRRQ PLNALVDYVS FSAHVDFVQN RSFINQVAPK HIILVHGQKD
EMGRLKSALL LQYKQFPENK RPTITMPPNL QEVKLKFARR RSAKVMGSLA DRQKEPKEGE
EVRGILVTHN FHSKLVAPED LATYTPLRVG SIASKLHVPF VGSLATLRLF LTEMFAGVSE
STEESEDSTR TIFQLVNEVC KLSVKVTLGA NKGVAIVEWM ASPQGDILAD AVVALLMHAQ
SS