Gene PHATRDRAFT_39520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_39520 
Symbol 
ID7195350 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011688 
Strand
Start bp41762 
End bp43217 
Gene Length1456 bp 
Protein Length454 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183660 
Protein GI219126848 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGTCGT CTTTGCAGGG TGCCAACGCC ACTTCCAACG GTGCCATGGC GCTGTTGCAA 
AAATTTGCGG CCCTCAATAA TCACATCGAA GAGATTCGTC GGCAACAAAA CAGTGTACTG
CGTGAAATAG ACTCGGTCCA GCAGCAACTA GTGGATGTGG GAGAAGACCG AGAAAAGATG
TGTGAAAAAA CAGACGAAGC GGGAAAGAGC CTCATCCAAT TGGAACAAAG GACTAAAGAC
GCTTTGGATT CTCAACTTCA GGTAGAAAAA GGTCATTCCG AAGCGCTGTT GACTAACCAA
GTATGTGCCC GTCGACTTGA AGCAGCCCGT CAGGATACGT TGGAATCCCA GCAAGCCTTT
CTCGAACGAA CAAGGAACTT TCGTCATTCT TGTCGACGCC TGCAGCTACG CGCACAGCAC
ATGGGAATCC AACACGCCTC TCTCCGGGCA TGGATTGCTG CAAAAGGGGA AACGATATCG
GGAACTGATC TTGTGGGCGA CCAACGACAT CAAACGTACG GGTCCCGGTA TCGGAACTCC
AAATTTGATA TTCGGGATCC GGATTCATGG GGTCTCGAAG TCGTCCAGGG CGACGAAGAG
CTTCACGAAC TGTTCTTAAA GTATGAGTCA AAGAAGAGTG ATTTGGACTT GGCACAAAAG
AACCGTGAAA TAGCTCGAAT CGCCTGGCAA GAGCAGCTTA CCAATGCCGA CACTCGCCGT
GATCGTCGCA CGAGACTTGA AGAGCAACTA CGAAGGATCG AAGGAGACAA TGACGCCTTG
GAGTCGCAAA TGCGTGAATT GGAATGGCAG ACCGCTAGAG TTCGAGAGAC CGCGACAGTC
CCAGAGCTTC TTAAGAGTGA GTCGACATTC TTAGGATCAT TGGGATCCAC ACCAGGCTGT
TCCCTGGGCT TGCAATTCTA ACCGCTAAAT CTCTCTCTAT CTACCAGGTA CCGTTCAGCT
GATATCTCGA TCGACGAACT CCACCAAGAC GCAGCAATCA TCCGCCGCGG TGACTCCTAC
GACAGGCTGC CGTTCTACAA CCGGCCTCTC GTCAAGGGTT GCCTTTCACC GGACAAACCC
GTATGCCAAT ACAGGAAAGA ACCAATACAA GAATCGAATT TCCAGTACAG ATTCCAACGT
CATTTCCACG CCCGAAATCC GTCTACCGGC ACTCCATTCA GAAGCTTCCG ATTTCCGCGA
CAGCGCGTCG GCTAGTATTG GACGGCGTGG CCGTCGTCTT GGTGGAAGTG AATTTGGATT
GAGCATGGAA ATTGTCGGGG AGCCATCCAT CACATCCATG AACTATAAAA AAGCAACAGA
TACCTGCTTC AATGATCCTC ACATTTCTTT GCCGGCGGAT GATTCTTTGA AGCGCACCAT
TGCATCGCTG CAAGATAGCG ACGGCGAAGA CTTTTCCTAC ATGCCATTTA CTAAGAAGAG
CAATCAGCCT TTGTAA
 
Protein sequence
MSSSLQGANA TSNGAMALLQ KFAALNNHIE EIRRQQNSVL REIDSVQQQL VDVGEDREKM 
CEKTDEAGKS LIQLEQRTKD ALDSQLQVEK GHSEALLTNQ VCARRLEAAR QDTLESQQAF
LERTRNFRHS CRRLQLRAQH MGIQHASLRA WIAAKGETIS GTDLVGDQRH QTYGSRYRNS
KFDIRDPDSW GLEVVQGDEE LHELFLKYES KKSDLDLAQK NREIARIAWQ EQLTNADTRR
DRRTRLEEQL RRIEGDNDAL ESQMRELEWQ TARVRETATV PELLKSTVQL ISRSTNSTKT
QQSSAAVTPT TGCRSTTGLS SRVAFHRTNP YANTGKNQYK NRISSTDSNV ISTPEIRLPA
LHSEASDFRD SASASIGRRG RRLGGSEFGL SMEIVGEPSI TSMNYKKATD TCFNDPHISL
PADDSLKRTI ASLQDSDGED FSYMPFTKKS NQPL