Gene PHATRDRAFT_48952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48952 
Symbol 
ID7195369 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011688 
Strand
Start bp127571 
End bp130650 
Gene Length3080 bp 
Protein Length791 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183682 
Protein GI219126894 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTCAGTCCAA TCCGAACCAA GGCCCGCTTG TGTATTGCCG AGAGAAGAAG GGTCTCGATT 
GAGAAGAGCA CTTCGTGGTG CATTTTGTTA TCGCCTCGCT CCATCGGCCT TTCATTCGAG
CTACTGGTTC CCCAGTATTC TACATGATAT ATTTTGTAGT TTTTTACGGT AGCTTGAATT
CGATTGCAGA AGTATTTTTA CGTTGAGTAT ACGTATTGTA TTAGCTCTTT GTCGATAAAA
TCCTTTTATT TCCCACAACA ACATGGAAAA ACCACCCCCG ACTGTCGCAG ATCTGTAAGT
CTCTGTAGCG ATCGCAGGCG TCTTCGCTCT TGTCTCTCGT TTGGCCGATT GTCTCATATT
CTCTCCATTG TTCGCCAATA TTTCTTGTCC TCCGCTACAC AATTTTGCTG CTGTGCAGCG
TCTTGATCGG TGGCGGTCAC GCGCACGCGC ACGTTCTGAA AATGCTCGGC ATGAAAAGCA
CCGCGCATTC CTCACGCAAC ACGGTATACG GATCACCCTC ATTGCCAAGG ACATACACAC
CCCGTATTCC GGTATGCTTC CCGGCTTCGT GGCGGGACAC TACGATCACG ATCAGATTCA
TCTGGATCTG AATCAGTTAT GTCACGTGGC CAATGCACGA CTCGTTCACG CGGCGGCCTG
CAAAATTACC TACCGCAACG GTGGCGGTGG ACTCGTCTAC CTGAACGATG GCCGTCCCCC
CATACGGTAC GATTGCGTTT CCATTGATAT CGGGAGTGCC CCGGCGTATG GTGACGTTGT
GTGCCAGCCC GGTGTGGTAC CAGTCAAACC GATCGCCAAT TTTTGCACAG CTTACCAAGC
ACTCGTCAAC CGTTGGGAAA ATGAAACACC ATCGTCCACA GCAGCAAATG ACCAGGACAC
GACGTCAACC AACGGCACAA GGGCAGAACC GCCAAGCCCC TACGTCGTGG CGGTTGTTGG
TGGTGGTGCG GGAGGGTTGG AACTCGCCTT GTCAGTTCAA TATCGGTTGC GAAGCATCAA
CCCAAATGCT CCGTTACGCC TCTTGGTTGT CACGCGCGGG AAGACAATAC TGGAAGGACA
CAATCATCGA GTCCAGGCCA AGTTCCAACG CATTATGCAG GAAAGAAATA TCGAAATTTA
CTACCTCGCC ACCGTTGTAA AGGTGGCAGA AGACTCATCC ACAATGCGCA AGCGCCTTAT
TCTTTCGCCA GAAGGCGCCG CTGTGCACGG CCGAGACTCA TTCGTCGTCG ATGCTTGTTT
ATGGTGCGTC ACGGCGGGTG TCGCGCCGTG GCTGAAAGCT GACACGCCAT TCGCCACGAC
GAAGCAAGGA TTTCTCCGTG TCCACGATAC GTACGAATCA ATTGAACATC CTGGAGTCTT
CGCCGCCGGC GATTGCTGTC ACGTAGACAA GCATCCGAGA CCCAAAGGTA TGGCTAGTCT
GTCTAGATGC AAATCTTAGC ACTCATGGAT CTAGTCATTC CTAACTATTT TCACATTGTC
TTACATTCAA ATCCAGCTGG AGTATTTGCT GTACGGGCCG GACCATACTT GCTTGACAAC
TTGCTGCGAT ATGTTTCTTC AAAGCCCCTT ATTTCGCACA AACCCCAAAG TCATTTTCTG
GGTATTTTAG CGACCGGGAA CAAGTATGGT GTAGCGTCGA AATCATGGTG GTTAGCCACG
GAAGGATCTT GGATATGGAC CTGGAAAGAT TACATTGATC GTACCTGGAT GGCCAAATAC
AGTACTGATC TACCCGATCT TAAAGACATG ATGGCGAATC AAAAGCACTC AACTCAGCAA
AGCGGCCAAC AAACGAATGC TTTTGTTGCT TCTAAAGGGG ACGAAGTACT GCAGGCCTTT
ACATCAGATC CAATGCGGTG TGGAGGGTGC GGCGCCAAAG TTGGGGCCAC GATCGTTTCA
CGCGTCTTGG CAGCTGTCTA CGAACGGCAA ATTGAACGAG CTAAATTATT GGGGCTGCCC
CAGCCGTCAC GGATCGATCA CGACGATGCA GCGGTTCAGA TACTTCCAAA TAAAGCAGGT
GGCGCCATTG TTCAAACTAT TGACTACTTT CGAGAAATCG TAAAGGATCC ATTTACCTTC
GGGAAAATTG TGGCAGTCCA TGCTCTAAGC GATATCCACG CTATGGGGGC GACTCCCCAA
ACAGCCATGA CACTGGCTGT TGCACCGTTT GCTGCTGACG AAGAGGTAAC AGAATCGACT
CTATTGCATT TACTTAGTGG AGTCAGTGAT ATTTTGCAGG ATGAAAATGT GCAGCTTGTT
GGTGGACACA CTTGTGAAGG ATTGGAGTTA GCATGCGGAT TGAGTGTCCA AGGATATACG
GATAATCCGA AACTGCTCTT ACGTAAGCAA GGTGGTGCAA TAGGAGATAA AATCGTCTTG
ACCAAGCCAA TAGGTACCGG GGCACTCTTT GCAGCCGACA TGAGGGCTCG TTGTAAAGGC
TCATATATGT CGGAAGCCCT CGACAGCATG ATTCACAGCA ATTGCCATGC AAGCCAAGTT
GCGATGCGAG CAAAGGGCAT CAGATCGTGC ACTGACGTCA CTGGTTTCGG TCTCATTGGT
CATTTGCTCG AAATGCTCAT GGCAAACGAA ACGGTTAAGG AATTGGACAG CATCGGCGCG
GTAGTAAACA TTGGCGATAT CGATTTTCTA CGCGGTGGGT TAGAGGCATC TGCGAATGGC
ATTTTTTCGA CACTCCAATC ACAAAACGGG CGAAATCGAC GCGCTATTGT CAATCACACT
GAAGCCGCTG AAAAGTATCC GGTCAAGTAC CCACTATTGT TCGACCCTCA GACTGCTGGC
GGTCTAATGT TTTTCGTCGA TGCTCTGAGT GCTAGTGAAT TTCTAGCTGA ACTACGTGCG
GCTGATGTGA ATGCCCATAT AGTGGGAGAG CTAGTTTCAT ATCCTGCAGA AAGTAATGCG
GCCGCCGGTT TCTCCGAGAG TGTTTGCACG ATTGGAAGTG GCGGAGCAGT GACGGGCAAA
AGGATCCGTG TGCGGTAACA GCGGACGCCG GACCAATTTT TGTTCCGCGA TTCCACTGTG
TTAAGAAAGC AGTCTCTACT
 
Protein sequence
MLPGFVAGHY DHDQIHLDLN QLCHVANARL VHAAACKITY RNGGGGLVYL NDGRPPIRYD 
CVSIDIGSAP AYGDVVCQPG VVPVKPIANF CTAYQALVNR WENETPSSTA ANDQDTTSTN
GTRAEPPSPY VVAVVGGGAG GLELALSVQY RLRSINPNAP LRLLVVTRGK TILEGHNHRV
QAKFQRIMQE RNIEIYYLAT VVKVAEDSST MRKRLILSPE GAAVHGRDSF VVDACLWCVT
AGVAPWLKAD TPFATTKQGF LRVHDTYESI EHPGVFAAGD CCHVDKHPRP KAGVFAVRAG
PYLLDNLLRY VSSKPLISHK PQSHFLGILA TGNKYGVASK SWWLATEGSW IWTWKDYIDR
TWMAKYSTDL PDLKDMMANQ KHSTQQSGQQ TNAFVASKGD EVLQAFTSDP MRCGGCGAKV
GATIVSRVLA AVYERQIERA KLLGLPQPSR IDHDDAAVQI LPNKAGGAIV QTIDYFREIV
KDPFTFGKIV AVHALSDIHA MGATPQTAMT LAVAPFAADE EVTESTLLHL LSGVSDILQD
ENVQLVGGHT CEGLELACGL SVQGYTDNPK LLLRKQGGAI GDKIVLTKPI GTGALFAADM
RARCKGSYMS EALDSMIHSN CHASQVAMRA KGIRSCTDVT GFGLIGHLLE MLMANETVKE
LDSIGAVVNI GDIDFLRGGL EASANGIFST LQSQNGRNRR AIVNHTEAAE KYPVKYPLLF
DPQTAGGLMF FVDALSASEF LAELRAADVN AHIVGELVSY PAESNAAAGF SESVCTIGSG
GAVTGKRIRV R