Gene PHATRDRAFT_27821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_27821 
Symbol 
ID7201491 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011677 
Strand
Start bp848673 
End bp852172 
Gene Length3500 bp 
Protein Length822 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180719 
Protein GI219119937 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CATGTAGGCA CACACATCAA TCCTGATTGG ATGATAAATT GCTGAAAAAG TAGATCACTT 
CGAGAAAGGA GCAGCCAAGA CAGGAAACGA GGAGATGCAC AGGGCCCGAG TCGACCAGTG
TTTCAAAAAG ATATTTTCAA ATATTAAGGC CCAATGACCA ACAAAGAATT TGTGACGTTC
CCCCCCGCGT CTGAAGCATT ACAACAATTG TGAGTCTGCG CAAGCCAATT TTAATAACCA
CTTGCCATTC TTCCTAGCAA GTACTTTCAT AGTCAGTCGC TCTCCCACTC GCAAAAAAAC
TCACAGTCAG TCCATTTGTC TCAATTCATT GATTTGATTC GGCATGCGTC TTTGAAGGTA
GTTCAGAATG AAAATTGTCT TTCCGGTAGC TACGACGTAC GTACAACCCT GAACTACTTC
TTACATAGGG TTCTGCGTCT GACAAATAGT CCAAATCCCT CTCAGTCTGT CTGGATATAC
GGTAGAACCA ACGCAGCACC ACACAACACG CACGCGGCGA GATTGTGGAA AGCATCTGCC
AATCAATTTT CCTTTTCCGT TATCGAGTTT TCTGCAGAAC GCTACGACCC GAAACACCAC
ATCCTTCTAC GAGGACCCCT TACATCAGCG CTTATTGTCG TGAATTGGGT ATAGATCCAC
TGATGCGAGT AAGACCGCAC AGCAGATTCA ATTGAAATCC TCTTTTTCCG TTAGTGTTTT
GATCTATAAA ATACGTTCTT GATCCCTTCT GAACGACTCG TACGCGCTTT TCCTCCAGTT
CAAAGAGAGA AAAGAAAGCC GCTTCCCCTC GTCACCGACG ATGAGATTTA CCCGAAACAC
TGTTATTGCT GTAATTATGA CAAATTCTCT CTTTCTTCAG CGCACGTCCC GGCTCGTGGT
TCGAGCCTTG ACGACGGCTG CGCCGCTGTC GACCCGTCGG TCCGCTGTGG CCTTGGTGCC
CAACGCGGCT TCTGCGACAC GGGCGACTGG CTTTGTGATG CCAACCTCGA CCTCTACACC
TTTTGCCCGC ATGCTGGCCA CCAAAGCAAC TGTTGAAGAA GACTTGGACG CCGCATTGGA
TGATGTTTTG GCAGGAGCCT ACACGGAGGC CAAGACTCCG GCTGGAGTCG AGCCTGTCAA
TCACATGAAG AATTCCCATC CGATGCCTTC GCCATTGGTA GAGCAGGTAA GCAAGAAAGA
AAATATATTA TGCACAGGGC CTACTCTTAT CAGCGTAACG ATCTCACGAT ATTCTTTTTT
GTTTCGTTTC TTTAAGGATA TTGATTACAA GGACCCCGAA CTCTTATCCA CGAGTAATCC
TCGTTGGATC GAAGCCGGTC TCGACCAGAG GGTAATTGAC GTTCTAAGCG AGAAGGGAAT
TACGTCATTC ACACCCGTAC AGGCCGAAGC CTTTGGGCCA GTCATGGCTC GACGTGACGT
GATTGGTCGC AGTCGTACCG GAACGGGTAA AACCCTAGCG TTTGGATTAC CCGCATTGAC
TCGTCTCGTA ACATTTACTA CAGAAAACGG CAAGCGCGAT GCCCGTGGAG TCATGAAGAG
TGGACGCAAG GTATCCATGA TTATTCTGTG CCCGACTCGG GAACTGGCGC GGCAGGTTCA
GGAGGAGCTT TCGCAAGTCG CCCGCCCTCT TGGCTTGTTT GTTGAAGTCT TCCACGGTGG
TGTGTCTTAC GACCCTCAGT CTCGCGCCTT GCGACAGGGA GTGGACGTCA TCGTGGGTAC
CCCTGGACGA GTAATTGATC ATATCGAGCG CGGAACGTTG GATCTGAGTG AGTGTGATAT
TGCTGTTCTC GACGAAGCGG ATGAAATGTT GAACATGGGC TTTGCGGATG ATGTGGAAGT
TGTTTTGAAG AACGTCGGTT CCAATAATCC GCAAAAAACG CAATGTTTGT TGTTTTCGGC
CACGACACCG AGCTGGGTTA AGGAGATTGG CCGACAGTAC CAAAAGGACG TTTTGGCGAT
TGACTCTACG GCGGATAAGG GCGGTGCTCG AGTGGCCGAG ACGGTTCGTC ATTTAGCCGT
TCAGCTTGCT CCCGGCGCCG ATGCAAAAAG ATCTGTTTTG GAAGACATTA TTGCGGTTGA
AATCTCCAAG GATGCTGATA TCGGCAAGAT TGAACTCGAA ATTGCCAACC CGATTGCTGC
TGCTGCCCAC AAAAGGAAAA ACAAGGGTAA CCAAGCCATG CAGCAAAAGA TTTTTGGTAA
GACGATTGTG TTTACCGAAA CAAAACGTGA GGCGGACGAG CTAGTATCGG GAGGAGTTTT
CAAAAGCTTG ACTGCCCAAG CACTACATGG TGATGTCGGC CAGAAGCAAC GTGATTCGAC
CCTTGCGGCA TTTCGAAGCG GGGCCTTCAA CGTGTTGGTG GCCACCGACG TGGCCGCGCG
CGGTATCGAT ATTCAAGATG TCGATTTGGT CATTCAGTTC GATCCTCCGC GAGATGTGGA
CACCTACGTG CATCGCTCTG GTCGCACCGG GCGTGCCGGG AAGAAAGGAG TCTCTGTTCT
GCTGTTTAAT CAGCGACAGT CCCGAGACAT CGTCCGTATT GAGCGGGATT TGGGGCATGG
TTTCAAGTTC GATTTAGTTG GACCTCCGTC CGCTGAGGCT ACTTTGAACG CCGCCGCCAA
AACATCGGCG ATTGCGACGC AGAGTATTCC TGAGGAGACG GCTGAGTTTT TCAAAGAATC
AGCAGCCAAG CTTCTGGAAT CGCAAGACCC AGTCGATGTG GTTGCCCGTT GTTTAGCTGC
TGTCTCCCGA CGTGCGTCGG AAGTGCAATC CCGGTCGTTG CTGACCGGCC AGGTTGGCTT
TGCGACGGTT GAGATGGTGA ACGAACGTGG ACGCCCGGTT GCGGCGAACG ATGTCATGTT
CACAATTGGC AAGCTGTCAC GCATGAGCAA CCAGGAAGGA GATTTGGCCT TTGACAGCCA
GGTTGGTAGG ATTCAGACCA ACAGCGAATC GGGCTCTGTT GTATTCGATA TGAATGTGGA
AGATGCCAAA AATTTGGTGA AGTTCAGCAA GACTGTCGAT GCTGGTGGTG CCGCCTTCCA
GCTTTTGAAG GCGCTTGCGG TGGAAAGGGA TCGAAACTTT GGACGAATGG GTGGAGGCCG
TGACGGTGGT GGCAGGTTCA GCCGCGGACG TGGTGGCGGA GGCAGCTACG GCAGCGGAGG
TAGCTACGGC GGCGGCCGTG GGGGCTACAG TGACCGCAAT GGTCGCGGTG GAGGTCGTGG
TGGAGGTCGC GGTGGAGGTC GTGGTGGTGG CGGGCAGCGC TTCGATCGTC GTGACGGAGG
CGGCGGCCAG TCTGGCGGCT ACTCAGGACG TTACGACGGT GGACGCAGCA AGAACTCGCG
CGGGGGTAGT AGCTGGTAAT TTCAGTTCGT CGCAACAGCA ATAACTCCGG TACATAGCTC
TTGGCGTTGT CTTGAGAGCT TCAAAATAAG CAGTGATCCT TAACGAAAAC CATCATTGAT
AGTAACATAT TTGCACATGA
 
Protein sequence
MRFTRNTVIA VIMTNSLFLQ RTSRLVVRAL TTAAPLSTRR SAVALVPNAA SATRATGFVM 
PTSTSTPFAR MLATKATVEE DLDAALDDVL AGAYTEAKTP AGVEPVNHMK NSHPMPSPLV
EQDIDYKDPE LLSTSNPRWI EAGLDQRVID VLSEKGITSF TPVQAEAFGP VMARRDVIGR
SRTGTGKTLA FGLPALTRLV TFTTENGKRD ARGVMKSGRK VSMIILCPTR ELARQVQEEL
SQVARPLGLF VEVFHGGVSY DPQSRALRQG VDVIVGTPGR VIDHIERGTL DLSECDIAVL
DEADEMLNMG FADDVEVVLK NVGSNNPQKT QCLLFSATTP SWVKEIGRQY QKDVLAIDST
ADKGGARVAE TVRHLAVQLA PGADAKRSVL EDIIAVEISK DADIGKIELE IANPIAAAAH
KRKNKGNQAM QQKIFGKTIV FTETKREADE LVSGGVFKSL TAQALHGDVG QKQRDSTLAA
FRSGAFNVLV ATDVAARGID IQDVDLVIQF DPPRDVDTYV HRSGRTGRAG KKGVSVLLFN
QRQSRDIVRI ERDLGHGFKF DLVGPPSAEA TLNAAAKTSA IATQSIPEET AEFFKESAAK
LLESQDPVDV VARCLAAVSR RASEVQSRSL LTGQVGFATV EMVNERGRPV AANDVMFTIG
KLSRMSNQEG DLAFDSQVGR IQTNSESGSV VFDMNVEDAK NLVKFSKTVD AGGAAFQLLK
ALAVERDRNF GRMGGGRDGG GRFSRGRGGG GSYGSGGSYG GGRGGYSDRN GRGGGRGGGR
GGGRGGGGQR FDRRDGGGGQ SGGYSGRYDG GRSKNSRGGS SW