Gene PHATRDRAFT_30394 details


Gene Information

Locus tag: PHATRDRAFT_30394
Symbol:
ID: 7195728
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011690
Strand:
Start bp: 438314
End bp: 440483
Gene Length: 2170 bp
Protein Length: 559 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002184252
Protein GI: 219128084
COG category:
COG ID:
TIGRFAM ID:
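
The Gene Length value follows from the Start bp and End bp values if the coordinates are read as 1-based and inclusive. That convention is an assumption here, as the record does not state it. A minimal sketch of the arithmetic, with the hypothetical helper name `feature_length`:

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information fields above.
gene_length = feature_length(438314, 440483)

# Codons needed for the listed 559 aa protein, not counting a stop codon.
coding_bp = 559 * 3

print(gene_length)              # 2170
print(coding_bp)                # 1677
print(gene_length - coding_bp)  # 493
```

The 493 bp not accounted for by the 559 codons is consistent with the gene being marked as spliced. How that remainder divides between introns, the stop codon and any untranslated sequence is not given in the record.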


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.382636
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGGAGTCTTT CCCGCGTGCA CGTCGTCGTG ATCCTGTCAT CCGGACAAGA TCGTCTTCGT 
CGTCGTGAGA TTGCGTATTC CCACAGCGCT TCGACAGCCG ATTGCCGCAC CCGAGCGCTT
TTCAGATCGA CGTTGCTATT CCCATGTCGA AAGAAGCGGC GCAATCACCG AAAAGCGGAA
AAACGATCGA TCAGACCGCG GATCGGACGG CCATTGTCGC TGCTCGTCTC AAGAAGCTGT
ACAAAAATTC TGTCTATCCC GTCGAAAAGA AGTATCGCTA CGATTATTTC TTTGAAAGTC
CACTTTTGAG TGACGTCGAG TTTGATGGTG CGTACTCCAA ACGAAGTTTG AGGTTCTTTT
GCTTTTGCCA AATGCTTTTA GTGTTGGCAA AGGACCTCAC ACACTCTTGC ACTCTCTCGT
GATAACACAG CCAAGCCTCA AGTGCTTCTG GTTGGACAAT ACAGTGTAGG AAAGACTTCC
TTCATTCGTT ACCTACTCGG TAGAGACTTT CCTGGTCAAC GGATCGGTCC CGAGCCTACT
ACCGATCGAT TCACCGTATT GCTGAACGGC CCGGAAGAAC GCACTATTCC GGGAAATGCT
CTCTCGGTGC ATCCTGATCT ACCCTTTCGG GGTCTCGAGC GCTTTGGAGT CAGTTTTCTG
AGTCGCCTCG AAGGCAGTCA ATTACCAAGT AGTGTTTTGA AATCCATCAC ACTCATCGAT
ACCCCGGGTA TCCTTTCGGG AGAAAAGCAA CGCACCAACC GTGGGTACGA TTTCACGAAA
GTCGTATCCT GGTTTGCCGA AAAGGCGGAT TTGATTATTC TGCTCTTTGA CGCACACAAA
CTTGATATCT CGGATGAACT CAAGGGCGCA ATCGATGTCC TTAAGGGGCA TGAAGACAAA
ATTCGATGCA TTCTCAACAA GGCTGATCAG ATTGATCGGC AACAATTGAT GCGAGTTTAC
GGTGCGCTAC TTTGGTCGCT CGGTAAAACT ATGACCAGTC CAGAAGTAGC CCGAGTTTAC
GTCGGCAGCT TTTGGCAGCA ACCGCTGCAG CACATGGACA ACGCCGACTT GTTCGAAATG
GAGGAAAAGG ATCTCATGAA GGATTTGGCC GTCCTTCCAC GGCAATCAGC CGTACGAAAA
ATCAATGAAC TCGTCAAACG CATTCGCAAG GTCAAGACCT TGGCCTACAT TATTGGCTAC
TTGAAATCAC AAATGCCTGC ACTCATGGGC AAGGAAAAGA AACAAAAGAA ACTCATTGCG
TACGTTTTGT GGGTGATGCG TGACTCTAAA TATGTGGCTG CTGCGTAATG TTGGTTCATC
TTTTCCAGGA GGCGTCGTTT CGGTCACGCA CACACGTTCT CACCATTTTC TCCTTACGTT
TCATCCACAG CGACTTACCG ACGGTGTTTC GTACGATTAT GAAAAAGTAC GACTTGGCCC
CCGGCGACTT TCCCGAAATT GCCAGCTTTT CTAACAAACT GCACGAAACC AAGTTTGCCG
AGTTTAATAC CTTGTCGGAA AAACAAATTG CTGATCTGGA CCGGGTCCTG AATGAGGATA
TTCCCAAACT CATGGAAGAA TTGCCCAGTG AAAAGGACTC TCCCGACATT ATCCGATCCA
AAATGGGAGC TGCTGGTGGC ATCGCCAAGG TTCCGGTCCC GGTCGCCAAT AATAAATTCG
GCAAAAAAGA AACGGCTCAC GAAAGCAATC CTTTTGGCTA CGATGAGGAG AACGAAGATT
ACTGGTACGT GTCACAACCA AGAAACGGTA CGCGTTTTGC GTATCTCCAG GTCTAGCGTG
TTAACATGCA ATATCTTTAC TCTGTTGTAC AGGGCACTGC AGGATTCAGC AGATCGTCTC
CTGCCAAGCT TTGAAGCCTT GGGACCAGAT GGTGGCTATC TCTCGACGGC CAAGGCACGT
GATGTGCTGG TCAAAACGGG GCTCGAAAAG GACCAACTTC GCCAAATCTG GAACCTTAGC
GACATCGATA AGGACGGTCT CTTTGACCAT GACGAATACG TTGTGGCCAT GTTTTTGTGT
GATGCTGTTC TGCAAAAAGG TCGACCCATT CCCTCCGAGC TCCCGGCGAG TGTTATTCCG
CCGCGCAAGC GATCACTGTT AGCAGAGAAG AGCAGCGTAT TTTAAAGGGC AATAGCAACA
ACATCAAGAG
 
Protein sequence
MSKEAAQSPK SGKTIDQTAD RTAIVAARLK KLYKNSVYPV EKKYRYDYFF ESPLLSDVEF 
DAKPQVLLVG QYSVGKTSFI RYLLGRDFPG QRIGPEPTTD RFTVLLNGPE ERTIPGNALS
VHPDLPFRGL ERFGVSFLSR LEGSQLPSSV LKSITLIDTP GILSGEKQRT NRGYDFTKVV
SWFAEKADLI ILLFDAHKLD ISDELKGAID VLKGHEDKIR CILNKADQID RQQLMRVYGA
LLWSLGKTMT SPEVARVYVG SFWQQPLQHM DNADLFEMEE KDLMKDLAVL PRQSAVRKIN
ELVKRIRKVK TLAYIIGYLK SQMPALMGKE KKQKKLIADL PTVFRTIMKK YDLAPGDFPE
IASFSNKLHE TKFAEFNTLS EKQIADLDRV LNEDIPKLME ELPSEKDSPD IIRSKMGAAG
GIAKVPVPVA NNKFGKKETA HESNPFGYDE ENEDYWALQD SADRLLPSFE ALGPDGGYLS
TAKARDVLVK TGLEKDQLRQ IWNLSDIDKD GLFDHDEYVV AMFLCDAVLQ KGRPIPSELP
ASVIPPRKRS LLAEKSSVF
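
Both sequences above are printed in blocks of 10 residues, 6 blocks per line, so the whitespace has to be removed before they can be used. The following is a minimal sketch of that cleanup, plus a GC fraction helper that could be compared against the listed GC content. The function names `join_blocks` and `gc_fraction` are hypothetical and not part of any database tool.

```python
def join_blocks(blocked: str) -> str:
    """Remove the spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already joined DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# Last line of the protein listing above: one full block and one short block.
tail = join_blocks("ASVIPPRKRS LLAEKSSVF")
print(len(tail))  # 19

# Nine full lines of 60 residues plus the 19-residue tail.
print(9 * 60 + len(tail))  # 559
```

Applied to the full listings, `len(join_blocks(...))` should reproduce the Gene Length and Protein Length fields, and `gc_fraction` on the joined gene sequence gives a value to check against the rounded GC content figure.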