Gene PHATRDRAFT_22666 details


Gene Information

Locus tag: PHATRDRAFT_22666
Symbol:
ID: 7194993
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011687
Strand:
Start bp: 48808
End bp: 50806
Gene Length: 1999 bp
Protein Length: 546 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002183278
Protein GI: 219126049
COG category:
COG ID:
TIGRFAM ID:
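
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, not stated in the record); the helper names `span_length` and `min_cds_bases` are hypothetical and introduced only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def min_cds_bases(protein_length_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein: three per residue, plus an optional stop codon."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record above.
gene_length = span_length(48808, 50806)
coding_bases = min_cds_bases(546)

print(gene_length)                 # 1999, matches "Gene Length"
print(coding_bases)                # 1641
print(gene_length - coding_bases)  # 358 bases that do not encode residues
```

Under these assumptions, the 358 bases left over would be non-coding sequence within the reported span, which is consistent with the record flagging the gene as spliced.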


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CACTACCAGT TATACCGTCA ACGCCTTTGT CCCATACGAT CCATTACGAT CTAAATTGTT 
CACTCCCGAT CGTTCTTTCA CTCGCACATT TACAGTTACA GTTAGTTCCT ATTCCACCTT
CCGCGCACCA AACTTCCTCT TCGTCCTCGT CATTCTTCCC CGCAGCATGA GCAATTCAAC
GCTGCACGAA ATTTCTCCCA ACGCCGAGAT TGTCTCGCGT CAACAGGCAC TCCAGGTCAA
CGTCGCCGCT GCTGTCGGAC TCTCCAACGT ACTCAAGTCC AACTTGGGAC CCACGGGAAC
TTTGAAACTG CTCGTCGGGG GGACGATTGA GCAACTCAAG TTGACCAAAG ATGGACTCAC
CTTGCTGAAA GAAATGCAGA TACAGCATCC CACCGCAGCG CTCATTGCCC GTACCGCTAC
GGCACAGGAC GACGTCACCG GTGACGGAAC CACGTCGGTC GTCTTGTTGA CGGGAGAATT
GCTCCGACAA GCCGAACTCC TCGTACGGGA AGGATTGCAC CCACGGGTAC TCACCGATGG
ACTCGATACG GCACGGGATG CCTGTTTGGA AGTCCTGAAA GCATTTGCCG TAGCTCATCC
GGATCTAATC CATAATCGGG ATCTATTGCA GCAAATTGCC CGGACCTCTC TGGCGACCAA
ACTCGACGGT CCTCTTGTGG ATCAGGTACG TGCGCTTTTG TTGTTGTGGT AGCGTTAGCT
GAGTAGTTAT TTCGTACCGT GCCATACCGG AATTCAAGGA TGCCCCCGTT GTCACTCTGG
CGTCTCTTTG CTTACTCACA GTCATACACC ATCAACCCCT CTTTCCTTTC CTCACACTGT
CCATTATAAC TTTGTCTACG CTTACGCTTT TATCTAGATG TCTTCGGCGG TCGTCTCGGC
CATTCAAACG ATATACGAAC CAGACACGCC GCTCGACTTG CACCGCGTGG AAATACTCAC
TTTGGCTCGC CACCGGGCCG TCGATTCCAA ATTCGTTGCC GGTTTAGTGC TCGATCACGG
TGCCCGCCAC CCCGACATGC CCACACAACT CCTCAACGTC AAGGTCATGA CCTGCAACAT
CTCGCTCGAA TACGAACAAA CCGAAACGCA GGCCGGTTTC GTCTACTCCA CTGCCGAAGA
ACGCGAAAAG CTCGTCGAAA GTGAACGTGT TTGGTTGGAC GAGCGTTGTC GGCGCATTGT
GGAATTCAAG CGCCAAGCCT GTGCAGACGG CGAGACCTTT TGCATCATCA ATCAAAAAGG
TGTGGATCCG TTGAGCTTGG ACATGTTCGC CAAGGAAGGT ATCCTTTGCC TGCGTCGGGC
CAAACGTCGC AATATGGAAC GTCTCACGCT CGCAACCGGC GGTAGTATCA TTCTCAGTCT
CGAAGATTTG GAAACCAGCA TGCTGGGCTA CGCCGGTAGC GTCAAGCAAG TCACCTACGG
CGAAGACAAG TACACGTTCG TGGAAGACTG CCCCAATTCA CAGTCCGGGA CTCTACTTTT
ACAGGGACCA AATAAGTTGA CGACCGAACA AATCAAAGAC GCCGCCAAAG ACGGCTTACG
GGCCGTCAAA AATGCCGTAG AAGACGGCGC CCTCGTGCCC GGAGGCGGCG CCTTTGAAAT
TGCCGCTTCG GAACATTTAC TGCACAAAGT CGTGCCCACG CTCAAAGGCA AAACGAAACT
GGGCGTACAA GCGTACGCAC AGGCGCTTTT GGTCATTCCC AAAACGCTCG CCGCTAATTC
GGGTTTTGAC GTCCAGGACG TCCTGCTGAA ACTTCAGGAT GAACGCAACT CAACCAACAT
GGCGATTGGT TTGGATGTCA AAACGGGGGA ACCCATGTTG AGCGCGGAAC AGGGTGTATG
GGACAATGTC CGGGTCAAAC GTCAAGGCTT GCATCTGGCC ACGGTCTTGG CCAACCAGCT
ACTGCTCGTG GACGAAGTCA TGCGGGCTGG CAAACAAATG GGGAGAAATG CCCAACCAAA
TCCGGAAATG ATGGGATAG
 
Protein sequence
MSNSTLHEIS PNAEIVSRQQ ALQVNVAAAV GLSNVLKSNL GPTGTLKLLV GGTIEQLKLT 
KDGLTLLKEM QIQHPTAALI ARTATAQDDV TGDGTTSVVL LTGELLRQAE LLVREGLHPR
VLTDGLDTAR DACLEVLKAF AVAHPDLIHN RDLLQQIART SLATKLDGPL VDQMSSAVVS
AIQTIYEPDT PLDLHRVEIL TLARHRAVDS KFVAGLVLDH GARHPDMPTQ LLNVKVMTCN
ISLEYEQTET QAGFVYSTAE EREKLVESER VWLDERCRRI VEFKRQACAD GETFCIINQK
GVDPLSLDMF AKEGILCLRR AKRRNMERLT LATGGSIILS LEDLETSMLG YAGSVKQVTY
GEDKYTFVED CPNSQSGTLL LQGPNKLTTE QIKDAAKDGL RAVKNAVEDG ALVPGGGAFE
IAASEHLLHK VVPTLKGKTK LGVQAYAQAL LVIPKTLAAN SGFDVQDVLL KLQDERNSTN
MAIGLDVKTG EPMLSAEQGV WDNVRVKRQG LHLATVLANQ LLLVDEVMRA GKQMGRNAQP
NPEMMG
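
Both sequence blocks are displayed in space-separated groups of ten characters, so the whitespace has to be removed before lengths or base composition can be compared with the fields in the Gene Information section. The sketch below shows one way to do that; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the GC calculation assumes the sequence contains only unambiguous bases.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, assuming only A, C, G and T are present."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Final line of each block, copied from the record above.
last_gene_line = "TCCGGAAATG ATGGGATAG"
last_protein_line = "NPEMMG"

print(len(clean_sequence(last_gene_line)))     # 19
print(len(clean_sequence(last_protein_line)))  # 6

# Toy input to illustrate the GC calculation.
print(gc_fraction("GGCC AATT"))                # 0.5
```

Applied to the full blocks, the cleaned lengths can be compared with the Gene Length and Protein Length fields, and the GC fraction with the GC content field.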