Gene PHATRDRAFT_45250 details


Gene Information

Locus tag: PHATRDRAFT_45250
Symbol:
ID: 7200120
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011674
Strand:
Start bp: 608518
End bp: 610821
Gene Length: 2304 bp
Protein Length: 743 aa
Translation table:
GC content: 46%
IMG OID:
Product: predicted protein
Protein accession: XP_002179468
Protein GI: 219117347
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.325781
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AGAAAAAGAA CCATCAAAGA ACAAGCTATC TGTAAATGCG CGTACCATGA AGAGACTGCG 
GGGGAAGAGA ATAACTTTAC TGCTTGCCAG TGTGATTCCT TTGCTTGCGG TTGACGATGT
TCCAATGTGG CAGTCCACGA GGGACTCCGA TGAGGAACAT CCTGCTGATC AGGAGAAGGA
ACTGTCTTGT AGGGAACAAA CCCAAATCGA CCGTGGAACA AAGCAATTCC GCGAACCGTT
GCACAGACCT AACGGCGACC GGGACTCTGC ACTGCTGCGG CAGTCACTAA ATCCGGATAT
TTTATTTGAA GGAAGCGAAT TCGGTCAACA CTACAACATT GTAGTGCCGT TTCGGATGGG
ATATCCCCAT AAACTTGTAG GATCATACGT TGAAAATCGT TACAGGCCAC CAGAAAACGA
TACGTCATTG ACCTCTATTA GCACCAGATT GCGAATGGAC GTGGTCCGCG ATAAATTTGT
GTCAAAAGAC GAAAAAAGCG CGGCAGAAAT TGGTGTTGTA GAAAGGAGTA GAGAGACAAT
AAAGAATGAG AAAGTATCCG CATATATCGA TGACTCGCGG GCTGGCGAAG ATCTCCCAGG
ACCAGAAACG ATGCATGCCT CTTCGGGTGA CTCCAAAGGT GATAAATTTT CTGAAGAGGA
CGGACTGAAA CGTGTCTTGG TAGACTATGC CAGCAAATCT GCTGGTGCTT TAATTTTGGA
GAAATCGTCG AGTTGGAACG GGATTTCCAA CGTCCTGAAC GGTGACAAGG ACAAGTACGC
AATTATTCCC TGTGAAGAGC CCCAAAAGTC CGTCGTCATT GGACTCTCCG AAGACATTCT
TGTGAAACAA ATAGTACTTT CCTATTATGA ACGATACAGT TCGCATATTG GAACCTTTCA
AGTGATGGGT TCGCCCCAGA CAATGGGTAA TTGGGTTGAT TTGGGTACAT ACACATCACC
ACGAGGGAAT GGCAAACACG CATTTGATTT ACACGAGCCG TCTTGGGCAC GGTATTTGAA
GTTTCGATTT GTATCGCATT ACGGAGATGA GCATTACTGC ACCGTGAGTC AGATCAGCGT
CCATGGGAGT ACCATGTTGC AAGGTTTTCA CGAGCAATGG GCCGAAACGG TTGAAGAGCA
GCCAAACGAC AAAAACGAGA GGGACGTAGA CGTATCAGGG TCGAAAATAG ATCCTACGTT
TTCGGCTACG GATCAAGAAA ATGGCAATGA CGGCAGCGTC CAAGGGACTG TATCCACTAT
TGGACAATGC TACACCAGAT TGGATGCTGT ATGTCAGATG GATTACAGTT TTGAACGGAG
TGCATTTTTG TTCGCATCCG GTAGGAGCTC TACACCTGAC TTTGATTTAT TGAGTGCGCT
TTCATCCGCG TCCTTTTGTC AACTTGGCAG GCAATCGGCA AGAACAAATT ATTCTCATTT
TGTTGAGCTA GGCCGTCGTG CGCTTACCAT CTCTCCGAAA CGTGGTCGTA GTACCAAGAG
TAAATTTGTT GCCGATTTAT CCGATCAAGC TTTGTTTCAC TCACTTACGG AATCTGTTGT
TGTTAAACAC ATTCAGAGTC TCATCTCTCG AACGACAGGG ATAGACATAC ATGTGGAACG
TTTTGGTGTA TTGGCAACAG TTGATAGGAC ACCCGACAGA ATTTCTGTCG ACGATTCGAA
TCCCCCTGCA ACTGTTTCAT CAGCTTCTGG GGTGATCGCC GGCACCAAGC TGGTTACGTC
TGAAGTCGAA GCCATTGAGC GTATCACGGA TGAGATTGAA TCTCAGCCGT TATTGCAAGC
CATTCAACAG ATGGAAGAAA AGATTCCTTT CGATACCGCG TTTCATGCAT CTGGATTTTC
ATGGAGCAAG ATATTGGAAC AGCTTCCTAG CGCAGCTTGT CTGGAAAAAC TCGATTTCGC
TGATTTTAGA TCCGGCAAAA AATTGAACTT GCGCAATGGG GGGCCGGGGT CGCACGGCAA
CGCCCAAGGC GGAGGTGGTA TGGAGCCAAT CTTTAAAAAG TTCACTGACG AGATAAAAGC
GCTGCAGACT AGTGTTTCGA TTCATGATCA ATTTTCCAAG GCACTGGCCT CTTGTTACCA
ACAAGTATTC CTGGAATTAT TGGTGGAAAT GGATGTCAAA CGCAGTGACA TTGATAACCG
GATTTTCCAG TTGGAGAGGA AAATGCAGAG TGGTTTATTT TTTTTCTCTG CAGTATCTCA
ATGGATGTCT CCAATTATTG GTGGTGTTGT AACGATTTCA AAGCTACCGA TATCGCTTTC
TTTCCAAAAC CGCACAATCA TTGA
 
Protein sequence
MKRLRGKRIT LLLASVIPLL AVDDVPMWQS TRDSDEEHPA DQEKELSCRE QTQIDRGTKQ 
FREPLHRPNG DRDSALLRQS LNPDILFEGS EFGQHYNIVV PFRMGYPHKL VGSYVENRYR
PPENDTSLTS ISTRLRMDVV RDKFVSKDEK SAAEIGVVER SRETIKNEKV SAYIDDSRAG
EDLPGPETMH ASSGDSKGDK FSEEDGLKRV LVDYASKSAG ALILEKSSSW NGISNVLNGD
KDKYAIIPCE EPQKSVVIGL SEDILVKQIV LSYYERYSSH IGTFQVMGSP QTMGNWVDLG
TYTSPRGNGK HAFDLHEPSW ARYLKFRFVS HYGDEHYCTV SQISVHGSTM LQGFHEQWAE
TVEEQPNDKN ERDVDVSGSK IDPTFSATDQ ENGNDGSVQG TVSTIGQCYT RLDAVCQMDY
SFERSAFLFA SGRSSTPDFD LLSALSSASF CQLGRQSART NYSHFVELGR RALTISPKRG
RSTKSKFVAD LSDQALFHSL TESVVVKHIQ SLISRTTGID IHVERFGVLA TVDRTPDRIS
VDDSNPPATV SSASGVIAGT KLVTSEVEAI ERITDEIESQ PLLQAIQQME EKIPFDTAFH
ASGFSWSKIL EQLPSAACLE KLDFADFRSG KKLNLRNGGP GSHGNAQGGG GMEPIFKKFT
DEIKALQTSV SIHDQFSKAL ASCYQQVFLE LLVEMDVKRI GEENAEWFIF FLCSISMDVS
NYWWCCNDFK ATDIAFFPKP HNH
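
Consistency check (illustrative)

The record's numeric fields can be cross-checked against the listed sequences. The sketch below is a minimal example and not part of the source database. The helper names (clean_sequence, span_length, gc_fraction) are hypothetical, and it assumes the Start bp and End bp values are 1-based inclusive coordinates, which is consistent with the stated Gene Length of 2304 bp. The toy input is made up and only mimics the 10-letter block layout used above.

```python
def clean_sequence(block: str) -> str:
    """Join a block-formatted sequence (10-letter groups, line breaks) into one string."""
    return "".join(block.split()).upper()


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Coordinates taken from the Gene Information fields of this record.
gene_length = span_length(608518, 610821)
print(gene_length)  # 2304

# Made-up toy input in the same block layout (NOT the real gene sequence).
toy_block = "ACGTACGTAC GGCC\n"
toy_seq = clean_sequence(toy_block)
print(len(toy_seq), gc_fraction(toy_seq))
```

To apply the same check to this gene, paste the full Gene sequence block into clean_sequence and compare the resulting length and GC fraction with the Gene Length and GC content fields. Because the gene is marked as spliced, the protein length is not expected to equal the gene length divided by three.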