Gene PHATRDRAFT_45844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45844 
Symbol 
ID7201092 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011676 
Strand
Start bp448506 
End bp450319 
Gene Length1814 bp 
Protein Length560 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180050 
Protein GI219118560 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTATGGTGCA TATTGAAGAA GTGAAAAACG TTCCTACTGG CATTCAACCC GAAGCGAAGG 
AAATCTCTAA GAAAGAAGCG TCCAATGTTG AAAAGTACAA CGATCTCTTG TCGACCATGC
AGAAAACAAA GGAGAAATGG GACGAAGAAG ATGGTGTAGA AGACGAACAA GATGCGCTAT
TATCACAGGC TATTGAGTTC GCCATCGAGC AAGGAAGAGG ATGGGCTCCT GGCGAAAAAG
ATGCTTATCT GGAAAAGATA TTGGACGACG ATTTCATTCC CCCCATGTTT TGCTCGACGC
CGGAAGAGCT GGAGAAAACG GGCTTACAAG AAGCCTTTAC CAGCCTAATA TATGACGGAG
AGGTACGTGG GAATCGCCTA GAAAACATGA ACAATTGCTT TACAGCTCTA ATGCATGCTA
CTTTCCTTGA ATCAGAGCCC AACAAGCTTG ATGCTGAGTT TCAAAAAGAA AGGTAACGAA
GCATTCACCA ACGGCAAACG GAACGAGGCC AAAAACATGC AGTACTATCG GGACGCAATC
AATCATTACT ACGAAGCCTT TGCCTGGGCG CAGAAAATAG AGCCTATGAT GGCCGGAGAT
TTGGCGCAGG CGGATACAGA CGAGCCAACA TACACGGAAG ATGAATTGGA TGAGCTGCGA
TCAAATATTT GCAACAACGT GGCTCTGGCG CACACTCAGC TCAAGAACTG GGGCTTTGTG
CGCGATGAAT GTCAGAAGGC ATTAACTTTC AACAACAACA ATGTCAAAGC ATGGTACAGA
TTGGCTAAGG CTTACCAAAT GTTGCAACGC TGGGAAGAAG CAGGTGACGC CATTGAATCT
GGATTGGCGG TCGACGGCGA AGAAAACAAC AAGGATTTGA GGAAGCTTCA AAAGCTACTA
TCTGATCGCA TCCAAAAAGC TCGTAAATTT CGACAACAAC GTGAACGAAA GAGAGCGGAA
CGCGTAATGA AAATCAAGAA AGTTTGGAAG CACTGCCAAG AAACCGGTGG CATTAAACTC
GGCCGAATTC CACTCGTGGC CACCGTGACA GATGCGGAAG AGGATGACGA CGATCGCGAC
GAGTCTCGCT GGCATTTTCA TCTACCACAT ACCGGACAGC TGCCCAGCGA AGAGCACGGT
GTATGGGCGT GGCCTTGTAT GTTTTTGTAT CCGTCTCACA ATCAGTCCGA CTACGTCAAG
CATTTTGCCG AAAGCGAAAT GTTAGCATTA CGCATGGCGG AGATGTTTCC AGAATTAGAA
GACTTAGGCG GTGAGACTCC CATGCCTTGG GACTACAACA ACGAATTTAG TTGTAGCCAA
CTAGCTGTCT ATTTTGAGAT TCAAGTGCCG GATACGGAAG AACGTGTGAT ACACCCAGAA
CACGTTGAAT TGCTGCGCGA TCAGGCTACC ACGATGCGGT TTTACGAGTC CTGCCGTGCG
CTACAGGGAG ACGAAGGCAC GGCAATGGCG GAAGTAGTAC GGGCGGTGGA GCGCAAACAT
TTGTACCAGC AACGAAAGGC TTGGACAAAG CGTCACGGCA GTCTGTGGGC GAAGCCCGAT
CCATGTTCGG TCGTGCGCGT GCATCCAGCC ATGACCTTGC GAGGGGTGTT GACCGATCAT
CGGATGGTTG TGCCGAACGT AAGTAGTGGA AGCGTCTGCG ATCTTGGAGT TTTGGCGACA
TTTCTGACTC ACAGGCTTGG TTGATCGTTT TTCAGTTTCT GGTAACTTTT GTGATTTTTC
CAGAAAGTCA TCCAGCTCAT GCAGCCTACC TCAAAGAACA CGAATGTGTT GGTCTTTTGG
AACCGACAGA ATGA
 
Protein sequence
MVHIEEVKNV PTGIQPEAKE ISKKEASNVE KYNDLLSTMQ KTKEKWDEED GVEDEQDALL 
SQAIEFAIEQ GRGWAPGEKD AYLEKILDDD FIPPMFCSTP EELEKTGLQE AFTSLIYDGE
SPTSLMLSFK KKGNEAFTNG KRNEAKNMQY YRDAINHYYE AFAWAQKIEP MMAGDLAQAD
TDEPTYTEDE LDELRSNICN NVALAHTQLK NWGFVRDECQ KALTFNNNNV KAWYRLAKAY
QMLQRWEEAG DAIESGLAVD GEENNKDLRK LQKLLSDRIQ KARKFRQQRE RKRAERVMKI
KKVWKHCQET GGIKLGRIPL VATVTDAEED DDDRDESRWH FHLPHTGQLP SEEHGVWAWP
CMFLYPSHNQ SDYVKHFAES EMLALRMAEM FPELEDLGGE TPMPWDYNNE FSCSQLAVYF
EIQVPDTEER VIHPEHVELL RDQATTMRFY ESCRALQGDE GTAMAEVVRA VERKHLYQQR
KAWTKRHGSL WAKPDPCSVV RVHPAMTLRG VLTDHRMVVP NAWLIVFQFL VTFVIFPESH
PAHAAYLKEH ECVGLLEPTE