Gene PHATRDRAFT_45381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45381 
Symbol 
ID7200004 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp1009981 
End bp1011395 
Gene Length1415 bp 
Protein Length451 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179335 
Protein GI219117081 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.22287 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GAAACTCACT GTCCATGCCG CCAGCCTTGC TGGATCGACA CATGGCAATG AATCTCTTCG 
GAAAAAGATA TTCCTTGGTT TGGTCACTGT CGATGGCAGC CGTTGATACG AAAGCTTACG
TGTCCGAGAA CGTATGGACA CACCATCATC GATGTGCTCG GCTCTCTTCC AAGAAGGGCA
GTCGAAACAT TATGAACGAG GCAGCCTCAC ACAAAATCGG ACACAGAAAT TCTGTTCCCG
CCTACTCATT GGCAGGTGCG TTTGTCACCA GATGGAAGCG AGGATGCAAT ACTCATGCAG
ACTGTTGTAG TCTAGAACTC ATTCCCATTA CCAAGAATCG ACAGACATCG TATCTCGATC
CAGATTATAC GCATTCAGCG GGATCCCAAG CCTGGGCGGC TTTGCACGAA CGTTTTACAG
ATAAGCAGGC CCGTGATGCG CTTTTTCAGC CCGAAATTGT TCTACAGTGT CGCGAGGTTG
GGCACAGGAG CCTCCGTGTA TCACTTTCCA ACAAAACCGT TGACTGTGAA ATGGATATTG
CTTTGGTTGG TGTTTTGGCG CGTGTACTTG TCCAATGGAC ATTGGAACCA GACGGTGGCA
AACCTACTGG CCAATCATGG ACCATTACTC TTACTATGGA AGAATCCAGT CTGACTTTGG
TAAACGGAGT GGACGACGAA AATATGAAGG AATTGTTCGC TAAGTACCTG GACCTATCCA
GCTCAAGTGT CGAAATTGTA GAAATGATGG ACCGGGACGG CAAGATACTT GGAAAAGTCC
CACGAAATCT TGTGCATGAA TATAATTTGC TGCATCGAGG CGTCGGCGTT TTCATTACAC
GTGACGTGCC CCTCCAGCTC CCGATCCACG GTGCGAGAAA CTCCAGTCCA AGTCCAACTC
ACCAACCCGA CTTGTATTGC CATCGCCGAA CCTCTACCAA ACGGATTTTT CCTGATCTCT
ACGATATGTT TGTCGGTGGC GTTGCCCTGG CTGGAGAAGA TTCCCGCAGG ACTGCGTTGC
GCGAAGTCGG CGAGGAATTG GGGCTCGCGC AAGGGAACAT TGGCGATGAG GCCATCTTAA
CGTGCGTCGT TTGTACGGGA TACAACCGAT GCGTCGTGGA TCTATATTGC TATGTAATGA
ACACAATGGC CGAAAGGGTA TCGTGGCAAG CGGAAGAAGT GGCGTGGGGG GATTTTGTTC
CTTTCAACGC TGTTCAAGCG TCAGTCGATT TATCGATTCA ACGATTAGTA TCCGACGGAT
CTTGGCCCGG ACGCTATCCA CCAATCCAAT CGAGCTATAA TGGTGTCTTT CCCAAAGACG
AATTTTCATC GATCAGAAAT TGGGACTCCT GGGACTACGT TCCCGACGGC TTGCTCGTTT
GGGAAGCGTG GTTGCGCTAC CTGAAAGAGG ATTGA
 
Protein sequence
MPPALLDRHM AMNLFGKRYS LVWSLSMAAV DTKAYVSENV WTHHHRCARL SSKKGSRNIM 
NEAASHKIGH RNSVPAYSLA DCCSLELIPI TKNRQTSYLD PDYTHSAGSQ AWAALHERFT
DKQARDALFQ PEIVLQCREV GHRSLRVSLS NKTVDCEMDI ALVGVLARVL VQWTLEPDGG
KPTGQSWTIT LTMEESSLTL VNGVDDENMK ELFAKYLDLS SSSVEIVEMM DRDGKILGKV
PRNLVHEYNL LHRGVGVFIT RDVPLQLPIH GARNSSPSPT HQPDLYCHRR TSTKRIFPDL
YDMFVGGVAL AGEDSRRTAL REVGEELGLA QGNIGDEAIL TCVVCTGYNR CVVDLYCYVM
NTMAERVSWQ AEEVAWGDFV PFNAVQASVD LSIQRLVSDG SWPGRYPPIQ SSYNGVFPKD
EFSSIRNWDS WDYVPDGLLV WEAWLRYLKE D