Gene PHATRDRAFT_43746 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43746 
Symbol 
ID7197031 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp1358780 
End bp1362168 
Gene Length3389 bp 
Protein Length954 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178125 
Protein GI219112747 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.213644 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTCTCTTACA TCGCTCTGGA AAAGCAGCTC TAGCGTCCTG AGCCGAAAGA ATTAGCGTAG 
CTCGGAATAC CTGTCCTTCT CTTTTGAACC GCTTCCATGC TTGCCCATCA AGAGGATCGA
CCCCAACCGA CGCGGAATGC CTCGCAAAAC GTTTCGTCGG AGCGATTACG ACTGACGGAA
GAGGAGGTCG ATGCCCTGAT TGAAAATTTC GCCAAGGAGG ACGACTTGAC ACGTAAGACA
TGGACACGCC GCGTCGTGGA AAATTTCCTC ATGAAGGTAC GAGTAGAGCA TTCGAATAAA
ATAAATCGGA TCGACAGACT TTGTTGTTCT TGGAAATGAT TTTCGGCTCA CTGCTTTGTG
TTCTGTTTGA TTCTCACAGT ACAAGTGGTA CTTCCCCCGC CGCGATATCA AGGGTGCTCC
TTCGTTGAGT ATGGCGTATG CGTACTACGA GCATATTACA CTCCCCCGGC ATTTTGCCGG
CGGGGAACAA ACGGCAGAGC ATGTTCTGCG ACGGGCCGAA CCTGGGGAAT CGCAAAGTAC
GGATCTATAC AATCCACTCA AGACACCGTC GTCTTCCTTT ATTGAGTACG TAGTGTTCGA
CGCATGTATG GATCGGCGCC TGGCTTTATA CATCTGGCAT TCCCTGACGT GAGATCTTAC
TCCTTGTAAT TTCTTTTAGA TGGGGCATTG GTGTGGATCT ATACTTTTCC AGTGTGCGAA
TTATGTCGAT GATTCTCTTG CTGGCGGGTC TGCTCAATAT CTACAGTATT TACTACTACG
GTTCTACGGA ATACTCGCCG AACGGAAAGA ATTCATTGTC AACCTTTTCG CTAGTCGGTA
CCGCCATTTG CACCACCGGC GACTGGGTTG TCTGTGCAGA AGGATGCACT CAGGAAGGTT
ACTCCTCCGA AGGAGAAGAC GACCGTTTTG GTATTGCAGA CGACGGAACA GTTTTGGTCG
TTCGCAATGG CTGCGACGAC GGGAGCTTCC TGCAAAATGG AATGGTCAAT TGGATTACTC
TTTTGTTTTT GGGTATTCTA ATGGCTTTGG TGTCGCTTTA TCTTAAAGCT CGTGAAGTTC
GATTCGACGA GGACAAGTGA GTGAAATCTT CATTTTGTGA TTTGTTTGTC TGCATTTGAT
TTTGGTTTTG CTAACTCTTT AATCTATAGA TTAACGTCGA CGGATTATTC CGTCATCGTC
AAGAATCCTC CTCCTGATGC ATACGATCCG GATGAGTGGC GCGATTTCTT TGCACAATTT
GCGGAAAAGC AGGTAACAGT GGTTACCGTA GGGTTAAACA ACGAGTCGCT TTTGAACCTA
CTGCTGGCGC GTCGTGTTCA CCGGCATAAT TTGCGGCTTA TGCTGCCGAA AGGAACCGAC
ATGGATGATG AGGACCAGGT CCGTTCAGCA GTAGCCCGAC TGATTCAAGA CCGAGAAGCC
GAGCCTGATG GTTGCATCAT GCGCTTCTTG GGATGTGCGG TATTTCCTTT TCTGCGAATC
TTCAACATGT TCTTGCCGCC CGAAGTTTTG GTGGATCGAG TCTTCCGTTT GACCGACCAA
ATCAAGGAGC TTCAGGGGGA GAAATACACG GTGTCGAACG TCTTTGTCAC GTTCGAAACC
GAGGAAGGAC AGCGCGCGGC TCTTGCAGCT CTCTCTGTGG GAAAGCTTGA CGCAATCCGG
AACAACACGG CCAACTCTGC CCCTAGTGCC ATTTTCCGTG ACCGTGTTTT GAAGGTTGAA
GAGCCAACCG AACCAAGTGC TGTTCGCTGG ATGGATCTCA GCGCCTCGAC GTTGCGAAAG
ATCATTCTCC GCATATTGAA TTTGCTCATT ACACTGGGTG TCGTTTCTTT TTCTGGATAC
TTGGTAGCAA AAGTTCGTGA AAACTTGGGG CCTGGATACT CTGGCCCTTT GGTTTCCGTA
TTCAATTCGA TTATTCCACA GATCGTCAAG CTCTTAATGA TCTTCGAACC CCATACCACC
GAAGGGAGCT TCCAAACCTC TCTGTACTTG AAGATTACTC TCTTCCGTTG GGTGAATACG
GCCGTCCTTA CGAAATTAAT CACTCCTTTT ACAAGCACCG TTAGTCCTGA AAGGACCAGC
GTCTTGCCGA CAATCAATTC CATTCTGTGG TCCGAGCTAT GGCTAGTGCC TGGTCTACGT
TTATTGGACC TTTGGGGTAA CATCCAGAAG CATGTGCTTG CTCCACGTGC TCGAAACCAA
GAACTTATGA ATTTGAATTT CCAAGGCACG TTCTATAACC TCGGTGAGCG GTACACAGAT
TTGACGAAAG TGCTCTTTCT TTGCTTTTTC TATTCAGCGT TATTTCCGTC GACCTTCTTT
TTCGGTGCGG CAATTCTCTT TGTTCAATAC TATGTAAGTT GTGTACTGCT CAAATTACAG
CCATTCTTGC CTTCGACTTC TCACGGATAC TTCAACCATT GCTATAGACC GACAAATACT
GCTTGATGCG CATCTGGGCA TGGCGACCAT TCATAGGGCC AGAGCTGGCA CGCTTCAGTC
GAAGGTATTT TTTTTCTGGG AGCGTCTTGG CCTTTGCTTT AGTAAGCGCG TACACCTGGG
CTCAGTTCCC ATACGACAAT GTTTGCGATC CAGATACACC CATCTTTACC AACGCAGCAA
GAGAATACTT CAATGTACAA TTCGCAAACT CCTCTACTGC CGACGTTGTC ACCGTTTCAC
AGGACACTCC AGTTGTAGCG TGCAGCCAGA GTTGGCGAGA AGTCAGTGGT TTTTCCTTTC
CTCCAACAAA GCGCATCCAG CCGGTTGGAT TAAGTTGGAT GAGCGACTCA CAAGAGACGC
TGACGAGCGT CTACGGCTGG ACAGCCGTTG CACTTCTTGT CGGCTTTCTT GTTTTCTTTT
TTGGTTCTTC GACCATTAAC TTCTTGTTGT CCTGGTTTCG TGGCATATAT CATACCAGTG
GTCAGAATCA GCGAATCGAT TTCAGCACTA ACCTGGAAAT TTTTGCTTAT GTCCCGCAAG
TCAAGCTGAA ATCTTTACCT TTTCCTCTCT TAGCTTGTAA CGTCGACAAT ATTGACAAGG
GTCTAATTGG TTGGAACGAC CCCGCACATT CATATGATGT TCACAACATG ATTTTCGACG
TTCCCTGGCA AGGAATGCCA AGGCAAAAAG CTGTTGAGAA TGAAGCGAGT ACAAGGGGGA
GCGTTCTTGG AGCCGAACAA GAAGAGGTTG GAGGAAATCA AAATTCACCG CAAGCCTTGC
CCAATGCGCT CGAAGTGAGA ACAGGGCAGC CTCCAATCTT TGCGGTTGTG AAACACTATC
CGCCCGAATG GAGACAGCGC GAGCTAAAGC TGTCCTAGCG CTTCGCTTTT GCTGTAATCG
CTTCTAGTTT AAACACACTT GTACTCACC
 
Protein sequence
MLAHQEDRPQ PTRNASQNVS SERLRLTEEE VDALIENFAK EDDLTRKTWT RRVVENFLMK 
YKWYFPRRDI KGAPSLSMAY AYYEHITLPR HFAGGEQTAE HVLRRAEPGE SQSTDLYNPL
KTPSSSFIEW GIGVDLYFSS VRIMSMILLL AGLLNIYSIY YYGSTEYSPN GKNSLSTFSL
VGTAICTTGD WVVCAEGCTQ EGYSSEGEDD RFGIADDGTV LVVRNGCDDG SFLQNGMVNW
ITLLFLGILM ALVSLYLKAR EVRFDEDKLT STDYSVIVKN PPPDAYDPDE WRDFFAQFAE
KQVTVVTVGL NNESLLNLLL ARRVHRHNLR LMLPKGTDMD DEDQVRSAVA RLIQDREAEP
DGCIMRFLGC AVFPFLRIFN MFLPPEVLVD RVFRLTDQIK ELQGEKYTVS NVFVTFETEE
GQRAALAALS VGKLDAIRNN TANSAPSAIF RDRVLKVEEP TEPSAVRWMD LSASTLRKII
LRILNLLITL GVVSFSGYLV AKVRENLGPG YSGPLVSVFN SIIPQIVKLL MIFEPHTTEG
SFQTSLYLKI TLFRWVNTAV LTKLITPFTS TVSPERTSVL PTINSILWSE LWLVPGLRLL
DLWGNIQKHV LAPRARNQEL MNLNFQGTFY NLGERYTDLT KRYFRRPSFS VRQFSLFNTI
HSCLRLLTDT STIAIDRQIL LDAHLGMATI HRARAGTLQS KVFFFWERLG LCFNTPIFTN
AAREYFNVQF ANSSTADVVT VSQDTPVVAC SQSWREVSGF SFPPTKRIQP VGLSWMSDSQ
ETLTSVYGWT AVALLVGFLV FFFGSSTINF LLSWFRGIYH TSGQNQRIDF STNLEIFAYV
PQVKLKSLPF PLLACNVDNI DKGLIGWNDP AHSYDVHNMI FDVPWQGMPR QKAVENEAST
RGSVLGAEQE EVGGNQNSPQ ALPNALEVRT GQPPIFAVVK HYPPEWRQRE LKLS