Gene PHATRDRAFT_40967 details

Gene Information

Locus tag: PHATRDRAFT_40967
Symbol:
ID: 7198818
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011695
Strand:
Start bp: 8116
End bp: 9601
Gene Length: 1486 bp
Protein Length: 458 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002185028
Protein GI: 219129716
COG category:
COG ID:
TIGRFAM ID:
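
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene span includes one stop codon; the function name `span_length` is illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values taken from the record above.
gene_length = span_length(8116, 9601)

# Assumption: 458 codons for the protein plus one stop codon.
coding_length = (458 + 1) * 3

print(gene_length)                  # 1486
print(gene_length - coding_length)  # 109
```

Under these assumptions the span matches the listed gene length of 1486 bp, and the 109 bp not accounted for by coding sequence is consistent with the gene being marked as spliced.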


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATGTA ATGATGATGT AAGTAATATC CATTGCAAGC TCGACAAAGC ACCTTCTACC 
ATGCTTGACT GTGAATCCGA TTTGACACAT CATTACGATC ATCCATGCCT ATCATCGTGG
GAAGCACAAA CGACAACTTC TCGCGGAATC GTCCTCTTGC GGATTGACTC ATTAAAAAAC
GATGCACCAG GATTCGCAGC TGCATTAGTC CCTATGCTAC AACATGGGGT CACAGACTTG
CTCCGTCTTC GGGCGTCAAA ACCAAGACAA ATGACCTTAG ACCAAGCACC TGTCACACTC
TTTTTGAAAA CAATCCTTTG GATTCATCTT AATTGTACGA CGATAGACCC GACACTGGGC
AAGGAAATTG CAAGACACGG ATCGCACGTT CAGCTCGTCA AGCTCATTCG GCAGGATCCG
TCTGTGCTGG TCCTGGACGA AGCCGAGCAA GATTCCATAA TGGAGCTGCA AGAGGTTTCT
TGCCGGATTG CGGCACTGGG CGTTTTCCCC GAACCTTGGA ATCCTTTCAC GATTGAGGAC
TTGCGGCACC GTCTACCGCT TGTCTTCCAC GCCAAGCCAA TCCTTAGCGA CTTCATTGGT
GTAAAAGCAG GGGAAAGTGT CCATGTCAGC GTGACAAACG AGTCAAGGGA GTTGGCACCA
CAGACGGTCT TGATTCACCA GATCACAGAT CGGCAATCGG CACAAGAAGA CGTCGGATTT
GGTACGTAAC GTGGAACGTT CTCGATTGAG CCTTTTCTAT TGGCATTGGT ACAAACGCCA
CTCAATAGCT TACTGATTTA TTGTTATAGT GACGTGGCCG TCAGCCGTCA CATTGTCAAG
GTGGCTAGTA GCAAATCCAG ACATTTTACG GGGGAAATCA ATATTGGAGA TTGGCGCTGG
CTGTGGTTTG ACCGGCTTGG TTGCTGCACG AATCGTCGTA CACGAGGGAC TAAAGCAGGT
CGAGGATCCT GTTCAGAGTT CCGCATTGCG TGAACAGTTA TTGCCCCAAG GTGTACTCAC
TCTGACCGAT TTCAACACTC GCGTTCTTGC CAACCTTGAG CGGAACGTAG AGCTCAATGG
TGTGTCGTCT GTGTGCAAAG TCCATGGACT GGACTTTTAC CAACAATCAG GTCAATCCCA
CTCCTGGACT GATACGAGGG GAAACGAACA ATCGCAGGTA GATATTATTC TTGCAGCGGA
TGTTATATGT CAGCCAAGCG ACGCTATTGC TACCGCCAAT ACAATTCACG ACGCTCTTTG
TCCTCATGGA CAGGCATTTG TTGTATGCGC TGACGCCACT CATCGTTTCG GAGTCGATTC
ATTTCAATCG GAATGTACGC GTGTTGGTTT GACCGTTTCC ATTGTGAAGA TCATCATGAG
CAATTCTGAT CCAGATCATA CAGCAGCGAA TGACTATCAA GCAACAGCAG GGTATGTGGA
GGGTATGAAA ATGACCATGT TCCATGTGAG GAAGAGTCTG CAATGA
 
Protein sequence
MKCNDDLDKA PSTMLDCESD LTHHYDHPCL SSWEAQTTTS RGIVLLRIDS LKNDAPGFAA 
ALVPMLQHGV TDLLRLRASK PRQMTLDQAP VTLFLKTILW IHLNCTTIDP TLGKEIARHG
SHVQLVKLIR QDPSVLVLDE AEQDSIMELQ EVSCRIAALG VFPEPWNPFT IEDLRHRLPL
VFHAKPILSD FIGVKAGESV HVSVTNESRE LAPQTVLIHQ ITDRQSAQED VGFVTWPSAV
TLSRWLVANP DILRGKSILE IGAGCGLTGL VAARIVVHEG LKQVEDPVQS SALREQLLPQ
GVLTLTDFNT RVLANLERNV ELNGVSSVCK VHGLDFYQQS GQSHSWTDTR GNEQSQVDII
LAADVICQPS DAIATANTIH DALCPHGQAF VVCADATHRF GVDSFQSECT RVGLTVSIVK
IIMSNSDPDH TAANDYQATA GYVEGMKMTM FHVRKSLQ
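
The sequences above are printed in blocks of 10 residues, 60 per line. A minimal sketch for stripping that layout and computing basic statistics such as length and GC fraction might look like the following; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the GC calculation simply counts G and C over all characters, which may differ from how the database derived its GC content value.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAAATGTA ATGATGATGT AAGTAATATC CATTGCAAGC TCGACAAAGC ACCTTCTACC"
print(len(clean_sequence(first_line)))  # 60
```

Passing the full gene sequence block to `clean_sequence` and taking its length gives a quick check against the listed gene length.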