Gene PHATRDRAFT_49659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49659 
Symbol 
ID7198148 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011691 
Strand
Start bp333353 
End bp335809 
Gene Length2457 bp 
Protein Length629 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184444 
Protein GI219128488 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ACCTTTGTCG GTATCTCGCT GATCTTGAGA AAGCTAACCG TTTGTGTACA GACCATTGAC 
TTTGATTCGA TTGCGTCCTA CTTTTGGTTT CCTTCCCTCT GGGTTCAGGA TGTGCTTTTT
GTGTTGCTAG TCGACGCATC TCTTTCGATG ACGTCTACGT ACAGTGTATC CGTATTTCTT
TGGACCGCAA GCGTAACAGC TGTTGTGTGT ACGATTGGCA TTTGCACCGC GGAAGTAACC
TTCTGGATCG GTCATGGTTT CGGTGTTCCC TGGGATCGGG TCCCACTCGC GTGGTATCGT
TGGGAAGAAT TCCAAAGCTT GATTTCCTCC AAGGCCGAAG AAGGTACGCT CAAGGTATGG
ATGGCTCTTA TAGGGCAATT ATCCTGGACA ATCTTTTTTG GCTTATTTAC GAATACGTGT
CGCAAGAAAT TGTGCCACTA TGAATGCAGA AGTCGACTTC ACATTGCAAA AAGCATAAGC
CTGACCGGTA CCACAGGTGC GCCAAGTAAT GCTAGAGGAT TTACTCATTC ACGCACCGCA
AAGATGAGCG CTGTCGCACT TTCAATAATT TACAGTGTGT CTGTTCTCCT TCTACGACCG
TCTATTCCAT ATACGAGGCT TTCGATGACA CCTTTTCTAA AGGTAGCACT GGAAGTCACA
GAGGGCTTTC AAGAGTATCA CAAGTTGCTT CGCGCAGCAA ATAGCAAAGA CTCCTCGGGA
GATCACAACA AGTGGTGGCG AACAGAAGTT GACACAATGA AACAGCTTGT TCAAAGAGCC
CCGTTCGAGC GTGATTATAT GGATGAGCCA ATCAACGTTG TTCTGGTATT TTTGGAATCT
GTTCGGGCAG ATATGATGCC GTTTGATGGC TCTACACCAT GGGCGAGGCG ATTTGTGCCA
AATGTCACCA TCCATGACAA GATAACACCA TTTTACAATC AATGGGTCAG AGACTCCAAT
TCAACTCTAT ACATTCCTCA TATGAAATCG GCTTCTGGTT TTACTCATAA AAGCCTCGCA
AGTACGCTTT GCTCTTTGCA CGCCTTGCCC CTCCCCGGCA CCGTTGAGCA TGCACAAAAT
CTTTATCATC CCTGTCTTCC GCAGATACTT GACCGACTAG GGTATGAGAG TCAGTTTTTC
AAATCCTTGA CCGAAACCTT TGACCACCAA AACGACCTTA TGCGAAACAT TGGATACCCC
AGAATGTATG GGCGAGAGAG CTATGATCTT GCCCACAATG TATCTGCAAT TTTCCAACGG
GACCACAAAG CCAACTACTT TGGATACGAA GACATCGTGT TGCTGGAAAC TCTCACAGAA
TGGGTTGAAA ACCAGACTAG GCCCTTTTTC CTTTCGTACC TGAGTGGTAT CACCCATGAT
CCATACGAAA TTCCACCTCG TGGAGGGGGT TGGAAAGCTC AATCATTCAG TCCGGATTAT
AAGGCAAACG GCTATCTGAA CGAAGTGTCC TATCTTGACA CGTGGCTAGA ACTCTTGGTC
AAATCATTTG AGGATCGCCA CTTGATGGAC TCGACTCTGT TTGTCTTTTT GGGAGATCAT
GGTGGGCATT TTAAGGATCG AGATTCCAAA TTTACCACAT TCGGTCAGAA ATACGAAGAA
GCTTTTGACG TTGGCGTTAC GTTTCACAGT CGCAATCCGC GGATCCAGGG GCTGTTGCAA
AAAGCACAGG TATTTGTGAG CGGCAATTGG TCTTCGCTCG ACATTGCCCC AACTCTGCTT
GAAATACTAT TTGGAAGAGC GGTAGACCCA ATGCGGGTAG TATCAGAGAA AGGATTAAAA
AACAGTCCAT ATTCCAAGTA CTTTGACAGT CGCGCGAGCG CAAGCTGGGT CGACGGTCGG
TCCATGCTAC GAGAATCAGG ATCCCGACTA CGATTGAGTG TAGGCAACCC TGGAGAAAGC
TTGATACTCC GAGACATGTG TTTTTTGCTG GTGTTTCCTC TAAAGAAAGA CGACCAATCT
CATCCTGAAG CGTTCAACAT TTGTGCGGAC CCGGGTCAGC ATCAACCGTT GCAGCTTCTG
CCGGTATCGT CGCTTTCGAA ACCAAAGTCA AAACTCGAAA AATGGGGCCA AAAAGCCATG
ATGTTCTGCC TACAAGTCAA ATTAGATTTA GTGCGTGCTC ATGAAACGGG TATGCGTTGC
CAAGATTGTG CACTCGAAAA GTTGGTGACT TTGGAGACTT TGGAGAGTTG GTCTACCACT
GTTGAGAGGG CAAACCTCTC TGATGCTAGT GCTGTGTGAG CATGGAGACA AGTGTATTTA
CTGTTTCGGA TAGTCTGACC GTGAATACTA TTATTTTGAC ATTCTACCGA AAGCATATCA
TAAAGACGCG CAATGGACTA TGAGACAGAA AAGAAACGCT CTTGCTTCTT ATTTCCAAGG
ACAACCGACT GCTACCCAAC CATAGAGGAA AGCTTATTAA CAATACTTTG TGACGTC
 
Protein sequence
MTSTYSVSVF LWTASVTAVV CTIGICTAEV TFWIGHGFGV PWDRVPLAWY RWEEFQSLIS 
SKAEEGTLKK LCHYECRSRL HIAKSISLTG TTALEVTEGF QEYHKLLRAA NSKDSSGDHN
KWWRTEVDTM KQLVQRAPFE RDYMDEPINV VLVFLESVRA DMMPFDGSTP WARRFVPNVT
IHDKITPFYN QWVRDSNSTL YIPHMKSASG FTHKSLASTL CSLHALPLPG TVEHAQNLYH
PCLPQILDRL GYESQFFKSL TETFDHQNDL MRNIGYPRMY GRESYDLAHN VSAIFQRDHK
ANYFGYEDIV LLETLTEWVE NQTRPFFLSY LSGITHDPYE IPPRGGGWKA QSFSPDYKAN
GYLNEVSYLD TWLELLVKSF EDRHLMDSTL FVFLGDHGGH FKDRDSKFTT FGQKYEEAFD
VGVTFHSRNP RIQGLLQKAQ VFVSGNWSSL DIAPTLLEIL FGRAVDPMRV VSEKGLKNSP
YSKYFDSRAS ASWVDGRSML RESGSRLRLS VGNPGESLIL RDMCFLLVFP LKKDDQSHPE
AFNICADPGQ HQPLQLLPVS SLSKPKSKLE KWGQKAMMFC LQVKLDLVRA HETGMRCQDC
ALEKLVTLET LESWSTTVER ANLSDASAV