Gene PHATRDRAFT_49291 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49291 
Symbol 
ID7195467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011689 
Strand
Start bp484856 
End bp486573 
Gene Length1718 bp 
Protein Length531 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184003 
Protein GI219127564 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.955571 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TGAGCGACAG CGACAGCTTA AACGAGAACA GCAGTTGTGG TAGTAGTGCA CGCGGACGAC 
AGTATGCTCC ACTGAATCCA ATCCGAGACG AACGTGCAGT ACTTTTCAAA CTCGAAAGAA
TCATGAGACG GCCTCTGCGG CTCTGGTACA AGGAACCTCC GCCGCCTTCA CACGATCCTC
CATCCCCTCA ACTACATCAT CACCACCATT CACGATTTTA TCATCCACAC TGGGGATTTC
TCCTCATCTT ATCGGGAGTT GCCTTGCAAA TGTGGGTTTC CGTTACGCAA TTATCGGCCT
TGGAGGTACC CGAAAACAAC CTCGACGTGC CCTTGCTCCC AGCCTTTCGG AGCACCCGCG
AAGATTCGTC ACGAAAGACG CCGGCGTCCA CCAGTGCGTG GAGGCTCTAC GTCGGTCGTG
TACGGTACGG CGTGCCCGTG TCCGCCGCCT CCTACACGTG GGAAGCGCAA CCCCCGCGAG
TCGTTCGTTT GGCTCACGAC GGATCCTACC AACATCCGAT TCCCTCCTCC CGGAATCGCG
TGGCCCGACG GAATGTGGCG GAACTAGGGA ATGCTTCCTG TACCTACTGT CAATACGAGT
CGGATCAACA TCTATGGGAC TTTGAAAAAC CCTTTTACGA GGACTGTACC CCCATGGCGA
AATGGCAAAC CATCTTTTAT CCAACCTGCA ATGTCTTGCA CGAAATTACC ATGATGCACT
CGGAGGAAGA CTACGATCCA AAAGGACACG ACTTGGCCGA CGAGAGTATC GATGAAATTG
AGGATAACGA CGATGACACC GTCCCTTCCA TCGAAGAAAC CACCTTGCTG AGTATGCAGG
GCAGTTGGCG GAGTGTTTGG AAGTACCAGG ATGCGGTCAA CGATACGGCC GTACTCAAAG
TGCTGAAACA CAGTCGAGAA TTCGATCACG AATCTTTCGC CTACCACATT ACCGATGCCA
TAGTCATGGA GCGCTTGACC TCGTCCCCAC ACATTATCAA CGCTTACGGA TTCTGTGGAC
AATCTGTCTT AACTGAATTC GCTTCGGGCT CGGCGCGCAA GCTCATTAAG GACCCCAAAT
TTAACAGCAA GGAACGGCTG AAAATGGGAA GAGATTTGGC TCGGGCCTTG ACAGCAATGC
ATTCAATCGA CTTTCCCAAC AGCACCAACC CCACGCTGGC TCACAATGAC ATTAACATTG
CGAATGCCGT TGAAGTGGAT GGTCGCATCA AGCTTAACGA CTTTAATCTG GCCGTCTTGA
TGCGATGGAA TGATACACAA CCGTGTGGCT ATCCGGTACG GTTCGATCGA CCCATGTGGG
AATCACCGGA AGACGTCCGC AACCTCACCT ACGTAGATCC GGCACTCGGG GACGTCTACA
GTCTCGGCAA TCTACTCTTT AGCGTATTGA CAACCCGACA ACCGTGGCTA CATCTGGAAC
CGAACGGCCC CTACAACAAA ACAGAGGTTG CACAAATGAA AACGCAAGGC ATCATGCCAG
CTATTCCCGA TAAATATTTA GAATCGCGCA AGATGGCGCA TCACGCGTTG TATTTCGCCA
TCCAAGCCGC CTACCGAGAT GATCCGGCCG AGCGGCTCAG TTCCCACGAA CTGGCGGAAG
CTTTGGGGAT CGCTCTAAAT TGGGGTCGGG ATGGAAGACG CACCTCCCGC ACCGACCTTG
CCATGCTGTT TGTCAAGCCG AGACCCGACA TGTACTAA
 
Protein sequence
MRRPLRLWYK EPPPPSHDPP SPQLHHHHHS RFYHPHWGFL LILSGVALQM WVSVTQLSAL 
EVPENNLDVP LLPAFRSTRE DSSRKTPAST SAWRLYVGRV RYGVPVSAAS YTWEAQPPRV
VRLAHDGSYQ HPIPSSRNRV ARRNVAELGN ASCTYCQYES DQHLWDFEKP FYEDCTPMAK
WQTIFYPTCN VLHEITMMHS EEDYDPKGHD LADESIDEIE DNDDDTVPSI EETTLLSMQG
SWRSVWKYQD AVNDTAVLKV LKHSREFDHE SFAYHITDAI VMERLTSSPH IINAYGFCGQ
SVLTEFASGS ARKLIKDPKF NSKERLKMGR DLARALTAMH SIDFPNSTNP TLAHNDINIA
NAVEVDGRIK LNDFNLAVLM RWNDTQPCGY PVRFDRPMWE SPEDVRNLTY VDPALGDVYS
LGNLLFSVLT TRQPWLHLEP NGPYNKTEVA QMKTQGIMPA IPDKYLESRK MAHHALYFAI
QAAYRDDPAE RLSSHELAEA LGIALNWGRD GRRTSRTDLA MLFVKPRPDM Y