Gene PHATRDRAFT_45604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45604 
Symbol 
ID7200382 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011675 
Strand
Start bp668835 
End bp671288 
Gene Length2454 bp 
Protein Length648 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179895 
Protein GI219118232 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGACTCG AAGCGATGGT ACGTAGCGAC AAAGAAATTG CACCAGCATT TCCCTCCGTC 
GGAGACGACA GAGACTCCGA TACACCTTCT TTTGTGCAAC TCGTTGCTAA CGTGCACTTT
GTGCCACGCC GACGCGTCCC AGTTATGCCA CTTGTGCTCA ATCGGACCAA AACAACAGCC
GATATCAAGC GGACTGCGCC GCCGGATGGT GCCGAAAACA ACGTCCACGA CGAAACCAGT
CCAACGGAGT TGTCCGCATC CAGTTGTACG CCGACTGCGA CACCGACACG CGAAAGCCAC
GTTGCTAACG CGAACATTGC GACTCCCGAC ACCGAAGAAT GGGAATGCGA TGCAACGCCT
CTCTTGCTGC GACGGATGCG GTCGCTAGAT ATAGACTGGT CGGATACGAG TCCGTTCCCA
TCACGACAGA GGCCCAACAG ACTCGATGAC GATATAGAAA CGGCGTCGAT GTTGTCAATG
GGGTCTACCA GTTTTGTGTC ATCAACGCCA CGACGCAATC CAACTCGGCC CCGTAATCCA
ATTGCGGTGA TTGTGACGAC ACCCGATTTT CCATCGACTC GCTGTCGGCG TGCTTCCAAT
ACGAATACAC ACAGTCCCAG CGACTCCGTA TTTGACCGGT TGTACCAGCA CGGCCGTGCA
AAAATACGTG CCGAACGGGA ACGATCGACA CAGTACCCAT CGCGAACGAC AAATAGGTCT
GCAAATACGT TCCGTCAGCG TGGGTCGAGT GGAATGTCAA TTATTTCCAG TGATTCGCCC
CTGCATCCGA GTGAAGACTC CATATACGAA CGTTTATATC GCAACGAAAC TCGATCCCCA
CGTCGACTTT CCACCTGGTC GCCGAGTGAT TCGTCTTCAC TGCAATCCAG CACATCTTCC
TGGCAACGAA GACTAGATTA CACTCCGCCA ATGGACGTTC GGAAAAGGAG ACAACGTTCT
CGTCTGACAT CAAGGTCCGC AAATTCCACA AATGCTTCTA CAGCGGAGGC AACTGTCAAC
GCGAACTCTG TTTTTGAACG ATTGTACCGT CGGGAACCCC GGCCTCGATC CAACTTGTAC
AGTTTACCCA TTCATCAACG GAGACTTCCC AGCGTGAACG GCACTAAACG ACAGATAACC
GAGTCAATAG GCTGCTCGGC AGCTTCTCAC CCAGAGAATC CAAGTGATTC CGAACTTGAA
TTGTTGGCGC ACGAGATATC GCTGCTAACC AAAGAAGGAC ACGTGACTGA CGACGGGTCG
ACTTCGACGA CGACGAGGAA AGATTCCGAG CTGTCCATGT TAGCCCAGGA ATTGGCTTTG
CTTGCTGACG GACTAGAAGA TGAGGACTCG GAGGATATGA GGGGTTTTGA GCTGGTTGAG
CCAAATCTTT CTTCCACGTG GTGGATTGAA TCGCAGATAA TTGCTTTGCA AGCTGCTTGG
AGATTGCGAC AAAGACGTGC TGCTTTGAAC CGCGAAAGAA AGTATGCAAT CACAATTCAG
GCGTATTGGA GAGTCTATCA GGTGCGCAAG CAAAGATGGC TGGCCTTTAC AAAAATATTG
GAAATTCAGA AAGTCATCAG AGGATATCTC GCCCGCCAAA TGTTTAGGAA TATGATAAAT
CGATGGAATG GCGACAGACA TGAAGTCGCT TCGATTTCAG TTCTGCAGCG TGCTTGGCGT
TCTTTATCAG CAAGGCAAAA ATTTCGTGGC ACGAAGCTTT CCGTTTTAAA GGTTCAAGCG
ACTTGGCGAA TGTATGTTCA GAGAACTTCA TACCAAGATA TTTTGGAAAG GTTGGAAAAT
GAATCCTTTA GATCATTATT TGGGAATTCC AAAACTTTGC ACGACAACGA TGATTTTATC
GGTAGCCATA TATCCGATAG AGCAAACGTT GACCGTTTCC AAATCCTGCG CACAATCGAA
ACCAAGATAC ACGAGCGTAT CCCGTGAAGA TCCAGTCTGT ATGGCGTGGC CACAAAAGTC
GCCTTGTGTT GCCTCAGCAG CTGAGTGCTT GTCGATTCGC TGCATCTATA GTGAGTATCC
AAAGCTGGTG GAGAAGTTAT TCTACTAGAA GAGATTTACA ATCACTGGAG AAAGCAGCAG
TTAAAGTTCA ATCCATTTGG AGAATGCACT CTCTCCTTGG ATACCTCAAG ATCAGGAATG
CCTCCAGTAC CGTGATTCAG TCGGCATATC TTGGTTATGT TTTGAGATTA TGTTTGTTTC
GAAGAAGGGC GGCCATTCGA AGATTAGAGC AGAGATATAT TAAGAGACTT CGCAGAACCC
ATCAGGTCCG AGAAAATTAT GCGTCAGCAA TCCTCCAAAG TTGCTGGAGA ATGCATACAA
TAAGGGATGA CTACGTGTAC CTTCGATACA ATTCTATTAT GGTTCAGTCA TTTATAAGGA
AAGCTGTGGT TCGTACCAGA TTCTTGAAAG AACTGGCAGC CATTATAACC ATCG
 
Protein sequence
MGLEAMVRSD KEIAPAFPSV GDDRDSDTPS FVQLVANVHF VPRRRVPVMP LVLNRTKTTA 
DIKRTAPPDG AENNVHDETS PTELSASSCT PTATPTRESH VANANIATPD TEEWECDATP
LLLRRMRSLD IDWSDTSPFP SRQRPNRLDD DIETASMLSM GSTSFVSSTP RRNPTRPRNP
IAVIVTTPDF PSTRCRRASN TNTHSPSDSV FDRLYQHGRA KIRAERERST QYPSRTTNRS
ANTFRQRGSS GMSIISSDSP LHPSEDSIYE RLYRNETRSP RRLSTWSPSD SSSLQSSTSS
WQRRLDYTPP MDVRKRRQRS RLTSRSANST NASTAEATVN ANSVFERLYR REPRPRSNLY
SLPIHQRRLP SVNGTKRQIT ESIGCSAASH PENPSDSELE LLAHEISLLT KEGHVTDDGS
TSTTTRKDSE LSMLAQELAL LADGLEDEDS EDMRGFELVE PNLSSTWWIE SQIIALQAAW
RLRQRRAALN RERKYAITIQ AYWRVYQVRK QRWLAFTKIL EIQKVIRGYL ARQMFRNMIN
RWNGDRHEVA SISVLQRAWR SLSARQKFRG TKLSVLKVQA TWRMYVQRTS YQDILERLEN
ESFRSLFGNS KTLHDNDDFI GSHISDRANV DRFQILRTIE TKIHERIP