Gene PHATRDRAFT_48204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48204 
Symbol 
ID7203331 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011684 
Strand
Start bp503602 
End bp506712 
Gene Length3111 bp 
Protein Length786 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182547 
Protein GI219124515 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.3526 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGCCGC CTTCTAGCCT ATCAACCTCA CAGCGAGATG CCCTCCGTGC ACTTACTGAT 
GGATTCTTAC CTCCTTTATC GGTTCCTCAG GAGTATCAGC CCAAGTATGA GGCCCAGAAC
CGCATCCACG AAGAATATTG GAGTCGACGT GTCTCCGCTG ACTCTGACTT TTTGAAGGCC
CTAGAAAGTT CCATTGTCGA CAAGCTTCCC GCCAGAGAAT CTTTTCTTAC TCGATTTCTT
TTAAAGGCTT TGTCAACTAG TGTTGGAACC GCGTTGTTGT TTGGTAAACC CACGCTACAC
CCCTTTACGG AATGGCCCGT TCCACAGCAG ACGGCTCTGT TGCAATCATT GAAAACCAGC
TCGATTGCTA CGAGACGGCA GATTTTCAAT GGATTCAAAC GCTTAATATG TGGATTAGCA
TATTCCTATA CGGTCAATGG AACAAATCCT TTTTGGAAAG CAGTTGGGTA TCCCGGGCCA
GCCCAAAACA CTCTCCTTTC CAAGCGAGAG GATACGGCTC TCGTCGCCAA GACGATGGAG
CAGCAACGAC CGATCCGGGA AGCTCTCGTT CCCATTGATC TGGATACGGA ATATGAATGC
GACATTGTGA TCGTAGGATC GGGATCAGGC GGAAGTGTAG CCGCTAGCGT TTTGTCAGAG
GCGGGTTATC AAGTGCTCGT TCTGGAAAAA GGCACTTACA TTGCGCCAGC AGATATTTCC
AACGAAGAAG CCGACGCGTT GGATCGCATG TACGAGACAC ACGGATTGCT TACAACGAAA
GATGGCTCCA TGATGATTTT AGCCGGTGCG ACATTGGGGG GTGGAACGAC TATCAACTGG
AGTTGCTGTT TGCCTTTACC CTCGTACGTT CGGGAGGAGT GGCGTTCTGA GCATGGTCTC
GTGGACGACT TTAAAGAGGG AGGTGAATTT GAAACTTCGA CGCGAGAGAT TCTCAGTCTC
ATGGGTGTCA CGAATAAGAT TACACACAAT GCGCTGAACC AGAAGCTTCA GCAAGGTTGC
GATGCTCTGG GATACGAATG GGAAGCGAAT TACGTAAATT TGCTTCAAAC TGCCAACGCA
ACAGCAGGCT ACATTTGCTT TGGAGATCGA TACGGCATGA AACGCGGTGC CTTGTCTGTT
TTTCTTCCCA AAGCCATTTC TTACGGTGCA AAACTAATCG AAGGATGTCA CGTCGAACAA
GTTATTCTCG GAGAGGGAGA AAATGGTCGT CGGAGGGCTG AAGGCGTTCG ATGCAGTGTG
GGAGCCCACC GACTTCACGT CGTAGCAAGA AAAGCCGTTG TCGTCGCTGC AGGTGCTCTA
CATACGCCCT GTCTATTGCG GCGTTCCGGC CTAAACAATT CGCATATTGG AAAGCACCTT
CGCTTGCACC CTGTAACTGT TGCCGCTGGC TTCTCCAAGC CGACTGATCC TATCGAATGC
TATCAGGGTG CGCCTTTGAC CACGGTCTGC AACCAATTTT CTCATGGCCC CGCCAACGAT
GGATATGGGG CAAAGATTGA ATGTCCAAGC GCGCATCTTG GCCTTTTAGC TGCAGGCTTG
CCTTGGACAA ATCCTGAACA ATTCAAAGAT AGAATGCTTC GTATTCGAAA TGGTGTGGTT
TTCATCATCG TTCAGAGGGA CAAAGGCGAA GGCACTGTTT CGCTTGCTCG AGACGGAGCT
ACCCCAGTTG TGGAATACTC TGTATGCCCA GCTGACAAAG TTAGCATGCA ACAGGCCGTC
TGTGGTGGAG TGCGGATTTG TATCGCATCG GAATCCACGG AGGTCACAAC TGCGCACAGT
CTCGATGAAG GTATGCACAT CTCTGACGGA GATTTTTTGC AGGAATATCT TTCAAAATTT
ACGGCTTTGG GACTGAAGGA AAATGAGGTT GCATTGTTCT CTGCCCACCA AATGGGATCT
TGCCGTTTGA GCGCGACCCC CCTTTCTGGC GCGTTGGATC CGAATGGGGA AGTTTGGGAA
AGCGACGACT TGTATGTTAT GGATGCAAGT ACGTTCCCAA CTGCTTCTGG GGCGAATCCA
ATGATAACGG TTATGGCAAT ATCGTTGATG CTAAGCAATC GCCTCGCTTT ACGGCTACAA
CACGTGGACT ATAAGCTTCG TCGAGCTGGA GATATTCAAA AAGCGGAAGA AATGGCGAAG
CGCCGACTAG AGCTGCGAAA TACTTTTTCT ATATCGCCCG AGAAAAACAG TTCTGCTGAA
CGACCCGGTG CACATTGGAA CAGAATTGTG GATAAATCCT TGTCGATACT CATTTTGTTA
ACCCTTATGA TACCGATCTT GCGGTCGTGG TTTTTCGATG TCCCGCTCGT CCAGGATCTA
GTCAAGCATC CTATAATGTG AGGATATGGA GGTTGCTTCT TCGCTATCCT TGTATCTTCA
GTACAATATT CAATGATAAC ACCGGGGAGG CGGATGGGGA GGAATCCGCA AAATCAGGGA
ATCTTTTGTC AATATAAGAC GTTAATGTTT AGAGCAGAAG CGACAAGCCC CGCTGAATCG
AAGGGAAATC AGTCTATGGA TGTCAGCTTT CCGAAGATGA TGCGTTCGCC ACTCACATCG
ACCTCTTCAA AATACATCAA AATTTAAGAA CGAGTGGACC GATTATAAGG GCTTTACTCA
GTCGAGACCA TAGCTGTGAA CTAAACCAGG AGTCCATTTT CATATGTTTC CGATTATCTG
TTAGAGCAGC TATGCTGTCA TTGATCGCTA AGCTACAGAC TTCGTTGATG CGACGTCTCG
TCCAAGGTTG TCAGGTACCA CAAATAGCTT TTTTGCAACT CAACTGAAGC CCGAAAAAGT
CATTGCCCCT TCCACGTTAC AATATTACGC ACAATGAAAC ATGAAAGGAG TTCTTATGCA
GCTGGGGCGA ATTTTGGCCA GCTTGGCTCT GAAAGCAACA AATTCAGATA AAAGAAAAGG
TACTCCTCTG AGTTTAGCAG CTTTGAGTAA TCCGTTATGA GAAAAGTCAT CAGGGTTTGT
TGACGACAGT GAGACGCTTC TGAAGCGTTG TAAAAGTGCG AATGAACGTG TCCCAGATTG
TTGAAGAGAG CCAAAATAAG AAGGTTAAAA GACGCCGCAT AATTTCGTTG C
 
Protein sequence
MGPPSSLSTS QRDALRALTD GFLPPLSVPQ EYQPKYEAQN RIHEEYWSRR VSADSDFLKA 
LESSIVDKLP ARESFLTRFL LKALSTSVGT ALLFGKPTLH PFTEWPVPQQ TALLQSLKTS
SIATRRQIFN GFKRLICGLA YSYTVNGTNP FWKAVGYPGP AQNTLLSKRE DTALVAKTME
QQRPIREALV PIDLDTEYEC DIVIVGSGSG GSVAASVLSE AGYQVLVLEK GTYIAPADIS
NEEADALDRM YETHGLLTTK DGSMMILAGA TLGGGTTINW SCCLPLPSYV REEWRSEHGL
VDDFKEGGEF ETSTREILSL MGVTNKITHN ALNQKLQQGC DALGYEWEAN YVNLLQTANA
TAGYICFGDR YGMKRGALSV FLPKAISYGA KLIEGCHVEQ VILGEGENGR RRAEGVRCSV
GAHRLHVVAR KAVVVAAGAL HTPCLLRRSG LNNSHIGKHL RLHPVTVAAG FSKPTDPIEC
YQGAPLTTVC NQFSHGPAND GYGAKIECPS AHLGLLAAGL PWTNPEQFKD RMLRIRNGVV
FIIVQRDKGE GTVSLARDGA TPVVEYSVCP ADKVSMQQAV CGGVRICIAS ESTEVTTAHS
LDEGMHISDG DFLQEYLSKF TALGLKENEV ALFSAHQMGS CRLSATPLSG ALDPNGEVWE
SDDLYVMDAS TFPTASGANP MITVMAISLM LSNRLALRLQ HVDYKLRRAG DIQKAEEMAK
RRLELRNTFS ISPEKNSSAE RPGAHWNRIV DKSLSILILL TLMIPILRSW FFDVPLVQDL
VKHPIM