Gene PHATRDRAFT_41238 details

Gene Information

Locus tag: PHATRDRAFT_41238
Symbol:
ID: 7199061
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011696
Strand:
Start bp: 274194
End bp: 276580
Gene Length: 2387 bp
Protein Length: 738 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002185164
Protein GI: 219130002
COG category:
COG ID:
TIGRFAM ID:
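
The coordinate and length fields above are linked by simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (consistent with the listed 2387 bp) and that the protein length does not count the stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 274194
end_bp = 276580
protein_length_aa = 738

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the coding sequence ends with one stop codon that is
# not counted in the protein length.
coding_length_bp = (protein_length_aa + 1) * 3

# The gene is marked as spliced, so the remainder is attributed to introns.
intron_length_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, intron_length_bp)
# prints: 2387 2217 170
```

Under these assumptions, about 170 bp of the gene span would be intronic, which fits the "Is gene spliced: Yes" field.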


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGAAG AGAAGAACAA TGCTTGCGAC TTGATGTTGG ATCGCTTAGC GCAGAATGTG 
TCGCGGAATC CTGCAAAACG AGCCGTTGCA TTCTTGGCCG GGGGGCGCAA CGGAGGGAGT
CTCCAAAAAG AGTTGACTTA TCAAGAACTG GAGACCGAAA CAACGAAAGT AGCGAAACAT
CTTATTGGGA AGGGTATCGA AAAAGGAGAA TGGTAAGTAG CAAAGTGGCT GGCGGATAAA
GAGTGAATGA AGCAACGCCG AAGCATCCTG TCTCATGCCT TTTCCTTTCC GTTCTTCTAT
TTAGCGTGGT GTTAGTCTAT CCTCCGTCAC TGGACTTTAT GATAGCATTT CTGGCGTGTC
TCAAGGCCAA TGTCGTGGCG GTACCTGTGT TCCCGCCAAA TCCCCTCCGT CGCGATACCT
TGGCGATGTT TGCCAACATT GTACAAGGAT GTGGTGCCAA ACACGCCTTG ACGAATACCG
AATACAACCA CGCCAAAAAA ATGGCCGGCA TCCGTGACGT TTTTACAAAG TTTCAACGTC
CGACCTCGGG GTGGCCAGAT GACTTGGATT GGACTACAAC GGACACTCTC AAAGAGCCGA
GAAATTCTGT CAACTTGCCG CAACCCCCTA GCGACCGATC GCAAGTCGCC TTTTTACAGT
ACACAAGTGG ATCGACCAGC GAACCCAAAG GTGTAATGAT TACGCACGGC AATCTTGCTC
ACAATCTGAC CATCATTACT AACGATTCTC AAGCCAAAGA TGATACAGTT GTCGTATCTT
GGCTTCCCCA GTATCACGAT ATGGGCTTGA TTGGTTCGTA CCTTGGCGTC TTGTTTTGTG
GTGGGACGGG GTACTATCTG TCACCCCTCT CCTTCCTACA ACGACCCATG GTCTGGATAG
AGGCAGTTTC CCGGTATCGG GCCACCCACT TACAAGCTCC CAACTTCGCG TTTAAGTTGA
CTGCGCGCAA ATTCAGTATC GATGCCTCGA ACACTGAACT TGACTTGTCC AGTGTCCGGC
ATGTCATCAA CGCCGCCGAA CCAGTTGATG AAGAGTCTAT CGATAACTTC TACAGGATTT
TCGGCAAGTA CGGATTTGCA AACGTTATTT ATCCCACCTA CGGTTTAGCA GAACATACTG
TCTTTGTATG CTCGGGTGGC AAACAACGCC TCACTGTGGA CAAAGCCAAA CTCGAAATTG
ATGCCAAAGT TGTTATTTTA GAGGACGACG ATCACCAAAG CACTGATATC AAGGCTGTTT
CCAAGCTTAT TGGCTGCGGC TTCCCTTCTC GTCAAAACGT TGACGTTCAA ATAGTGGACC
CAGAAAGTTG CAAAGCTTTG GCTGGAAACT TGGTCGGTGA GATCTGGATT CGTTCGCCTA
GCAAAGCAGC CGGCTATTTC AACAAGCCGA AAGAGACAAA AGAAGATTTT CACGCGGGTC
TTGTCAGTGA CGACGGTAGC AGCATTGGCA ACGCGGTAGG TGGTTACCTG CGCACTGGAG
ACCTTGGCTT TCTACACAAA CATGAGCTTT TTATTTGTGG TAGGCTGAAA GATCTCATTA
TCGTCGGTGG CCGGAATTAC TACCCACAGG ATATAGAGGC GACAGCTGAG GCTTCGTCGG
ATCTAGTGCG ACTAGGGTGC TCTGCTGCTT TTACAATCGA TCCAACCCAT GAAGGTGGCG
AGGAGGTTGC GCTTGTTATG GAACTCAAAG AAGCGCCATC TTTGAAAGCT ACTCAGACAG
TTTGTGAATC ACTGGCGAAC CAGATCAAGT CCGCTATCAA TCAAGAACAC TCCTTAGGAC
TGACAGATAT TGTGTTTTTG CACCCGCGCA CGGTTCCGAA GACGAGCAGT GGGAAGATTG
CACGGTCCTG GTGCCGAAAG GGATTCATCG CAGGATCATT AAAGATAATC TTTCGCAAAT
CATTCAAGAG TCAATCATTT TCACTGGAGA TGGAGGAGAC AACATTTGAC ACTCCTACGC
CTCGTCCGGT GTCTTCGGAT CAATCAAGTA AAATTCGAAG CATGGACAAG AAAGAGATTC
TCGCCAAGCT TTCGACCGAT ATCTCTCGAG TCGCATCTAT TTCTCCCGAT GCGTTGGACA
AAAGCGCAGC TCTCATATCC ATGCTCGACA GTCTTTCGCT CTCTCAATTC AAAGGTATGT
TGGAGAACAG TTATTCGGTC GACATCTCGG ACGAGTATCT TTTTCGCGAA TCCACGACCT
TACTGAAACT AGTGGAAGTG GTAAAATTAG GTTACGCGCC TGATGACGAA GCAAATACTA
CCCCTGCAAC CTCTGCGTCA AACGGAGCCA TTTCAACGCC TGGTCAAGCT AAAGGCATTG
CTGGGGTTTT GGGCTGTCCA CCCGGAGTGG TGTGTACAAT ACTCTAG
 
Protein sequence
MSEEKNNACD LMLDRLAQNV SRNPAKRAVA FLAGGRNGGS LQKELTYQEL ETETTKVAKH 
LIGKGIEKGE CVVLVYPPSL DFMIAFLACL KANVVAVPVF PPNPLRRDTL AMFANIVQGC
GAKHALTNTE YNHAKKMAGI RDVFTKFQRP TSGWPDDLDW TTTDTLKEPR NSVNLPQPPS
DRSQVAFLQY TSGSTSEPKG VMITHGNLAH NLTIITNDSQ AKDDTVVVSW LPQYHDMGLI
GSYLGVLFCG GTGYYLSPLS FLQRPMVWIE AVSRYRATHL QAPNFAFKLT ARKFSIDASN
TELDLSSVRH VINAAEPVDE ESIDNFYRIF GKYGFANVIY PTYGLAEHTV FVCSGGKQRL
TVDKAKLEID AKVVILEDDD HQSTDIKAVS KLIGCGFPSR QNVDVQIVDP ESCKALAGNL
VGEIWIRSPS KAAGYFNKPK ETKEDFHAGL VSDDGSSIGN AVGGYLRTGD LGFLHKHELF
ICGRLKDLII VGGRNYYPQD IEATAEASSD LVRLGCSAAF TIDPTHEGGE EVALVMELKE
APSLKATQTV CESLANQIKS AINQEHSLGL TDIVFLHPRT VPKTSSGKIA RSWCRKGFIA
GSLKIIFRKS FKSQSFSLEM EETTFDTPTP RPVSSDQSSK IRSMDKKEIL AKLSTDISRV
ASISPDALDK SAALISMLDS LSLSQFKVEV VKLGYAPDDE ANTTPATSAS NGAISTPGQA
KGIAGVLGCP PGVVCTIL
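
The sequences above are printed in space-separated blocks of 10 characters, so they need cleaning before any length or GC calculation. The following is a minimal sketch of that step. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as a demonstration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence, used only as a short demonstration.
first_line = "ATGAGTGAAG AGAAGAACAA TGCTTGCGAC TTGATGTTGG ATCGCTTAGC GCAGAATGTG"
dna = clean_sequence(first_line)

print(len(dna))                   # number of bases on that line
print(round(gc_percent(dna), 1))  # GC percentage of that line only
```

Running the same two helpers over the full gene sequence should give values comparable to the Gene Length and GC content fields, assuming those fields were computed over the whole gene span.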