Gene PHATRDRAFT_39762 details

Gene Information

Locus tag: PHATRDRAFT_39762
Symbol:
ID: 7195340
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011688
Strand:
Start bp: 646549
End bp: 649544
Gene Length: 2996 bp
Protein Length: 917 aa
Translation table:
GC content: 61%
IMG OID:
Product: predicted protein
Protein accession: XP_002183645
Protein GI: 219126817
COG category:
COG ID:
TIGRFAM ID:
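
The listed gene length is consistent with the start and end coordinates if they are read as 1-based and inclusive. The following is a minimal sketch under that assumption; the function name `span_length` is hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the record above.
gene_length = span_length(646549, 649544)
print(gene_length)  # 2996, matching the listed gene length
```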


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000929997
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCCCGA CCGCCGACTT CACCATTTCC GACTTTCCTC ACAAAGTCCT CGATCCCATC 
GCCACCGACA CCACCGCTCC CTCGTATGCG TCGCTTCTCC TGGCCCAACG CCAGCTCTCC
GCCAACGCGT CCGCCATTCC CAGCCTTAAC GGCGGCGGGG CCCATGGTCA CATGGCCCTC
ACGCTCACTG CCGAAGCGTA CGCCGAACTC TCCGACATCC CTTTTGTCAT CCCCGTTGCT
CCCCCTGCCG ACCCTGAACC CGGCACCACG CAACCTCAAA TAACGGAGAA CAACCGACTC
CACAAACGCG CTGTGGCCAT CCACAGCCTC TACGTGGCGG TCAACAACGC CCTTCGTCGC
CAGATCCTCG ACGCTGTTCC TCGTGTCTAC GTTCGCGACC TGGAACACCC CCAGTTTGCT
TACAGCCACG TTTCCTGTCG CGACCTCCTC GACCATCTCT GGCGCAACTT TGGTACCATC
TCCGCTTCGG ACCTTAAGAA CAACATTCAG TCCATGTACA CCCCGTGGAA CCCAGCTGAC
CCCATCGAGA CCATTTTTCA TCGCTTAACC GACGCCATCG CGTACTCGAC GGCGGGACGT
GACCCCATCA CCGAGGCTGC CGCCGTTCGC GCCGGCTACG ATGTTCTCGA GCACTCCGGC
CTTTTTCCTC GTGCCTGTGA AACCTGGCGC ACTGCCTTGC CGGCCACCCA TACGCTTGCC
AATCTGCGCG CCGTCTTTAA GGTCGCCGAC ACTGACCGCA AGCGTACGGT TACCACCGGC
TCCCTAGGCT ACGCCAACGT CCTTGCCACA GCTCCATTGG TTCTCCCGTC GCTTGCGCCC
GACTCGCTCA GCCTTCCTTT TTCAGCCCTC TTGGTGTCAC ACTCCTCTGC TGCCCTCTCT
GAGCGAACTT ATTGCTGGAC CCATGGGTCC AGCAATAACC GTCGGCACAC TAGTGCCACG
TGCAAAAACA AGGCCCCTGG CCACCGCGAC GACGCGACGG CCACCAACAC CCTTGGCGGC
TCCACCAAGG TTTGGACTGC CCCCAAGCCT CCTGAATAGG AAAGAGGGAC GGCTACGCCA
ATGATTAAAA CTAGTAATAC CGATTCTCTC AATCATATTA CTAGTCTTAA CTCGTCTGTA
GTCCCCTCCC CGCCTAGTAC CCACACCTCT GCCATTGCCG ACACCGGCTG CACAGGCCAC
TACATTACGA TCAACTGCCC TCACACGCAC CGGCACCCAG CCAACCCCAG CCTCTCCGTC
CGTGTCCCGA ATGGCTCTGT CCTCCGCTCC AGCCATGTTG CCACCCTGGA CCTCCCTGGT
TTCTCCCCTG CCGCCTGCCA AGCCCACATT TTTCCTGGGC TCGCTTCCCA TCCGCTCCTC
TCCATCGGTC AACTGTGCGA TGACGGCTGT ACGGCAACCT TCTCGGCCAC TCGCCTTGAC
ATTCATCGCG ACGCCACCCT GCTGCTCTCT GGTGCCCGCT CCCCCCACAC TGGCCTCTGG
CACCTTGATC TTACCCCTCC CAAGCCCCCT GCTACAGCCC ATGCTCTTGT TCCAACCACC
CCCCTCGCCG ACCGCATTGC TTTTGTTCAC GCCTCGCTCT TCTCCCCGGC TCTCTCTACC
TGGTGCCAGG CCCTCGACTC CGGCCATCTC GCGACTTTTC CAGACCTTTC CTCCCGCCAG
GTCCGCAAGT ACCCACCCAG CTCCCCCGCG ATGATCAAAG GTCACCTCAA CCAACAACGC
GCAAACCTGC GCTCCACCAA GCTTTCCCCT GTCTGTTCCC CTCTCTCGAC GGAACCCCCT
GCCGTCGCTG TGCCCGACCT CGATCCTCCT GACGCCCACC CTGTTGCACG CACACACCAC
GTCTTCGTTG CCCACCAACG GGTCACCGGG CAGATCTACA CCAACCAACC GGGCCGTTTC
TTCACTCCCT CCAGTGCCGG ACACAACGAC ATGCTTGTCC TTTACGATTA CGATAGCAAC
GCCATCCATG TTGAACTCAT GCGGAACAAG TCAGGACCCG AGATTCTTGC CGCCTACCAA
CGTGCTCACA CCCTTTTTAC CCAGCGCGGC CTGCGTCCCC AACTTCAGCG CCTCGACAAC
GAAGCCTCTA TAGCCCTCCA AGCCTTCATG ACCTTAGAGC AGGTCGACTT TCAGCTCGCA
CCCCCCCCCC CCCCCATCTG CACCGTCGTA ATGCCGCCGA ACGGGCCATA CGCACCTTCA
AGAACCACTT CATTGCTGGC CTCTGTACCA CAAACCCGGA TTTTCCCCTT CATCTTTGGG
ACCGACTCCT CCCACAGGCC CTCATTACCC TCAATCTTCT TCGTCGCTCC CGCATCAATC
CCAAGTTGTC CGCCCACGCA CAACCTTACG GTGCCTTTGA CTACAACCGC ACCCCGCTTG
CTCCTCCCGG CACCCGCGTC TTAGTCCATG TCAAGCCCGC TGTTCGCGAA ACCTGGGCCC
CCCATGCTGT CGAAGGTTGG TATCTCGGCC CCGCTCTCAA CCATTATCGC TGCCATCGCG
TATGGATCAC GGAAACACGT GCCAAACGTG TTGCTGACAC CCTTTCCTGG TTCCCGACCC
GCATTCCCAT GCCCGCCCTT TGTCCACCGA CCGCGCCCTG GCCGCCGCCC GTGACCTGGT
CCATGCCCTC CAGAATCCTT CCCCGGCGTC TCCGTTCGCC CCCCTCGATG CCACCCAGCA
CCAGGCACTC ACAGATCTTG CCACCCTCTT TGCCACTGTG GCCGCCCCAG CCGACGACAT
CCCTGCACCC GCTCCCGTGC CTCCGGTCCG TCCCCCTGCC CCAGCAACTC CCCTTGCTCA
GGTCCGTTTT GCCGTTCCTC TTGTCACGGC CGAACATGCC CCGGCACTTC CGAGGGTGCC
CATTCCGGCC CCAGCACTTC CGAGGGTGCC CACCCTGGCC ACCTATCACT CTCGCACCGG
CAACCCAGGC CGTCGCCGCC GCAAAGCACG CACACAACCG GCAACCCCAA CCCTAG
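
The gene sequence above is printed in space-separated groups of 10 bases, so it needs cleaning before any analysis. The sketch below shows one way to strip the whitespace and compute length and GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input; pasting in the full block should let the listed gene length and GC content be checked.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a sequence printed in 10-character groups."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among all bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used as a small example.
first_line = "ATGTCCCCGA CCGCCGACTT CACCATTTCC GACTTTCCTC ACAAAGTCCT CGATCCCATC"
seq = clean_sequence(first_line)
print(len(seq))                    # number of bases after cleaning
print(round(gc_fraction(seq), 3))  # GC fraction of this line only
```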
 
Protein sequence
MSPTADFTIS DFPHKVLDPI ATDTTAPSYA SLLLAQRQLS ANASAIPSLN GGGAHGHMAL 
TLTAEAYAEL SDIPFVIPVA PPADPEPGTT QPQITENNRL HKRAVAIHSL YVAVNNALRR
QILDAVPRVY VRDLEHPQFA YSHVSCRDLL DHLWRNFGTI SASDLKNNIQ SMYTPWNPAD
PIETIFHRLT DAIAYSTAGR DPITEAAAVR AGYDVLEHSG LFPRACETWR TALPATHTLA
NLRAVFKVAD TDRKRTVTTG SLGYANVLAT APLVLPSLAP DSLSLPFSAL LVSHSSAALS
ERTYCWTHGS SNNRRHTIPS PPSTHTSAIA DTGCTGHYIT INCPHTHRHP ANPSLSVRVP
NGSVLRSSHV ATLDLPGFSP AACQAHIFPG LASHPLLSIG QLCDDGCTAT FSATRLDIHR
DATLLLSGAR SPHTGLWHLD LTPPKPPATA HALVPTTPLA DRIAFVHASL FSPALSTWCQ
ALDSGHLATF PDLSSRQVRK YPPSSPAMIK GHLNQQRANL RSTKLSPVCS PLSTEPPAVA
VPDLDPPDAH PVARTHHVFV AHQRVTGQIY TNQPGRFFTP SSAGHNDMLV LYDYDSNAIH
VELMRNKSGP EILAAYQRAH TLFTQRGLRP QLQRLDNEAS IALQAFMTLE QVDFQLAPPP
PPICTVVMPP NGPYAPSRTT SLLASALITL NLLRRSRINP KLSAHAQPYG AFDYNRTPLA
PPGTRVLVHV KPAVRETWAP HAVEGWYLGP ALNHYRCHRV WITETRAKRV ADTLSWFPTR
IPMPALCPPT APWPPPVTWS MPSRILPRRL RSPPSMPPST RHSQILPPSL PLWPPQPTTS
LHPLPCLRSV PLPQQLPLLR SVLPFLLSRP NMPRHFRGCP FRPQHFRGCP PWPPITLAPA
TQAVAAAKHA HNRQPQP
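
The record marks the gene as spliced, and the lengths above allow a rough consistency check. The sketch below assumes that the gene span runs from the start codon through the stop codon (the gene sequence begins with ATG and ends with TAG) and that every remaining base is non-coding. Both points are assumptions, not statements from the record, and `coding_nucleotides` is a hypothetical helper.

```python
def coding_nucleotides(protein_length_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record above.
gene_length_bp = 2996
protein_length_aa = 917

cds_bp = coding_nucleotides(protein_length_aa)  # 917 codons plus one stop codon
non_coding_bp = gene_length_bp - cds_bp         # bases in the span not accounted for by codons

print(cds_bp)         # 2754
print(non_coding_bp)  # 242
```

Under these assumptions, about 242 bp of the 2996 bp span would not be translated, which fits a spliced gene.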