Gene PHATRDRAFT_44961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44961 
Symbol 
ID7199488 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011673 
Strand
Start bp802536 
End bp804620 
Gene Length2085 bp 
Protein Length682 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179067 
Protein GI219116544 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.114443 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAACG ATGACAGCGA TAAGCTAATT CATCCCATGA CGGCGGTTCC CGACGCCATC 
CGCATCGTAC TGACCGAGTG CGCCCGCGTG CTCCTGCAAA AATCGGATTT TGCTCCACCA
GTTTTGTCGA TTGATGCCCA TGTCGATGGT GGGTCCTTGC TAGGTCAGGT ACTTGCCGAG
CCAGTTGGCA TGTCTGAACC AGGGTATCCG CCCTACCGAG CAAGCATTAT GGACGGATTT
GCCATTCGAA CGACCGATCG GTTTCCATCG ACTTTTCCAA ATGGATCTTC CCCGCCGACA
ACGAAGCAAT GGACCCATAC AATTGCTGGA AAAGTGTTCG CTGGTAATAC TTCCCAAACA
AAAGATGACA CCGGTACCTC ACAGACAGTG GACGCGTGTC AGTTACCCAC AGCATACTAC
CTCACCACAG GAGCCGTCGT TCCTAATGAC TTTGACTGCG TGGTTCCTGT CGAAAAATGC
ACCGTCAACA ACAAGACAAA TCCGACTTTT GTGAATATAC AGGCAGTTCC GGCCGACATC
CAATCAGGTA AATGGATCCG CGATGTAGGT TGCGATATAC CAGCCGGTAT GGAAATGTTG
CCACGGGGTC ACGTTCTGGA TGCCGTCTCC CTGGGTTTGA TACGGCAATC TGGTTGTGAG
CGCGTAAGTA TTCGGCGTCG ACCAGTCGTG GGCGTCCTTT CTACGGGCAA TGAGCTACTA
GGTGACAATG TACGCGGGGA ACAATCTCGG CACGGTATGA TTCCGGATGT GAATCGACCC
GTTCTGCTTG CCACGTTGAA ATCTATGGCA AACTGTACGA CTGTGGATTT GGGGTTGGCT
CGAGACGATA GCGTCGACGA TATGGCATCC CACCTTCAGT CTGCATTGGA GCGCTGTGAT
GTAGTGATTA CAACTGGCGG AATTTCCATG GGAGAAACGG ACATTATTGA AGAAGTACTA
GTGGAACGAC TCCAGGGTAA AGTACACTTT GGTCGACTCC TCATGAAACC TGGTAAGCCC
ACAACCTTTG CTACGGTTGT GACTGCACCG TCCTCTGGGA CCAAGCTAAT CTTCGCCATG
CCAGGAAATC CTGTGAGTGC AGTCGTCTGT ACACACTTGC TGGTACAGCC GTGCCTCGAC
TTGCTCCATC ACGGACCGGA TAGTACAGCC GATACTTACG GAGAGAGTGT GGAAGAACAA
ATACACCGCG TTGTGTTGAA TGCCGCAGTC CATCCAGAAG TTCAGGCAAT TCTCAGTCAG
GACATCAGAC TGGATCATGA ACGACCCGAA TATCATCGCG TTCAACTGCG AGAACAAATG
CCGGGAGGAA GCGTTTTCGC GTTTAGCACG GGGGTCCAAC AGTCTTCCCG ACTCATGAGT
ATGCTTGGCG CCGATGCCCT TCTCATTTTA CCGCAAGGAA CGACGTCCAA ATCAACTGCC
AAAAAAGGTG AAACGTACAC GGCACTCCTA CTTCGTCATC GCAGTCGCTA CCCACAAAAA
CTCGTGACCG AGGCTCAGCA CTTGAATCCA ATCTCTTCAA CGGGAAACAA TATCCGGATC
GGCGTTGTCT TTGCCGCCCC AAGTGTGCAG TTGTTACTCA CCCCTACGCT AGAAGAGATC
ACCGAGTCGG TCCAAGCTGC AATGGCTGGA TCCAAAAAGA GCCACAGCAT AGAAATATCC
TCCACACAAC TGTACACAGG CAGCGCAACC AAAATTGAGA ATTTTCTTAA CTCCATACCG
CTGGATGTCG ATATTCTCAT CATTACTTAT TCAAAACGAC AATTTCGGTA TCAGCTGGCC
TTGGCCAATT CGCTGCGCCA TGCGCTGATC AAACGTGCCG ACTGGATAGC GTTGCAGGCT
CGTCAAGGTT GTGCCGCATA CGATCCTACC ACTGCCGCTT CTGAAATGGT GGTCGGTTTT
TGGGAACGAA CGAAAGAAAC CTCCTTATCC GATGCGATAG TGGTATGTTT GCCGGCCGAA
GGGGTCGGAG GATTGTCCCA TGTGCGGGGG GTTCTGCGAC ATGCGCTCCG CGTGGCACGC
GGGGCCGGTC ACTCTGATGA AGGCTATCCT CGGAAGGAAA GCTAG
 
Protein sequence
MTNDDSDKLI HPMTAVPDAI RIVLTECARV LLQKSDFAPP VLSIDAHVDG GSLLGQVLAE 
PVGMSEPGYP PYRASIMDGF AIRTTDRFPS TFPNGSSPPT TKQWTHTIAG KVFAGNTSQT
KDDTGTSQTV DACQLPTAYY LTTGAVVPND FDCVVPVEKC TVNNKTNPTF VNIQAVPADI
QSGKWIRDVG CDIPAGMEML PRGHVLDAVS LGLIRQSGCE RVSIRRRPVV GVLSTGNELL
GDNVRGEQSR HGMIPDVNRP VLLATLKSMA NCTTVDLGLA RDDSVDDMAS HLQSALERCD
VVITTGGISM GETDIIEEVL VERLQGKVHF GRLLMKPGKP TTFATVVTAP SSGTKLIFAM
PGNPPCLDLL HHGPDSTADT YGESVEEQIH RVVLNAAVHP EVQAILSQDI RLDHERPEYH
RVQLREQMPG GSVFAFSTGV QQSSRLMSML GADALLILPQ GTTSKSTAKK GETYTALLLR
HRSRYPQKLV TEAQHLNPIS STGNNIRIGV VFAAPSVQLL LTPTLEEITE SVQAAMAGSK
KSHSIEISST QLYTGSATKI ENFLNSIPLD VDILIITYSK RQFRYQLALA NSLRHALIKR
ADWIALQARQ GCAAYDPTTA ASEMVVGFWE RTKETSLSDA IVVCLPAEGV GGLSHVRGVL
RHALRVARGA GHSDEGYPRK ES