Gene PHATRDRAFT_43023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43023 
Symbol 
ID7196829 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1822796 
End bp1824707 
Gene Length1912 bp 
Protein Length524 aa 
Translation table 
GC content44% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177380 
Protein GI219111257 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000014504 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TACCATGACA TTTCCAGTAG TGCTTATACC ATCGCCTGTT CACGAGGCTT TCGGAAGTAT 
TCAATGGGAT TCATTTTTAC ATTTAAAGCT TTCTAATCCA GAAAAAGCGC TTTTACAAGC
CGGAGAACAG AACGTTATCG AGTAAGGCAT AAAGGGATTT GTCGTACAAT TTGAGAAGAA
TATCTGGACG TCTTACACTC TGTTTTTCTG GAACACAGGT ACACGCTGTA CGACAAAGAA
GATGCTGCAG CTTATGTGCA GCTCCTGCTT AAAGTTCTCG ATCAAATAAC TAGTCATCGT
GGAACATCGT CGCGTCGAAC GTCACTCAAA GTTTCCAAGC TCTCCTTGGC CGAAAGTTTG
CCACCCGTTG ACGCGCTGCA GTACCTCGAC TGTGACGGCA TTGGAGTGGC CACACACTAT
GTCATCTCGC AACTTTATGA AGTCATCACG ACACTCAAAA GTAACGCTTT CGCTTCACGG
ACCAAAAGCA ACAGTGTCGT TGCTAGCAGC AACTTTGTGT CGAAAGCCAC TTTGTCCAGC
ATATTCTATC CATCAGGCAT TTTGATTGAC GACTGGCGAC CTCTTTTGCG TATTCTACTG
GGTACAAAGA GCGATTCTTA TGTGCACAGT ACGTTACAAC AGATTTAGTG CTTAAAAATT
GTTTTTATCC GCTTTAACTC ACCTTGACTT TTTCTCTGAA ATTATAACAC AATCAACGTC
AAGGAGGATC AGGTTTCTGT CTGGCGTGTA TCCTCTTGGA AGGATGCACT CTGCAAACAA
ACGGCCATCT ATTTTCGCCA ATATCATCAA TTTTGGAATC CTTTGTTTCG TGGATTGTGT
CTCGGCTACA GAGTTCTAGC ACACAATCTC TTTCTATGGT TACCTCAAGC TTGACTGTCA
TCATCTTGTC AAAAGAAGTT CGGCATCTCT TTGGCAAAGC GGGAGGTGTT GGATATCTGA
GTCGGCGCTT ACGCGTTCAT CAAAAATCAA TTGATTCAAA GATTAGTGCT TCCGTGCAAC
ATCAATACGA ACTCTCCTAC TGCCTCTGGA TCATGTCGTA CGATTGTGAC ACATCGGTAT
CAATGCGAAG CCACTTTCAT AGAGACGGAT CAGTCCAAGC ACTGGTAGAT ATGGTGGCCG
CTGCACCTCG CGAGAAGGTT GTGCGGTGTG CACTAGCAAC CCTGCGTAAT TTGGCAACGT
GCGCTGCGGA CGAAGCGCCT TTGGAATTGG CAAAGAAAAA CATCAATGGT TCAACATATT
TGATTGATAT GATAGGTTGT GGTCTACCCA AGTTGATTGA CCTGATGATG AATTGCCCAA
TTGCTGACTT CGAGATTAGC GAAGGTATGC AATGTTCGGT CCCTTCCCCA TTTTCAAAAG
CGGAACCGTA TCTAAACAGG TTGATGATTT GTGTCTCCTA TATGACAGAC TTGGACATTC
TCCATAAACT TTTGCACGAA ACTTGTCAAG AGCTAACACG TTGGGACGTC TATAAAGTGG
AGCTTGATTC CACTAACTTG ACATGGGGGA TAGTCCACAC CGAAAAGTTC TTTCGCGAAA
ACGCCCGAAA AATGGAAGGA TCCGATGGAA AGTTTGAAAT GGTGAAGACC CTAATTCAAT
TGACTGCATC GGATAGCGAG GATGTCGCCG CAATTGCTTG CTTTGATTTA GGGGAATTTG
TTCGTCATTA CCCTAATGGA AGAGACATTG CTCGGCGCCT TGGTGCGCGG GATTTTGTTT
TCCCGCTCAT TGAGCACGAA AATCCCAAAC TACAGCATCA AGCATTAACT TGTATCTCAA
AATTGTTAGT CCAAAATTGG AAGGTGAGCA GTTACATTGA CAGAGATAAA CCTGAATGTG
TTCCCCTAAC TCCAGCACCC CTACCTCTTG GATCTTGCAG TCATTAGGAT AA
 
Protein sequence
MTFPVVLIPS PVHEAFGSIQ WDSFLHLKLS NPEKALLQAG EQNVIEYTLY DKEDAAAYVQ 
LLLKVLDQIT SHRGTSSRRT SLKVSKLSLA ESLPPVDALQ YLDCDGIGVA THYVISQLYE
VITTLKSNAF ASRTKSNSVV ASSNFVSKAT LSSIFYPSGI LIDDWRPLLR ILLGTKSDSY
VHRGSGFCLA CILLEGCTLQ TNGHLFSPIS SILESFVSWI VSRLQSSSTQ SLSMVTSSLT
VIILSKEVRH LFGKAGGVGY LSRRLRVHQK SIDSKISASV QHQYELSYCL WIMSYDCDTS
VSMRSHFHRD GSVQALVDMV AAAPREKVVR CALATLRNLA TCAADEAPLE LAKKNINGST
YLIDMIGCGL PKLIDLMMNC PIADFEISED LDILHKLLHE TCQELTRWDV YKVELDSTNL
TWGIVHTEKF FRENARKMEG SDGKFEMVKT LIQLTASDSE DVAAIACFDL GEFVRHYPNG
RDIARRLGAR DFVFPLIEHE NPKLQHQALT CISKLLVQNW KSLG