Gene PHATRDRAFT_42685 details


Gene Information

Locus tag: PHATRDRAFT_42685
Symbol:
ID: 7196335
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 786302
End bp: 789485
Gene Length: 3184 bp
Protein Length: 847 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002177162
Protein GI: 219110821
COG category:
COG ID:
TIGRFAM ID:
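
The Gene Length value is consistent with the Start bp and End bp values if the coordinates are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. A minimal Python sketch of the arithmetic, using a hypothetical helper named span_length:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = span_length(786302, 789485)
print(gene_length)  # 3184

# 847 residues need 847 * 3 coding bases, fewer than the gene span,
# which fits a gene flagged as spliced.
coding_bases = 847 * 3
print(coding_bases)  # 2541
```

The difference between the two numbers is not broken down in the record, so the sketch does not attempt to assign it to introns or other features.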


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0260166
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCTGG CCGACTTTTC ATCACCCTTT CTGGTGCTGG TCAGCCATGC AATTGCGATG 
GGCAAAGGGG TGATGGCACG AAGCGCCACT GATTCACTGT ATTCGACAAA TGGTAACTCG
ACACACGTTG GTGCAGTCTC CACGAAAAAA GCAAAACGGC GATTGGACAC CGACGGATTT
GCACCTTTAT CATGCAACGC CAACTTGGAT ACGACCTCAT GTGCGTCATT CACGTCGATT
TTCGGAAACA AGGCTGTATA TTCATCTCGA CTCATCGTTC CCTGTGGCTC ATGTATAACT
ATGGACCATC CTGGACCGAC ATTGACCTTC GGCGATGGGC TTGATATTCA TGGAAAGCTC
GTCTTTCCCG ATAGGTACGA ACTCATTGTC AGGTCGACCA TGATTCTGGT ACAAGGCGAG
CTTCACATTA CCGCTGTAAA ACCGGTCGAT GGAAATCCGC AGGTCAAATT TGTCATGCAT
GGTGACAAGG ATGAGTTCTT CGAGCCTGCT GACAGCAATT CCAACATATG TGGCGATGGT
CTTTGCGAAG CTGGGAGTCG GTCTATCACT ATCGCGGGCG GAAAGCTTAG CCGTAAGTAA
AGTACTCTGC TAGCAATTCC GTCAGAGAAA CTGATGATGC TTTTTTCTAA TCTACTTCTA
TCGTTCTCCT GTTCACGGTG CAGTCCGTGG TGTGCCCGCC GAGACTCCAA CTTTCATGAA
TTTGTACGAT ATGGACGGAG ACTCAACCAT AATCCTCCCG GACTCAGTAT TGGAGAAATG
GGAACCAGGG GCTCGAATCG TCATCACTTC AAACACTCAG GCTTGGTGGG CAGACCAACA
ACGAACCATC GAAAGAATAT CCGTTGCCAA GCCTGGTTTT GTCAACATAG AACTAAATTC
TCCTATTACC CGGCCAACTA CTGTAAAAGA CGATGTTGGG TTTTCTGTCG AGGTCGCTCT
GCTCTCTCGT AACATCATGT TCGAAAGTGA GGACGGCGGC GGACACTTCT GGGTCATGCA
GACCCCGCTC GTCCAGCAAC TGATCGAAGG TGTCGAGATC TCTAACTTCG GACAGCAGGG
GAGGCTGGGC AGGTATCCGT TGCACTTTCA TATGTGCGGA GACGTGCGAG GTTCTATTAT
CGCCAAGAAC ACCGTCCGAA ACTCCAATCA GCGCTGCTTT GTCGTACACG GAACCAATAA
CCTCCGTCTT GAAGACAATG TTGCCTTCGA CACCAAGGGA CATTGCTACA TGCTTGAAGA
TGGTATCGAA ACCGGCAACG AGTTTGTTCG TAACATCGGT ATTCGCACCG GAGCGCCCAG
AACGATCATT CCGGACATGG GTTCGAACGG CATCGAAAGT GATGGACTAC CGGCGACGTT
CTGGATGACC AACCCCCATA ACACCTGGAT CGACAATGTT GTCGCGGGGT CCGAGCATAC
AGGCTTTTGG TTTGAGCTGT TAAAGCGCGG TGACCGAAAA GTAGACTTTC CTAATCTTGA
CCCGAAGACA GATTCCATCA TTAAGTTTGA TGGCTTTGTT GCTCACAGTA CCTCAGCAGT
AGGGTTCACG TACTATCTGT CTGGCTACGA GCCAACTTCG CTTCAATTGT TTGACAATAT
GCGCTTCTAC AGAAATCACA ACAAGGCCAT TAGAATCCAC CGGACACGCA ATGTTGTAGT
CCAAAATGGA ATCTTTATGG ATAATCCAAT CAACGTTGAG GTCGATCGTT CCGAGGAAGT
ACACCTATTG AATACTTCAA TCGTGGGGTA CTCTGATGGA TACAAGGATA CTGTTAGGCA
AGGGGGTTAC GGGTTCGCAT TAGCGCCTTG CGGCAATCGA AAAGATGAGT TGCTGGGGTT
GACCTTTCAG TCAACAAATC CTTGGATTTA CAAGATGGAA TCAGAGTTTA ACGGTGTCAT
CATGGACAAT GTGAGATTTT CCGGTTTCAG TAGGAGTGGT TGTGGGTCTT ATTCTTCCAT
TGACCTTGCC TCTCGTCTGG ATGGCTTCAA AACCTTCGAG ATGTTCTCTT CTTTTACAGA
TGTGACCATC GACAACCCCG ACTCGATTGA TTTTTGCAGT GGCGCATCTA CCAATGCAGA
TGATGTGTAC GTCTCCGATG TCCACGGCAC CCTTTTTGGA GATAGGGGTC CGTCAACTCT
CTTACGCAAT TCGGCTATCT TGCTTCATTT TGTCGACGAG AGCAAGTGCA CTGACAATAA
TGAAAAGTGC TACAGCTACT GTGAGGACAC TTGCTTCCGC ACTGTATTTT ATCATGTCTC
ACAGAGCCAA AGAAATGTAT ATGCACTGAA AGTGTGTGAT CGAAACAGTA CTTCGAATTG
TGTCGAGGTC TCAGGTACGG TATCTCGACC AGCCTGGCCT ACGCGCTTCG CCGTCCATGT
TCCAAGTGGA CGCCAATACA ACGCTTTTTT CGTAGACGCC AATGAGAAGA GAGTCTATCC
TGACGGCGTA CAAATTACTT TTAAAGACAA GTTGTGTCCA AGTGCCCCAA ATGAAGTCGA
CATTTCCCTC CTCGGCTCGG GTGGCGAAAT TGCTCCCCCA ATAGCGAACC CAACAACCTT
TCCCTCAAGG CCTCCGACAC TTCGACCTAG CTCGGGTGGA GTCGAATCGA CGACGAAATT
GTCAACGAGC GCTCCCGAGG GAGCGCAATT GGCCCCTGAC CCATCCCTAG CTGAGGATCG
GAGTGTTCCC TCGCTGAGCG TGGGACGCAG CAAATGGTGG AAGTGGTTCC CAGGCTGGAG
CTGGTAAAGT TGAGCCATTT GTTTCCTAAA GATGAAGCCG TCTGATCCCA CTACATCCTG
AACTTCTACT AATATATGAC TGTCTGTATC TTCGTTGCCT TCGGTCAACT GGATCAGAAA
TTCCATTCCA CATACATCGG ATATTTGTCA TGAGTTAGTA GACAAACGCA TGACTTGTAT
CGAGGTAAAG AGCCGATGGC CATCCTGCAA TGACCAATCG CTAAAACTAA TCCTTGATTT
CGCAATGCAG TCGCCAATGC AATCGAGTAT TGGAGTCGTT ACCTATAGTT AGCCTTGGAT
GAAGCGTGGC CAATCTCCTA CATGATTACA TTTTCCCATG TCTTCAACAT ACCAGCATTG
AAGTTTTCGC TTACACACAA TGAAATATAC TTTTACTGAT TGATAATTAG GATTTATTAG
GTTA
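
The gene sequence above is printed in blocks of ten bases, so it has to be joined into one string before any calculation such as GC content. The following is a minimal Python sketch of that cleanup. The function names clean_sequence and gc_fraction are hypothetical, and the method the database used to compute its GC content field is not stated in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases among all characters in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, used as a small example.
first_line = "ATGAAGCTGG CCGACTTTTC ATCACCCTTT CTGGTGCTGG TCAGCCATGC AATTGCGATG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Running the same two functions over the full gene sequence gives a value that can be compared with the GC content field in the record.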
 
Protein sequence
MKLADFSSPF LVLVSHAIAM GKGVMARSAT DSLYSTNGNS THVGAVSTKK AKRRLDTDGF 
APLSCNANLD TTSCASFTSI FGNKAVYSSR LIVPCGSCIT MDHPGPTLTF GDGLDIHGKL
VFPDRYELIV RSTMILVQGE LHITAVKPVD GNPQVKFVMH GDKDEFFEPA DSNSNICGDG
LCEAGSRSIT IAGGKLSLRG VPAETPTFMN LYDMDGDSTI ILPDSVLEKW EPGARIVITS
NTQAWWADQQ RTIERISVAK PGFVNIELNS PITRPTTVKD DVGFSVEVAL LSRNIMFESE
DGGGHFWVMQ TPLVQQLIEG VEISNFGQQG RLGRYPLHFH MCGDVRGSII AKNTVRNSNQ
RCFVVHGTNN LRLEDNVAFD TKGHCYMLED GIETGNEFVR NIGIRTGAPR TIIPDMGSNG
IESDGLPATF WMTNPHNTWI DNVVAGSEHT GFWFELLKRG DRKVDFPNLD PKTDSIIKFD
GFVAHIQNGI FMDNPINVEV DRSEEVHLLN TSIVGYSDGY KDTVRQGGYG FALAPCGNRK
DELLGLTFQS TNPWIYKMES EFNGVIMDNV RFSGFSRSGC GSYSSIDLAS RLDGFKTFEM
FSSFTDVTID NPDSIDFCSG ASTNADDVYV SDVHGTLFGD RGPSTLLRNS AILLHFVDES
KCTDNNEKCY SYCEDTCFRT VFYHVSQSQR NVYALKVCDR NSTSNCVEVS GTVSRPAWPT
RFAVHVPSGR QYNAFFVDAN EKRVYPDGVQ ITFKDKLCPS APNEVDISLL GSGGEIAPPI
ANPTTFPSRP PTLRPSSGGV ESTTKLSTSA PEGAQLAPDP SLAEDRSVPS LSVGRSKWWK
WFPGWSW