Gene PHATRDRAFT_38085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_38085 
Symbol 
ID7203036 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp28878 
End bp31246 
Gene Length2369 bp 
Protein Length619 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182146 
Protein GI219123676 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGGGT GCCATTCCTC CATGGTTAAC GCCTTTCGTA GCGTTTCTCG CTCACTGTCA 
ATTCCAAGCA GACTCGCCGT CTTGAAGGAT CCCCAAGATC TCGTCACCAA AGAAGCCAAC
GTGAAGCCCG CCGGGACTCG CACCAAACCA ACCGTTGATC CCTTCAATCC CAATTTCGAA
TCGATCGCTT CGGTTCCCTA CAACACAGCG TTTCCGTCGT CCACGAAGGA GTATAAGACC
GTTGTCCACG AAGCAACCGG CCACCGGCTT CACGTGCCCT TCCGTCGCGT CCATTTGGAA
GATCCCGACC AACTCTACCT GGATTTGTAC GATACCTCGG GGCCGCAGGG CGTTGATCCC
AAGAAGGGGT TGGCCAAGTT GCGCCAGGAA TGGACGGACG AGCGTGAGGG TAAGTACGAA
CGTTACACTC AGATGCATTT CGCCAAACAG GGAATCGTTA CCGAAGAAAT GCTCTATTGC
GCTACGCGAG AAAACATGGA ACCTGAGTTT GTCCGTTCCG AGGTGGCCCG AGGACGCGCC
ATTATTTGCT CCAACAAAAA GCATCCCGAA CTCGAGCCGC AAATCATTGG ACGCATGTTT
AAAGTCAAAA TCAATTCGAA TATTGGAAAC AGCGAACTGG GAAGCAATGT AAGTGTACCT
GTTCTGCCAC CAACTATCCG CAACTGTCGT AGAGAGCAGT ATTTAGTGGA ATCCACATAA
ATGCTCACCA TTTCTTGTTA CCACTTACGC TTTCAGATTG AAGACGAGGT GGAAAAGTTG
CAATGGAGCA TGCTGTGGGG TGCCGACACA CTTATGGATT TAAGCACAGG CAAACATATT
CACCAAACCC GCGAATGGAT CATTCGCAAT TCGCCCATCC CTGTTGGTAC CGTGCCCATT
TATCAGGCTC TCGAAAAAGT GGACGGGATT GCCGAAGACT TGACGTGGGA ATGTTTCAAG
GAGACGCTTC TCGAACAAGC CGAGCAGGGT GTGGACTACT TCACCATTCA CGCGGGCGTC
TTGCTTCGAT ACGTGGTACG TCATATCGCG GGTTGGCTTG CACCGCAATG TACTACGCCT
TGAAGTAGAG ATTTTTTGTA AACTCACTTT TCAATTTGAT TAGCCAATGA CGGTCAAGCG
CATGACAGGT ATTGTCAGTC GCGGCGGATC AATCCATGCC AAATGGAATA TTTTTCATCA
CAAGGAAAAT TTTGCCTACG AACATTGGGA CGACATTTTG GAAATTTGCG CCAAGTATGA
TATCGCGCTG AGTATTGGTG ATGGACTACG TCCCGGATCC ATCTACGATG CCAACGACGA
AGCGCAGTTT GCGGAACTCT TTACTCAGGG AGAACTAACT AAGCGCGCTT GGGAAAAGGA
CGTACAAGTT ATGAATGAAG GTCCTGGACA CGTTCCTCTG CACAAGGTAA GCTATGATCA
GGCAAGGTAG ATTTCGCCTC GACCGTACGT CTTGCTACTA ATCCTTCGAC TTGGTTGCGT
ACCCAGATAC CCGAAAACAT GCGCAAACAG CTAGAGTGGT GCAACGAAGC ACCTTTCTAT
ACACTAGGCC CTCTCACTAC AGATATCGCA CCCGCCTACG ATCACATTAC TTCCGCCATT
GGTGCCGCAA CGATTGCGTC TCTCGGAACT GCCATGCTCT GTTATGTTAC GCCAAAGGAG
CATCTTGGTC TTCCCAACCG CGACGATGTC AAGGCTGGTA TCATTGCCTA CAAGATTGCA
GCGTACGTAT CTGGTGGTTT TTGGTTGTAT GTCCGCCAAG TTTACAGTGT TGATTGATCC
CCCACGGCAC AGACTACACA GCAAGGTACA ATAGGAGTAT TGCATTGACA GTCAATACAC
TCACGTTAAA CACTTTGTCG TTTCTGTTTA CAGTCACGCC GCCGATCTTG CCAAGGGATA
TCCTGGGGCC CAGGATCGCG ACAATGCTCT CTCGAAGGCC CGTTTCTCTT TCCGTTGGAA
TGATCAGTTC AACATTAGCC TCGATCCTGT CACTGCCAGA GAATTCCACG ACGAAACCCT
TGACAGCGAT GCCGCAAAGA GTTCCCATTT TTGCAGGTAA GCGCAAAGGT CACGACTTTT
GTGGATGCGA AACGTCCTGG TCCTCTCTCA CTTGTCATTT TTATCAACCT GTAAACAGCA
TGTGCGGGCC CAAGTTCTGC TCCATGAAGA TTACGGAAGA TGTCCGTGCG TACGCGGCCG
AGAATGGCTA CGGAGTGGAA GAGACGGCGG CCAAGGGAAT GGAGACAATG AGCGAGCTTT
ACAAGGAACT GGGCAACAAG CTCTACGTGG AAGATGACGA GAAAACGTAC GAGAACACCT
TCAATCCTTT GAAAGATCTA GCGTCCTAG
 
Protein sequence
MAGCHSSMVN AFRSVSRSLS IPSRLAVLKD PQDLVTKEAN VKPAGTRTKP TVDPFNPNFE 
SIASVPYNTA FPSSTKEYKT VVHEATGHRL HVPFRRVHLE DPDQLYLDLY DTSGPQGVDP
KKGLAKLRQE WTDEREGKYE RYTQMHFAKQ GIVTEEMLYC ATRENMEPEF VRSEVARGRA
IICSNKKHPE LEPQIIGRMF KVKINSNIGN SELGSNIEDE VEKLQWSMLW GADTLMDLST
GKHIHQTREW IIRNSPIPVG TVPIYQALEK VDGIAEDLTW ECFKETLLEQ AEQGVDYFTI
HAGVLLRYVP MTVKRMTGIV SRGGSIHAKW NIFHHKENFA YEHWDDILEI CAKYDIALSI
GDGLRPGSIY DANDEAQFAE LFTQGELTKR AWEKDVQVMN EGPGHVPLHK IPENMRKQLE
WCNEAPFYTL GPLTTDIAPA YDHITSAIGA ATIASLGTAM LCYVTPKEHL GLPNRDDVKA
GIIAYKIAAH AADLAKGYPG AQDRDNALSK ARFSFRWNDQ FNISLDPVTA REFHDETLDS
DAAKSSHFCS MCGPKFCSMK ITEDVRAYAA ENGYGVEETA AKGMETMSEL YKELGNKLYV
EDDEKTYENT FNPLKDLAS