Gene PHATRDRAFT_42809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42809 
Symbol 
ID7196420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1196965 
End bp1198146 
Gene Length1182 bp 
Protein Length323 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176740 
Protein GI219109975 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.767843 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GGTGAACGGA GACGGCACTG CAAGCCAAGG CGGGCAAAAA GATCCTTACG GATTGTGCTA 
ATGCAATAAG CTGAGCAATG GTTGCATCGC ACGAAAGCGA GGGTAGCATG GCGTCCGTCG
CCCCTACTGC AAAGATTGAT AAAGATATCA GTCGACCGGC TTCGCCCCGT GATGTAGCTT
GCAGCTCGAT GGAGATGTTG GCGCGGTCCA TGGAGCCCTC ATCAAAATCT TTCTTGCCAA
CCGCGAAACC GTTTTTTGTC ATGGATTGGT TGGACCGAAT TGATGAAAAG GACTTGGAAT
TAGCGCGCCA GATTATAACG ACTCCAGGTA GGGCTCTTTT GACCGATTTC GGTAAGGAAA
CCAGCACGCG AACAGTGTCG CTGGCAGGAG CTTCTCGAAA AGATTTTGCT GGTGAGCCTT
CAAAACCTTC GGAGCATCCT CCCAAACAGA AGTTTACGCC CCGATCGCAT TTCCGCAAGC
GAGGAATCGC CGTTGGAAAT GGATGGAACG CTAAAGGCTT GCAAAAGGCC AAGGAGGGGA
ACTGGGAAGA TGCGCTGTCA TGCTGGGAAA ATGCTCTCGA AATTCGTTCG CAAGTTTGCC
TGTCTCTGGT AGATGTGGCC AATACTTGCA ATAATATCGG CATTGCCTTA GGAAAGCTGA
ACCGATTTGA TACCGCAGTT GAACACTTGG AGCGTGCTCT CGAAGTGAGG GAAGCGCTTG
CGGAAGGAGC AGATAACCAG GCAGAAATTG CCACAACGTT ACACAACATC GGAAATGTGT
ACCAGCAGGC AGGCGAGTTT GGTAAAGCGG AAGAGTACTT TGTAGAATCT AGGGACATGC
AAATCAAAGT CCTTGGACGA GATCATATAC ATGTAGCCCG AACCTTGGCA GCATTGGGCA
ATGTTCGCTA CCAGGCCAAC CGAATCCCGG AGGCTCGGAA AGCGTACTGG GAAGCCTTGA
CCATCTTCCA ACACGTTGGA CTCCCTGAGG CTGATATTGA GGTGCAGTGT GTTTTAGGAA
ACGTGCAAGA GATTGACAAA TCGAAATAGA ATTCACACGC GGAAGAATGA TTATTGAGAG
TTGAGTTTCC TAACTTTTAC GTGAGGAAGA ACATCAATGT GACAGGACGC TAAAATAACG
ATTAGAACTA GTAGAGACTC AAAACATTAA AGTTGCATTC TA
 
Protein sequence
MVASHESEGS MASVAPTAKI DKDISRPASP RDVACSSMEM LARSMEPSSK SFLPTAKPFF 
VMDWLDRIDE KDLELARQII TTPGRALLTD FGKETSTRTV SLAGASRKDF AGEPSKPSEH
PPKQKFTPRS HFRKRGIAVG NGWNAKGLQK AKEGNWEDAL SCWENALEIR SQVCLSLVDV
ANTCNNIGIA LGKLNRFDTA VEHLERALEV REALAEGADN QAEIATTLHN IGNVYQQAGE
FGKAEEYFVE SRDMQIKVLG RDHIHVARTL AALGNVRYQA NRIPEARKAY WEALTIFQHV
GLPEADIEVQ CVLGNVQEID KSK