Gene PHATRDRAFT_40779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40779 
Symbol 
ID7198544 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011693 
Strand
Start bp373434 
End bp375597 
Gene Length2164 bp 
Protein Length685 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184698 
Protein GI219129023 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGGGA CACACGCTAG TACCGCGACG GTACGACTAC TCCGGTGTGC GACGGTACCA 
CCAGTCGGTC GATCGTCACC TTCTTTGAAA CGGCGGTACC AATCGTACGC TCCCCGAGTG
GTGGCGGCGA CTACGGCAAC GGCAGATATT CTGCGGAATG CCAACGGATC CTTCGATAGT
CCTAATTACC GGACACTGCT GATCCGCCGG TGGGCTTCCT CCACGACTAC TACAACCAGT
ACTACTGCGA CTACTACAAC GGCCACGTCC ACAACGAGTG GGGCTCCGGA TACTACTTTG
TCAACTTCTG CACTCGGTCG GATAGAATCC CTAACGTCGA CGACTTCTCC ATTGGCCCTG
ATCGAACCCA TGTCCATGCC ACGTATTTAC CGCTATTTGA ATTTAGCAAC GTACCAACGA
GATGAGCTGG AAAGCGTGTT TGACAGAATA CGAAACGGAT ACGCTCTCCA AACTACACAC
GACCAAAAGA CTGCCGTCGG GACTGGGCCC GAAGCCACGG ACTCCGACTC GGTCGATCCA
GAGGAAACCA TTACGGACTC TCAGATCCAG CGGTATCTAC TGTCGCGCAT TTACGAGCTC
GAAGAAGAAA GTGACGAAGT CATTGAAGAA GGTCCAGTTA CTCAGACCTT GCGCGAACAG
TACGTACAGC ACGAAAGTCA ACGCTTTCTC CGAGCCTTTG CGGATTACGC GACGAGGGCA
CCCGGGAATG GTACGCTTCT GACCACAACG ACCATCAACA AACCCGAATT TTGTGACCTT
TTGACAACCA AAGCCAGTCA AGTTGATCTA CAGCGAACCT GGCCCATTAC GGTGAGTATG
CTCTTGGTAG GCTCGTCTGT GGGAGTCATC ACGCCCGCCA TGCCTTTTGT TGTTGAGCAA
TTGAGTTTGA CGGCGAGTCA GTACGGTATG GTGGTCTCGG CCTTTGGATT GGCCAAAATG
CTGGGCAACA TCCCGTCCGC GATTGCCGTG GAACGACACG GACGGAAACC CTTCATGACC
TGGAGTCTAC TCATCATTGC CTGCGGCGTG GGCGGCATTG GTTTAGCCAA CAGTTTTGAA
GAACTCTATA TTTGTCGATT ACTGACGGGT ACGTGACAAA AGGGTCGCGG CAACCTCCTT
TATTTGTTTC TAGTTTACAC ACAACACTCA TCGCTAGTAC GTTTCACATC TTGCCTCTGA
CTGTGCGACT GCAGGTCTTG GTGTGAGTTT CTTGTCCACG GCCGGCACGC TCATGATTTC
AGACTTGTCC ACGCCGTTGA ATCGCGCGTC CACGTACGCG CCGATTATGA GCGCCTTTTC
GGCAGGCACC GCACTCGGAC CAGCCCTCGG TGGGATACTA GTCGACCAGG TGGGCCTGCA
TCCAACATTT TACATGGTCG GGGTTTCGTA TTTGGGAGTC GCAGCCTTGA ATCGAGCCAT
TTTGAACGAA ACCAAAACAC ATGCCGTCCA TTTTCCGTGG CAACAACGGC GGTCCGGCGA
TGATGTCGCG GGCGACAGTT TGTCGAGCTC AGTCCAGGAC GCAGTGGGTC AATGGGTACC
GTTGTTGCAG AATTCGTCCG TCCGCAACAT TATGATTATG AACGGGATGT ACTGGATTGC
ATTGGCGGGC TCACAGATGA CATTGTTGCC GTTAATGCTG ACCAATTCTG GAGGCTTGGC
CATGTCCGCG ACTCAAGTTG GTCAGGTCTA CATGTCCATG AGTCTCGTCC AAATCGTCGG
CAACCCGCTA TTTGCCAAGG TTATGGATAA GACGGGCAAG GCGCCGGCGA TTGTGACGGG
TTGCACATTG ATCAGTACAG CTATGGTTGG ACTCGCCTAC TGCGACGATT ATACACAATT
AGCAGCAGCG CTGGGATTAT GGAGTATTGG ATCGAGCATG CTCAGTACAG CCCCTCTTGC
TCACCTTTCG GATAAGGTAG ATGATGCCAA GCGGGCCCAG GCAATTGCAC TACTAAGAAC
CTGCGGGGAC GTTGGATTTT TGATAGGGGC CACTGGAATC GGGGCGTTGG CCGACTGGAC
TGGGAGCCTG GAAACAGCTA TGCAAAGCAG TGCCGGTTTG TTGTTCACAG CAACGGCGTG
GTATGCAACC AGACAGGTAC TGGATTCACG GATAGGAGCG CCTGCGAGAA AGTCGACCTC
ATAG
 
Protein sequence
MPGTHASTAT VRLLRCATVP PVGRSSPSLK RRYQSYAPRV VAATTATADI LRNANGSFDS 
PNYRTLLIRR WASSTTTTTS TTATTTTATS TTSGAPDTTL STSALGRIES LTSTTSPLAL
IEPMSMPRIY RYLNLATYQR DELESVFDRI RNGYALQTTH DQKTAVGTGP EATDSDSVDP
EETITDSQIQ RYLLSRIYEL EEESDEVIEE GPVTQTLREQ YVQHESQRFL RAFADYATRA
PGNGTLLTTT TINKPEFCDL LTTKASQVDL QRTWPITVSM LLVGSSVGVI TPAMPFVVEQ
LSLTASQYGM VVSAFGLAKM LGNIPSAIAV ERHGRKPFMT WSLLIIACGV GGIGLANSFE
ELYICRLLTG LGVSFLSTAG TLMISDLSTP LNRASTYAPI MSAFSAGTAL GPALGGILVD
QVGLHPTFYM VGVSYLGVAA LNRAILNETK THAVHFPWQQ RRSGDDVAGD SLSSSVQDAV
GQWVPLLQNS SVRNIMIMNG MYWIALAGSQ MTLLPLMLTN SGGLAMSATQ VGQVYMSMSL
VQIVGNPLFA KVMDKTGKAP AIVTGCTLIS TAMVGLAYCD DYTQLAAALG LWSIGSSMLS
TAPLAHLSDK VDDAKRAQAI ALLRTCGDVG FLIGATGIGA LADWTGSLET AMQSSAGLLF
TATAWYATRQ VLDSRIGAPA RKSTS