Gene PHATRDRAFT_31584 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_31584 
Symbol 
ID7195951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp433311 
End bp435167 
Gene Length1857 bp 
Protein Length618 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176585 
Protein GI219109662 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000066135 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACAC TGATACTTTT GCCTGCCTCC GAGTTTGGTT TTGCGAAAGA TTACGCAGAA 
GCACCGATTC AGGATGCATC TGCAACGAAC AATCGTGGAG ATTCGCCGCT TGAACAGCTG
GAGGAATCGG AGGGAGACCA AGTCTCGGTG CCGTCTTTGT GCCAAGAAGA CGAGGTGGAG
CTACAGCCAG CCAGGAACGA AGTTTATGCT GCTGCGGTCG ATGATGGTGA CGATGGCGAT
AGCGAGCACT GCGTGATCAG AGATGAGGAC CACCTCGACG TTGCCATGAT GGAGCCGACT
ACGCAGTTGC TGGAAGAAGC AAGTTCACCA TTAGCCATGG ACGAGGAGGC TGACGTACCA
ATCAGCATCA CGTACTCCAC AGACCCTGGT AGCGACACTG ACTTTGGCGG TTTGTTGAAC
CTTGGCAATA CGTGTTACAT GGCTTCGGCA CTACAAATGA TTGCCAGCCT AGAGTCATTC
GTGGACGAAT TGAAGACCAA AGTAGAAGAA ATACCGACAG ATTCTCAGTT GCAACGCTGT
TTAGTGGACC TATTCGACCA ACTCGCGAGA GGCAAATCAG TTCGGCCTGT CGTATTGAAG
GACACGGTCG ATACGCGCTC AAGCTTGTTT GTTGGATATG ATCAACAGGA CGCGCACGAG
TTTCTGACGA CCCTTTTGGG CATGCTTGAT GACGAATACA ATATCAAGAC GAAGACGACT
CGAAAAGTCA ATGAAAATCT TGTATCATAT GTCAATACAC CAGATGACGC CATGGACGAA
GATAATGATG GCGAAATGGA TGAAACAGAG CCAGTGGACA TGAATCTCTC GCTAACCGTA
AATGTCCCAA AAGGATCTGC CTTGGTTCCA CATTCCGTGG ATACAATAGA TCCTCCTTCT
CTGTTGTACA GCCGCCACGC CTTTTCCGAG TTGGATGTGG ATGAGATTCG CCATTTGTTG
CACGGGACTC CTACCAGTCG AGAGGGCATG TTGTTGCCCG CATATACTGC TGCAACTGAA
CCACGTTGCA AGCTCGTAGG CGGTCGAATG CATACAACTG ACATCCCTTG GACATCGTAC
GAGTCTCATT CCTTTGTTGG TAATCAAATG AAGGACAACG AATGTCTAAA TTCCCATCAT
TCACCGCTTC ATGCTTCTAC CGCCGCAACG TCGACAGCTG ACGATGCCGA TCATCTCACC
GCGGACGACG ACAAAGTGGT TTCTCCTGTG AATGATTTCT TTACAACTGT GGCTCGTGCG
CGGCTTACCT GCGATTCCTG CATGTATACG CGCACTCATC TCGAAACCTT TTTACACTTG
TCTCTCGAAA TCGGGAATGA CGTCACCGTT GAGGATAGCT TGCGTCGATT CTTTGCGCCC
GAGCCACGCG AACTCAAGTG CGAAAAGTGT TTTTGCGAAA GGGCCACTCA GACTACCGAG
ATCGTCAAGC TGCCACGAGC CTTGCTTTTG CATTTTAAAC GCTTTATCGT GGACGTGAGT
GATGATTGGG CTTCCGTTTC GTATCGCAAG AATCAGTCAG CCGTAGTCTT CGAAGACACC
CTGTCGTTGG ACAAAGACAT GGGTGTGCTT TCGGAATTCT TAGCGACCGA TTATTCATTA
CCAACCACAA GCGGAACCTC TTGTAGTTTG GGGAACAGGT ATGGGATTCG CAGCGTGGTA
AACCACATTG GGGCGTCGGC GAGTTGTGGG CATTATACGG CGGATGCGTA TCGAAGGAAA
GATGAGACGC GGAAGTGGAT GCGCTTCAAT GACGCCTTTG TTTCAAGTAT ATCGGAGAAG
CAGGCTTTGT TAGATTCACA AAAGACGGCC TACATGGTAT TGTACGAGTT GGAATAG
 
Protein sequence
MATLILLPAS EFGFAKDYAE APIQDASATN NRGDSPLEQL EESEGDQVSV PSLCQEDEVE 
LQPARNEVYA AAVDDGDDGD SEHCVIRDED HLDVAMMEPT TQLLEEASSP LAMDEEADVP
ISITYSTDPG SDTDFGGLLN LGNTCYMASA LQMIASLESF VDELKTKVEE IPTDSQLQRC
LVDLFDQLAR GKSVRPVVLK DTVDTRSSLF VGYDQQDAHE FLTTLLGMLD DEYNIKTKTT
RKVNENLVSY VNTPDDAMDE DNDGEMDETE PVDMNLSLTV NVPKGSALVP HSVDTIDPPS
LLYSRHAFSE LDVDEIRHLL HGTPTSREGM LLPAYTAATE PRCKLVGGRM HTTDIPWTSY
ESHSFVGNQM KDNECLNSHH SPLHASTAAT STADDADHLT ADDDKVVSPV NDFFTTVARA
RLTCDSCMYT RTHLETFLHL SLEIGNDVTV EDSLRRFFAP EPRELKCEKC FCERATQTTE
IVKLPRALLL HFKRFIVDVS DDWASVSYRK NQSAVVFEDT LSLDKDMGVL SEFLATDYSL
PTTSGTSCSL GNRYGIRSVV NHIGASASCG HYTADAYRRK DETRKWMRFN DAFVSSISEK
QALLDSQKTA YMVLYELE