Gene PHATRDRAFT_33648 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_33648 
Symbol 
ID7197936 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp12638 
End bp14491 
Gene Length1854 bp 
Protein Length595 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178150 
Protein GI219114709 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGCATG CCTTGGTTAC TCGAACGCAT GGTCGATATT GGACCGTACG ACGATTGTCG 
GGTACATCAG TACTTACATT CCTGTTGCTG ATAGCAGTCG GTCTGAACAT AGGCCGGGTG
CGTCAATTGA TTATTTACGA GGACCTGAAT GTATTGGCGT ACTTGGATAT CCCCCTTCAG
GTTGAGACCA CTGAAGATGT AAGTGTAAAA GGGAGCAGTG GAAACACTTT AAATCCTAAT
TACTGTTGGC TCTGATTTAC AAAGCATTTT TTATTTCAGC GAAACGCGGA AGAGCTGTCC
ATGGACGAAT ACCCAGTCAA ATTCAATACA TTGAAAAGCA CAGATCGGCT GCAAAATGTC
ACACCATCAC CCTTGTTGAC AAAGACAGAT GGTATCCTTT CCACGGAAAA GGGGTCCGTT
GAAGCGGCTA TTGCGACAAA GGATGGTGCG TCGACCACAG TCGCTTTGTC TCGACCAACC
GACCGTACGC TTCGTTTTCC TACAGTAGCT GAGCGGGTTC GGTTTTACAT GTCGTCTTGG
TACGAACCAC CCTGCAGCGA AGTGGACCTT TTACAAGTTG TGCAGCATAC TGGTAGTAAG
CTAAACAGCG AAATGGGGGA TACGACGATT GTCACGAACG GAGGCAACCC GGTCATTGAA
TTTGGAGGCC ACGAGAAACG AGTATATCCT TCTTTTTCAC TGCAGCGTCG AATGCAGTCC
ACGAACAACT CCCTCTCGGT GGTCTTGAGA GCAAGGGCAA TGGCTGGGGG AAATGTTATC
TTTGCTTTAG ACAAGACGTC GCTTGATGTA TGCCAATTTG GTCCCCAACC AAAGAAAATT
TTATGGCAGG TATACTGTCC AGAGCTCCGG GACAAGTTGT TGCTACCGTA TTTAGCTGCA
AACCAAACCA AGTTGGCCGG CAAAGCAAAG AGTGGAGACG AAAAGACGCT CATACTCGCC
CAAGTTGGTG ATGCTTTGGC TACCAGAGTT TTAAATGACT TAGGAGAAAT CAGGCATCAA
TCGCCCAAAC CCTCTGTTCC TCATTTCAAA AAGGTACGAT CTGCCTGGGA AGACAAGGAT
GGAAGAGAAT CCTTGTTGAA TGCATCACCA GTGGCTTGTT CCACGTTACA GAGCCGACGA
ACCAACCACC AAAACTTGGA ACCAATTATT TGGAAAATGG AAATTGACCG GCATTACGGT
GCAGTCCACC AAATCACAGA AGTTGATATT CCTTGGGATC AAAAACGGAA TGTAGGCGTT
TTTCGCGGGG CCACCACCGG GAATGTGAAC CAACGTTTAC CAATGCGCGA GCGCTGCCTG
GAAAACCAAC GTTGTCAACT TGTACTCATG TACCATAATT CATCCTTTGT GGATGCCAAG
TTTACCAATG TCCTAAAGCA AAGCAAACTA CCAACCGAAT TTGATGGCAT AACAATGACC
GGCAGTCGCT TTCAACTGGA TAAACTTTTG GAATTCAAGG TGTTGATATT TTTAGAAGGA
AATGACGTTT CTTCTGGGCT CAAGTGGGGG TTGTACTCCA ATTCGGTTGT ACTGATAAAC
AAGCCCTCCG TATCGTCATG GGCAATGGAA GAACTGTTAG AGCCGTGGGT GCATTATGTG
CCTCTGAAAG ATGATCTTAC GGACGCAGAA ACCCAAATAA AGTGGGTCAT CGAGCACGAT
AGAGAGGCAA AGGAAATTGC GATTCGGGGA CAGCTTTGGA TTCACGACCT TTTGTTTGAC
AAGCATTCAG AAAGAGACAA CGCTGCAATC AACCAGGAAA TACTTAGCCG ATACGAGGCA
CATTTTCGAC CAGGGATTGA ACAAGAGAAT GGGCAACCAA ACTTGGAGAA TTAG
 
Protein sequence
MEHALVTRTH GRYWTVRRLS GTSVLTFLLL IAVGLNIGRV RQLIIYEDLN VLAYLDIPLQ 
VETTEDHFLF QRNAEELSMD EYPVKFNTLK STDRLQNVTP SPLLTKTDGI LSTEKGSVEA
AIATKDGAST TVALSRPTDR TLRFPTVAER VRFYMSSWYE PPCSEVDLLQ VVQHTGSKLN
SEMGDTTIVT NGGNPVIEFG GHEKRVYPSF SLQRRMQSTN NSLSVVLRAR AMAGGNVIFA
LDKTSLDVCQ FGPQPKKILW QVYCPELRDK LLLPYLAANQ TKLAGKAKSG DEKTLILAQV
GDALATRVLN DLGEIRHQSP KPSVPHFKKV RSAWEDKDGR ESLLNASPVA CSTLQSRRTN
HQNLEPIIWK MEIDRHYGAV HQITEVDIPW DQKRNVGVFR GATTGNVNQR LPMRERCLEN
QRCQLVLMYH NSSFVDAKFT NVLKQSKLPT EFDGITMTGS RFQLDKLLEF KVLIFLEGND
VSSGLKWGLY SNSVVLINKP SVSSWAMEEL LEPWVHYVPL KDDLTDAETQ IKWVIEHDRE
AKEIAIRGQL WIHDLLFDKH SERDNAAINQ EILSRYEAHF RPGIEQENGQ PNLEN