Gene PHATRDRAFT_33660 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_33660 
Symbol 
ID7197942 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp35370 
End bp38672 
Gene Length3303 bp 
Protein Length253 aa 
Translation table 
GC content55% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178154 
Protein GI219114717 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.83302 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGTCA AGCGCAAGTT TGTCGCGGAC GGTGTCTTTT ACGCCGAACT GAACGAGTTG 
TTGACGGCGG AACTCGCCGA AGAAGGGTAC GCCGGAGTGG AAGTCCGCAC GACGCCGCAC
CGTACCGAGT TGATTATCCG CGCGACCCGC ACACAGAACG TGCTCGGTGA GAACAATCGC
CGCATTCGGG AACTTACTTC GGTGGTGCAG AAGCGCTTCA ACTTTGCCGA CGGTGCCGTG
GAGTTGTACG CGGAGCGTGT GCAGAACCGC GGACTGTGTG CGCAAGCGCA GGCGGAATCG
CTCAAGTTCA AGCTTCTCGG AGGATTGGCG GTGCGTCGTG CGTGCTACGG TGTCGTGCGA
TTCGTCATGG AAGCCCAGGC CAAGGGGTGT GAAGTCGTGG TGACTGGAAA ACTGCGTGGA
CAGCGTGCAA AGGGCATGAA GTTCGGTGAC GGATACATGA TCAAGACGGG ACACGCCGGA
CAAGTCTACA CCGATTCCGC CGTTCGTCAC GTTCTCATGC GACAAGGAGT CATTGGAATT
AAGGTCTCTA TCATGCTTCC GCAAGATCCC AAGGGACAAA TGGGTCCCAA GATTCCGTTG
GATGATGTGG TGACCATTCT GGAACCCAAG GAGGAACCGT CTCCGGCGCA ATTGATGGCA
CAGCAAGCCG CGGCGGCTGC CGCTTACCAG GCGGCGGCTC CTCCCATGAT GGAAGCACCC
GTCGACCCGG CGGGTATGGC TCCCGTCGAT CCCGCCATTC AAGCCGGATT CTAACTATAG
TCGTTGCCGT TTACCTACTG GTTTGTGTGT GTGTCGTGAG TGTGTGTACA TATATATATG
AGAGTGTGTG TGTTTGTATC GGATAAGCGA GGGTTAGTTC TAACTGTAAG TTAAGGGGGA
CGGGTACCTA CAGGAGAGCG CCATACTTGT CTAGTATGCT CGCCATTCGT ACTGCGTCTT
ACAGTTCGGG ATCGTCGACG AATCGGGCTT GCAGATTGCC GTCGAGATCG AGTCGACAGG
AAGGCCCGTC GGCTTGGTCT TGTTCCCCAA AGATGTACCG GTTGTTGGCG ACGACTTGGT
AGAGACCATC GCGTGTCCAG CGGGGCACGA TTCGCTGGGC CAGGTTGGCC GTGGCGCGGA
GGACTCGTGG AAGTCCTTCG AGCTGGGTAG CGATAAAGAG GACGGCGTCC GATTCAAACC
AGGCCCGGGT GGGCGTGACG ACAACAATGG AGGAAATATC GTTGGCGCGT TTGCCGTGTT
GGAGTAGCAA GGCCTGTCCG GTGCGACTCT GCAAACTGGC GTAGCGAAAG ACCCCGTCGT
GGGCAGCCGC GTCCCAATCG AGACAGCGGT TGACTCCGGC GTTACAGAAA TTGCACACGC
CGTCAAAGAG AATAATGGGC TTGGAACTGG TGGCAAAGGC CGTACTAGCG ACGCGGCGCC
AGTCCCAATC GACCGTTGCG GGTCCGTCCG ACGTGGTTTG GGAGCGGGGC GATGGTGTTG
ATGGAGTGGT GTCGGTACTG CTGCTGCTGG CACTTGGCGG AGTCGGAGTG GAGGTACTGG
CGAGGCTGGT TGTTGCGGTT GCGGGTGCGG TTACGACACG TCCGTTGGTA GGAGTGGCGC
AATGGAACCG TGACGACCGC TGCGGTACGG TCACGATTGA AAAGGCTTGG ACGGATGACC
ACCGAGCTCC GGTCTCGCTG CCGCGCCAAA CGACGGCGAC GACGACAATG AGCAGATACA
CGCACTGATG GAGCCAGAGT CGAGATTGAC TGTGAGATGG CAATCGCGTC ATGTTGGGAA
TAACTGTTCG TAGACTTGGT AAAATTGTCA AAGAACGTTG GAAAACAATC ACAAATCACG
AAAGGTCGAT CGTGTTGGCC AACACGCAAA AGTGACAAGA GATGGACTAT GTAATAATCG
TTCGCGAGCA CTCTGGGAAT TCTTCCTTCG TTTCGTTTGC TTTTGTAAAA ACGGTGATGG
GGTTCGCGAA GGGAAGACTT TATAAGGAAA GAGTACTTTC GTATTCTTCC TGTTCTTACT
GTTCGTTGTG AGGAACGAAG AATACGACAA TAGCCTGGGC GGCTTCCCGA ACCACAGTTT
ACCGTTCTGT GAGCGAACCT GGAATCGAAA ATCCTAGCAG TAAATCCACG ACGTGTGCAA
AATGCTTCGG GTTCCTTTGT GCTGATGATA CCGAAGGGTG CTCTGCTCTC ACTGTCGGTA
CATGATACCG TCAGTCCGCA GCTGGGAAAC GGGCGAATGA CTGTCCGTCG GTCATTCACT
GTCCGTCAGT CTATCAGTCT ATCGGTCGGT CTCACCCTTT CTGCAAGGAA AGTACACGAA
ACGAAAAGTG TAAATAAGTG GGGCGGGGCG GAGAGGGAGG GGTTCGAACA GTAGACACGA
TTACCGCTCG CAAAATGCAC CGCACGCATC CAGCTCGCCA AGTCTTCCCA CGGGACGACC
GAACACCTGG TGCCATTCTG AGAATTTGTG TTGTTGTCTC TGGATCTACC GAGTGTGTGT
GTATCCGCTT TCTTCTTTGT GCCAACTTTA GTAGTAGTAA CAGACATGAG TTACGCATCC
CCTTCGCCGT CCTACGGGAA TCCTCCGTCG TCACAGCAGC AGCAGCAACA GCAGCAACAA
CATACGAGTC CGATGCCGGC CTATTCCACA GCGCCTCCCA CGCAACAAGT ACAACAGCAG
CAACAGCAGA CGTGGAACCA GTCGTATCCA CCTCCGACAC AACAACAGCA ACAACTCCAA
TCACAGCCTC AGCCGTACTA TCCACAGCAG CAATGGCAAG CACAGCAACA GCAGCCGGTA
CAGTATCAGC AGCAACAAAC ACAATCACAA CCCGCTCCCT CGTATCCACC ATTGGCGGCA
CGTCCGTCTG GGACGGCTCC GGCTTCCCGA AATCCCCGTC TTTCTCGGCC ACCTTTTGTC
GATCCACAAA CCAATATTAT CTACGACACC TCCGACGCCG AATACGAAGG ATGGCTCACC
AAACAGAGTA CCTGGTTTAA GGTACGTTCC TCCCGCAAAA CTGTACGCAC GGATAATATA
CGATTGTGTG CGAATACAGA ACAGCACTGC CGTGTGGCAA CGTGATAACA ATCCACCAGG
CCATTCTGTT TTTGTTTTGG TATTGTTTTT CCGTTCTAAC TTCTACCCCT TTTTTCTAGG
AATGGCGTCG CCGGTACTTT ATCCTCAAAG GCAGTAAGCT TTTCTTCGCC AAAAACGAAT
ACGCCGCCCC GCACGGATTT GTGGACTTGG CTACCTGTAC CACGGTCAAA TCCGCCGATT
TGA
 
Protein sequence
MSVKRKFVAD GVFYAELNEL LTAELAEEGY AGVEVRTTPH RTELIIRATR TQNVLGENNR 
RIRELTSVVQ KRFNFADGAV ELYAERVQNR GLCAQAQAES LKFKLLGGLA VRRACYGVVR
FVMEAQAKGC EVVVTGKLRG QRAKGMKFGD GYMIKTGHAG QVYTDSAVRH VLMRQGVIGI
KVSIMLPQDP KGQMGPKIPL DDVVTILEPK EEPNGVAGTL SSKAVSFSSP KTNTPPRTDL
WTWLPVPRSN PPI