Gene PHATRDRAFT_1971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_1971 
Symbol 
ID7198358 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011692 
Strand
Start bp282391 
End bp284055 
Gene Length1665 bp 
Protein Length492 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184517 
Protein GI219128642 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAACTCGAGT CGGAACAGCA ACAAGCTAGC GTTCAGGACG AGTGGATGGA TCTAGCGGAG 
CGCATTGACA GCATCCAGGG GAACGATGTA TGGCGGTCCG ACCCGAGCTG CCCCTTGTAC
GAACAGGAAC GTATTTCGGC GCGGATTGAC GAATTGGTGC ACCTTATGCG ACGCCGGGAT
ATCTTTGAAC TCATGTTTGT GCTGCGTGCT TCGATTGGAC GCAACAAGTT TGGGCTGTTG
CACGAAGGGC TGTTCAGCAA GGCGTTAGCG GGTACCAAAG TACTCGTCGA GACGTATCAC
AACGTGGTCT GTGCTGCCTT GGACTTTTGT TGCGACGCTC CCGTGTCTCC TGACGAGGAT
CCTATCCCTA CGGATGCCCG TCTCGCATTT TTCAACGAAA CTCGACACGC TTACGGACGT
ACAGCCTTGC TCCTATCCGG TGGTGCGGCC TTGGGCTTTT ACCATACTGG TGTCGTCAAG
ACGCTTATGG AAAATCGCCT CATGCCGCGT GTAATTGGTG GCAGCTCGGC TGGTTCGCTC
GTGTGTGCCA TGATAGCTAC ACGGACAGAC GAAGAATGCG TGCACGACAT GTTCAACGCC
CAAGGTACCG ACGCTCCGGG ACATTCCGGC CAACTCCCAC TCAACTTTTT CCGCCCCCTC
CAAACGGGGA ATATCAACGC AGCCAAGGTA GACGCAACGC CGAACAAACA ACTCGGGGGA
ATTCGTGAAG TGTACTACAA TACTGCTGGA TTCTTTCACG ATGCCAAGCG GACCTTGCAA
GGGTTGGTTC CAATTCCTCT CCGACACTTT TCCGCCGTGT TGTACGATAT CGTCACCGGC
AACCGTCGGC CTCAAGACAT GCTCATGAAT GATACAGAAC ACTTCCGAGC TTGCGTACGG
GCCTGTGTCG GTAACTTCAC ATTTCAAGAA GCTTTCGACC GGACTGGGCG TATTCTGAAC
ATCGTGGTGA CGCCCAAGAA CAATTCCGAT CCGCCCCGCT TACTCAACTA CTTGACGGCA
CCGCACGTTA TGGTATGGTC AGCTGCCGTC GCGAGCTCCT CCCTACCCGG AGTTTTTGAA
GCTAATCGAC TAGTTGTCAA GGAAGCAGAC GGTTGGGAAC GGTACGAGTC GGGCGGCGCG
CCACAGCACT TTTCGGATGG ATCGATGGAA CAGGATTTGC CCATGCAGCA GCTATCGGAG
ATGTTCAACG TCAACCACTT TTTGATCTCG CAGGCCAACC CACACGCCGT CATGTTTGCC
AATTATCAAC AAAAGAATTC GGTGTGGAGT AATCCTGTTA CGGGCTTTGT GGATTCTATT
CTGACCTTTT TACGCGATCA AGTGCGCACT TGGTTGTTGC ATCTCGTGGC GTGCGTTGGC
GCTCGTAGTA TTACACCTAT GTTCCAGACT CAACGTGGAA TTGGTACAAC TTTCCTGACG
CAAGAATACG AAGGGCGGTC TTGCGACATT TCACTCATCC CATGGTTGGG TCATCGGGGA
CTCTTCAGTG CCTTATTGCA CATTATCTAC AACCCAAGGG AAGCCGAGTT TCGCGAATGG
ATCCAAGCAG CTGAACGAGA AACCTGGCGA CACATTCCGG CCATCAAATC GCACATCGCC
GAAGAAGTTA CTCTGGATCG TTGTGTACAA AGGCTACGAA AAAGA
 
Protein sequence
QLESEQQQAS VQDEWMDLAE RIDSIQGNDV WRSDPSCPLY EQERISARID ELVHLMRRRD 
IFELMFVLRA SIGRNKFGLL HEGLFSKALA GTKVLVETYH NVVCAALDFC CDAPVSPDED
PIPTDARLAF FNETRHAYGR TALLLSGGAA LGFYHTGVVK TLMENRLMPR VIGGSSAGSL
VCAMIATRTD EECRTLQGLV PIPLRHFSAV LYDIVTGNRR PQDMLMNDTE HFRACVRACV
GNFTFQEAFD RTGRILNIVV TPKNNSDPPR LLNYLTAPHV MVWSAAVASS SLPGVFEANR
LVVKEADGWE RYESGGAPQH FSDGSMEQDL PMQQLSEMFN VNHFLISQAN PHAVMFANYQ
QKNSVWSNPV TGFVDSILTF LRDQVRTWLL HLVACVGARS ITPMFQTQRG IGTTFLTQEY
EGRSCDISLI PWLGHRGLFS ALLHIIYNPR EAEFREWIQA AERETWRHIP AIKSHIAEEV
TLDRCVQRLR KR