Gene PHATR_44047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_44047 
Symbol 
ID7204230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp787924 
End bp789434 
Gene Length1511 bp 
Protein Length463 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186130 
Protein GI219113093 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.298129 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGGATAAGGC GTGATCATGA AGGTAAAGAC GTTGCAACGA GCGCCGGGTG CCGTAGAGCG 
CGAGTGCTCC GGAGATCTCC GTCGCCAATC GCGCAATCTC AATCCACAGT ATCATGTCAT
GCAGAGAGCC CGGGAATACA CCCGCGCCGT CACGGCAGCC AAGATGGATC GCATGTTTGC
CAAGCCACTC GTGGGCAATT TGGGACACGG GCACCGGGAT GCGGTCACGT GTACGGCCGT
TTCGCGACGG GCCTTGCTAC CATTAGCGTC CGGGGCGGCT GATGGCGTGG TGCAGTTGTG
GGATCTGCAG TCGCGGACTT CGGTAGCTAC CATCAACGCA CATAATCGGG TAGTCACAGG
AATGTCGTTT GATGTTTCCG GACAAGCCTT TTACAGTTGC AGTGACGACG GAAAAGTACA
CCGATGGTCC ATTCATCCGC AAGTTGAAAA CCAAGACGAT GAGGACGACA ATAATGCGGT
CACGGAACCC ACCTACGGTC CACTCGCGAC TTGGCGCTGC AATGGAGTTT TCAAAAGTAT
CGATCACCAC TGGCACGACG ACCGCTTCGC AACAGCGTCC GACTCGGCAG TACAAATATG
GTCGCCCACC CGTTCTAACG CGTTGCAAAC ACACGACTCA CTCTGGGGGT CCGACGACAC
CGTCACGGTG GTACGTTTCC ATCCGGCCGA ACGAGATTTG CTCGCGCACG TTTCCGCCGA
TCGCGGTATC GGCTTGCACG ACACTCGCAC GGGAGCGGCC TTGAAAAAGA CTACACTTCG
GATGCGATCG AACGACCTAC AGTGGAATCC AATGGAACCC ATGAATTTTG TTGTCGCCAA
TGAAGACTAC AACGCCTACC TTTTCGATAT GCGAAAATTG TCCGAGCCGA AGACGATCTA
CAAAGGGCAC GTGTCCGCCG TTATGAGCGT GTCGTGGTCA CCCACGGGAC GAGAATTTGC
GACGGGTAGC TACGATCGAA CCGTACGAAT TTTTAAGGCC AGCCAGGGGG GTGCCGCCCG
GGACGTTTAC CACACAAAGC GTATGCAACG CGTTTTTTGC GTGAATTATA CGATGGATCA
CAAATTCTTG GTCAGTGGCA GTGACGACAC CAATTTGAGG CTATGGAAGG CACACGCCAG
TGAGCAATTG GGACAATTGA CGCCACGCGA AGAATCGGCC ATGCAATATC GACAGGCTTT
GGTAAGAAAA TACCAGCATT TGCCGGAAGT GCGCAAGATT TCGAAAGCTC GTAAAATACC
CAAGGCGATC AAGAACCAAA CGAAACAAGC TATCATTCAA AAGGAAAGCA AGGATCGTAA
GCATGCTAAT CGGGTCAAGT ACGGAAAAGA TGGAGAACAC GAATTTGTCG GCGAACGAAA
AAAAACGGTG GTCAAAGAAT TGGACTAGAT ATGGTTGAAT TTTCCACCTT TTTCAAAAGC
GTGTTACCCC AGTTCTCGTT GTTGCAACTG TACACCATCC GAGCTTAACT TATAAAGAAC
ACTACGAACC T
 
Protein sequence
MKVKTLQRAP GAVERECSGD LRRQSRNLNP QYHVMQRARE YTRAVTAAKM DRMFAKPLVG 
NLGHGHRDAV TCTAVSRRAL LPLASGAADG VVQLWDLQSR TSVATINAHN RVVTGMSFDV
SGQAFYSCSD DGKVHRWSIH PQVENQDDED DNNAVTEPTY GPLATWRCNG VFKSIDHHWH
DDRFATASDS AVQIWSPTRS NALQTHDSLW GSDDTVTVVR FHPAERDLLA HVSADRGIGL
HDTRTGAALK KTTLRMRSND LQWNPMEPMN FVVANEDYNA YLFDMRKLSE PKTIYKGHVS
AVMSVSWSPT GREFATGSYD RTVRIFKASQ GGAARDVYHT KRMQRVFCVN YTMDHKFLVS
GSDDTNLRLW KAHASEQLGQ LTPREESAMQ YRQALVRKYQ HLPEVRKISK ARKIPKAIKN
QTKQAIIQKE SKDRKHANRV KYGKDGEHEF VGERKKTVVK ELD