Gene PHATR_43944 details

Gene Information

Locus tag: PHATR_43944
Symbol:
ID: 7204173
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011671
Strand:
Start bp: 498908
End bp: 500582
Gene Length: 1675 bp
Protein Length: 493 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002186072
Protein GI: 219112977
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.323695
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TCGATTCGAT TTATCCCTGG GTATAGCTTT AATCGACGAC GACGATGATG ATATTGTCCT 
CTACCGCCGC TCGTGCAACT GCCAAGGCTT TGAGTCGACC CGGTTCTCGG GCGTTGGCTT
CGCAAGCGGA ACACAACTTG ACGAGGTGAG TTGGGCACTA AATAGACGTA GAAATATGGA
CTCCAGGAAA TTTCGCCATT CCGAATGCTC ACATTCTCAT CACGCATATT CTCTCTCCCA
GTGGTTTTAA CTTTTTATCC GACCAAGACC GTATTTTTAC CAATTTGTAC GGCGAGCAGG
ATTGGCGTTT GCCCGACGCC ATTAAGCGCG GTGATTACCA CTTGACGAAG GAAATCATGT
GCATGGGTCC GGACTGGATT ATCCAGGAAA TCAAGGACAG TGGGCTCCGG GGACGCGGTG
GAGCCGGCTT TCCCTCCGGG CTTAAGTGGA GTTTTATGCC CAAGGAGACA GACGGAAGAC
CCTCCTTTCT GGTTGTCAAC GCGGACGAGT CCGAACCCGG TACCTGTAAG GACCGCGAAA
TCATGCGCAA GGACCCGCAC AAACTCATTG AAGGTTGCAT CCTGGCGGGC TACGCAATGC
GTGCCCGAGC AGCCTACATT TACATTCGTG GAGAGTATTT CAACGAAGCC GTAGTGCTGG
ATGAAGCCAT CCACGAAGCG TACGCCGCCG GACTCCTCGG GAAGAACGCC TGTGGTTCGG
GGTACGACTA TGACATCTAC CTCCATCGAG GTGCCGGCGC CTATATTTGT GGAGAAGAAA
CGGCTTTGAT TGAAAGTCTA GAAGGCAAGC AAGGAAAACC TCGTCTCAAG CCTCCGTTCC
CGGCGGGTGT CGGTTTGTTT GGATGCCCCT CGACGGTAAC CAACGTGGAA ACCGTGGCCG
TGGCCCCGAC GATTCTTCGT CGCGGTGCGT CCTGGTTCGC CTCCTTTGGG AACGAAAACA
ACCGCGGCAC CAAACTCTTT GCGATTTCCG GTCACGTCAA GAACCCCATG GTCGTGGAAG
AAAGCATGTC CATCCCGCTC CGTGACTTGA TTGACAAACA TTGCGGTGGT ATGCGCAACG
GCTGGGAGTC CGTCCAAGCC TGCATTCCGG GTGGTTCCTC GGTCCCGGTC CTCAACAAGG
ACCAGTGCGG CGAAGCGCTG ATGGAGTTTG ACGACTTGCG CGCCAAAGGA TCTGGTCTCG
GTACGGCCGC TGTGACAATG TTCGACAACA CGGTCGACAT GGTGGGTGCC ATTCGTCGCT
TGTCGCACTT TTACAAGCAC GAGTCCTGCG GTCAGTGTAC ACCCTGTCGC GAAGGCACGG
GCTGGCTGGA AGATATTCTC ATTCGGATGG AAAAGGGAGA CGCCGACAAG CGCGAAATCC
CTATGCTTGA GGAAATATCC CGCCAGATTG AAGGCCACAC GATTTGCGCC CTCGGGGACG
CCGCCGCCTG GCCTGTACAG GGACTTCTGC GTCACTTCAA AAAAGATATC GAAGACCGTA
TTGACAATCC AAAAGGCTTT GATCACGAAG CCGCTTTTCA AAAGGCCTGG AGTGGCGATC
CTTTCGACAA CAACGCCTGG ACTAAGGAAC ACGGTGACGG CAAGACCTAC GCCGCGGCGT
AATAAAACGG AATTGTGCAA TGGATAAGAG TAGGAATATC GTTGACTGAA AATAC
 
Protein sequence
MMILSSTAAR ATAKALSRPG SRALASQAEH NLTSGFNFLS DQDRIFTNLY GEQDWRLPDA 
IKRGDYHLTK EIMCMGPDWI IQEIKDSGLR GRGGAGFPSG LKWSFMPKET DGRPSFLVVN
ADESEPGTCK DREIMRKDPH KLIEGCILAG YAMRARAAYI YIRGEYFNEA VVLDEAIHEA
YAAGLLGKNA CGSGYDYDIY LHRGAGAYIC GEETALIESL EGKQGKPRLK PPFPAGVGLF
GCPSTVTNVE TVAVAPTILR RGASWFASFG NENNRGTKLF AISGHVKNPM VVEESMSIPL
RDLIDKHCGG MRNGWESVQA CIPGGSSVPV LNKDQCGEAL MEFDDLRAKG SGLGTAAVTM
FDNTVDMVGA IRRLSHFYKH ESCGQCTPCR EGTGWLEDIL IRMEKGDADK REIPMLEEIS
RQIEGHTICA LGDAAAWPVQ GLLRHFKKDI EDRIDNPKGF DHEAAFQKAW SGDPFDNNAW
TKEHGDGKTY AAA
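
The gene length, protein length, and GC content reported in this record can be re-derived from the coordinates and the blocked sequences. The following is a minimal sketch, not the database's own method: the helper names (clean, span_length, gc_percent) are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates, which is consistent with 500582 - 498908 + 1 = 1675.

```python
def clean(seq_block: str) -> str:
    """Join a blocked sequence (10-character groups, line breaks) into one string."""
    return "".join(seq_block.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def gc_percent(dna_block: str) -> float:
    """Percentage of G and C bases in a (possibly blocked) DNA sequence."""
    dna = clean(dna_block)
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


if __name__ == "__main__":
    # Coordinates taken from the Gene Information section of this record.
    print(span_length(498908, 500582), "bp")
    # First line of the gene sequence only; paste the full block to check
    # the reported gene length and GC content.
    first_line = "TCGATTCGAT TTATCCCTGG GTATAGCTTT AATCGACGAC GACGATGATG ATATTGTCCT"
    print(len(clean(first_line)), "bases in the first line")
```

Passing the full Gene sequence block to clean and gc_percent, or the full Protein sequence block to clean, gives values to compare against the Gene Length, GC content, and Protein Length fields.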