Gene PHATRDRAFT_21379 details


Gene Information

Locus tag: PHATRDRAFT_21379
Symbol:
ID: 7202036
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011680
Strand:
Start bp: 542025
End bp: 543120
Gene Length: 1096 bp
Protein Length: 326 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002181224
Protein GI: 219121752
COG category:
COG ID:
TIGRFAM ID:
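
The length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (an assumption, although it reproduces the listed gene length) and that the coding region ends with a single stop codon. The function names `span_length` and `coding_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 542025
END_BP = 543120
PROTEIN_LENGTH_AA = 326


def span_length(start: int, end: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def coding_length(protein_length_aa: int) -> int:
    """Nucleotides needed for the protein plus one stop codon (assumed)."""
    return (protein_length_aa + 1) * 3


gene_length = span_length(START_BP, END_BP)
cds_length = coding_length(PROTEIN_LENGTH_AA)

# Gene length, coding length, and the part of the span not explained
# by the protein and its stop codon.
print(gene_length, cds_length, gene_length - cds_length)
```

Under these assumptions the coordinates give 1096 bp, matching the Gene Length field, while a 326 aa protein plus a stop codon accounts for 981 of those nucleotides.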


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CCTTGCAACA AGAGCGAACG GGAAGCAATC AGATTTTCTC AAATTCCGCT CGAGTACCTT 
TCCGCTACAG CGGATCGGGA GGAGCAACAA CACAGATCGG GGGCCTCGAG ACCCGATGGC
CGAACGACTT GAAGAGTCGA TTCTACCAAA TGTTCTGGGA TCGGCCTGTG CCGGTATCAT
AGCTCGCATT TCGACGCATC CACTCGATAC AACCAAAGCT CGCTTGCAGG CCCAAAGCGC
CCCGAGGTTC CGAGGTCCTG TTGACGCTCT GGCACAGACT GCCAGAGCCG AAGGTATCAC
CGGCTTGTAT CGAGGCTTTG GGGCAGTAAT CATCGGTGGC ACACCAGGGA CTGTTCTTTA
CTTATGCAGT TACGATTTCG TTAAAAAAGG GCTTTCGCAA GCTTGGGAAT CACGTATGAA
TCAACCTATG GAAGGCACGG GTGCAGATTT TGCCGTACAT TTTACGGCAG GAATGCTGGC
AGAAACAATC GCATGCATCA TCTATGTTCC AGTGGATGTT GTGAAAGAAA GAATGCAAGT
CCAACAGGGC TTACAAAGCT CACCATCGGC TTATAAAAGT AGCTGGGACG CTTTTCAGAA
GATTGCAAGG TCCGAAGGCA TTACTGGAAT CTACAAGGGC TATACGGCTA CGTTGGGCTC
GTTTGGTCCC TTTTCAGCGC TGTACTTTGT CTTTTACGAA AAATTGAAAC GCTCGAGTTG
TCAATATGTA TCCAGAGAAC CGTATACTAT ATCTGGCTCT TCGGGAAGAA ATACGGAACT
TCCTTTTCCT TGGGTGGTAG GTTGTAGCGC TGGTGCTGGA GCACTAGCGT CGTGGCTTAC
ATCGCCTCTG GATATGGCAA AATTGCGGCT ACAAGTGCAA CGTGGACATA TTGCGCAAAA
TGCTTCTTCT TTGGCTCCAG TAACGTCATA TCGAGGCGTG TGGGACTGCT TAAAGCAGGC
ACATAAGCGC GACGGATTTC GTGGCCTTTT TCGCGGTGCT GGTGCTCGAG TTCTCCATTT
CGCCCCTGCG ACAACGATCA CGATGACTAG CTACGAAATG TGTCGCTCTC TGTTTGCGGG
TATAGGAGGT GCATAG
 
Protein sequence
MAERLEESIL PNVLGSACAG IIARISTHPL DTTKARLQAQ SAPRFRGPVD ALAQTARAEG 
ITGLYRGFGA VIIGGTPGTV LYLCSYDFVK KGLSQAWESR MNQPMEGTGA DFAVHFTAGM
LAETIACIIY VPVDVVKERM QVQQGLQSSP SAYKSSWDAF QKIARSEGIT GIYKGYTATL
GSFGPFSALY FVFYEKLKRS SCQYVSREPY TISGSSGRNT ELPFPWVVGC SAGAGALASW
LTSPLDMAKL RLQVQRGHIA QNASSLAPVT SYRGVWDCLK QAHKRDGFRG LFRGAGARVL
HFAPATTITM TSYEMCRSLF AGIGGA
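
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the blocks could be joined, the GC fraction computed, and a coding stretch translated. It assumes the standard genetic code (NCBI translation table 1), since the Translation table field of this record is empty. The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (assumed), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame; unknown codons become 'X'."""
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First two blocks of the gene sequence listed above.
fragment = clean_sequence("CCTTGCAACA AGAGCGAACG")
print(fragment, gc_fraction(fragment))

# Coding stretch taken from the second and third lines of the gene sequence.
print(translate(clean_sequence("ATGGC CGAACGACTT GAAGAG")))
```

The translated stretch corresponds to the first seven residues of the protein sequence (MAERLEE). Applying `gc_fraction` to the full cleaned gene sequence would allow a comparison with the GC content field.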