Gene PHATRDRAFT_22166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_22166 
Symbol 
ID7203427 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011684 
Strand
Start bp117171 
End bp118431 
Gene Length1261 bp 
Protein Length280 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182618 
Protein GI219124663 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00107582 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGGAAG GCTCGGTAAA CAGCGAAAAT GTGGGGGTCG CCTTCGGCTT GGTGATTGGC 
GCCGGCGCTG CCACAAGTCT CGGGGCGGGG GTCGTATTTG TACCCGCGCT TGTCAAACTG
GCGTCGAGAC GGACGTTGGC GGCTGCGTTA GGACTGTCCG CCGGTGTCAT GGTCTATGTA
TCTCTCGTTG AAATTTTCAA CGAGGCCAAT CGGCATTTCG AAGAAGCGGG TTTTCCCACT
GATGAAGCCT ACCTATACGC GACAATCAGT TTTTTTAGTG GAGTCATTGT GATGGTGGTA
CGTTGTATGG ATCTCACGAA AAATATCCAT CATGGGATGC GCGAAAAGAA GATCTCTTCA
CAATTGCTAA CACTCTTGTT GATTTATTTA CGGTAGCCGC TCAACTTTTT AGTTTCTTGG
TTGCTGGGGG GACATGACGA ACACGAATTC CCGCCATATC TAGAGGATAA GGAAACGCAG
GAGTTCTCCA ACGCGGAAAG CGCCGTCCAG GCTGCTGCAG AGGAATGCGG TACTACTGTA
ACAGCGTGTC CGTGTTGCTC TGAAGATCCC GCCAAAGAGT TGGAGTGTCT GCAGGAAATG
GCGTTGGAAC TGGGAAAAAG GGAGCACGAA CCCGAAGCCG GGTGCAGTGT ATCGGACCAC
GAGGATTTCC CGGCGCAGAA AGACGAATGT GTGGTGCAGG GCAAAGATCA GAAAAAGCTG
CTCCGGATGA GTATCAATAC AGCGTTGGCG ATTGGCATTC ACAATTTTCC TGAAGGCCTC
GCCACGTTCG TGGCGACGCT CGACAATCCG CGGGTCGGCG CCATTTTAGC CGTCGCCATT
GCCATTCATA ATATTCCCGA AGGTTTGTGC ATTGCCATGC CGATTTACTA CGCAACGGGC
AACCGCTGGA GGGCTTTTGG TTGGGCCATG GTATCGGGCA TGTCCGAACC ACTGGCGGCA
CTTTTGGGTT GGGCCGTTCT GGCGAGTTGT TTCACGCAAA CAATGTTTGG TGCGCTGTTT
GGTGTAGTAT CGGGCATGAT GGTAATTGTA TCCGTCCGTG AATTGCTGCC AACGGCACAT
CGGTACGATC CAGACGATGT CGTGGTGACT TACTCGTTCA TGGCGGGAAT GTTGGTCATG
GCGGTCTCGC TAGTGCTCTT TTTGGTGTAA AACTACTGCA AACCGGCACC CCCAAAGCTC
TTTCTCCAAA CGAGCGTCGG CGAAATATGC ACAAGTTAAC CGTAAAGCGT GTCTTGCTGT
C
 
Protein sequence
MSEGSVNSEN VGVAFGLVIG AGAATSLGAG VVFVPALVKL ASRRTLAAAL GLSAGVMVYV 
SLVEIFNEAN RHFEEAGFPT DEAYLYATIS FFSGVIVMVH EPEAGCSVSD HEDFPAQKDE
CVVQGKDQKK LLRMSINTAL AIGIHNFPEG LATFVATLDN PRVGAILAVA IAIHNIPEGL
CIAMPIYYAT GNRWRAFGWA MVSGMSEPLA ALLGWAVLAS CFTQTMFGAL FGVVSGMMVI
VSVRELLPTA HRYDPDDVVV TYSFMAGMLV MAVSLVLFLV