Gene PHATRDRAFT_39374 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_39374 
Symbol 
ID7195119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011687 
Strand
Start bp326262 
End bp327362 
Gene Length1101 bp 
Protein Length366 aa 
Translation table 
GC content55% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183342 
Protein GI219126183 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGCGT TCGATCCCGT GGACTCGGAA AAACCCGCGG AACTCCTGGA TCCCCAAACC 
TTGCACAACG GTACCATTCC CTATCCAACC GCACTTTCTC CTTCGGCCAT TTTGGAATTT
CAGAAATGCC CGCAATCTTA CCTATTCCAA TACCTGTACA AACTCAAGCA ACCCACCAGT
CTCGCCTTGG CGAAAGGTTC CATGCTCCAT CACGCCTTGG AACAAGTGTA CGATCTGCAA
CCCGCCGAGC GTGATTTATC CACTCTGCAG AATCTATTCC GTCGCGTCTG GAGCCAAAAT
CGCGAATCGG ATGTGTACCG GGACTTGTTC GCGACTCCCG AAGCCGCCCG GGATCTCGCG
ATGGAATCCG TCTGGGGTCG GGAGGGGCTC CAACTGTTGC AGAATTACGT GCGCTTGGAA
AATCCGCAAG CCGTCACTCG CCCTAATCCA GTCCAACGGG AAATATGGGT CCGGGCTCAT
CTCACCATCG ATTCCTCACA GGGTGCGACG GGGTACGTCC TGCCCGGTCG CAAACCCCAG
GCCACGTCCA CACCGAACAA CAAGGACGCA GCCGCCTTTT TAGTCCGTGG CATTGTGGAT
CGGTTGGATA TGGTCCGGAC GCCTGAATCG AAACAAGCAG TCCTACAAAT CGTCGATTAC
AAAACCGGCA AGGCGCCGCA TCTCAAGTAT AGTGCGGGCA TGAATCAGAA AATTCGGGAC
GAAGCGCTGT TTCAACTGCA AATCTACGCA CTCCTGTTGC GCGAAAAGCA ACTCCAGAAA
CAACAGCAGT TCGAAGACGA TGCGGCCGCG TCCTCGCAAA CCTTACCCGT GCGATTTCTG
CGTCTCTTGT ACCTAACAAA CGTCAACGAT CAGGCCGAAA CGCTGGACAT GGACTTGGGA
GCGACGCCGC TTGAGCGGGA CTCACGACTC CAAGACGTGC ACGCGCAAAT CTCTACCGTG
TGGAACTCCA TTATCGATAT GGTCAGTCGA CAGGACCCTC ACGCCTTTGT CGGCTGTGAC
CGGTCGTTTT GCTATTGCCA CAAGTGCCGA TCGCGATTCG TGCCGGGATC TGTATGGGAA
CCGCCGGTGG AACCGACCTA A
 
Protein sequence
MAAFDPVDSE KPAELLDPQT LHNGTIPYPT ALSPSAILEF QKCPQSYLFQ YLYKLKQPTS 
LALAKGSMLH HALEQVYDLQ PAERDLSTLQ NLFRRVWSQN RESDVYRDLF ATPEAARDLA
MESVWGREGL QLLQNYVRLE NPQAVTRPNP VQREIWVRAH LTIDSSQGAT GYVLPGRKPQ
ATSTPNNKDA AAFLVRGIVD RLDMVRTPES KQAVLQIVDY KTGKAPHLKY SAGMNQKIRD
EALFQLQIYA LLLREKQLQK QQQFEDDAAA SSQTLPVRFL RLLYLTNVND QAETLDMDLG
ATPLERDSRL QDVHAQISTV WNSIIDMVSR QDPHAFVGCD RSFCYCHKCR SRFVPGSVWE
PPVEPT