Gene PHATRDRAFT_41857 details


Gene Information

Locus tag: PHATRDRAFT_41857
Symbol:
ID: 7197913
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011672
Strand:
Start bp: 1231010
End bp: 1232650
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002178390
Protein GI: 219115189
COG category:
COG ID:
TIGRFAM ID:
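
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS length counts a terminal stop codon (the gene sequence on this page ends in TAG). The function names are hypothetical and are not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1231010
END_BP = 1232650
GENE_LENGTH_BP = 1641
PROTEIN_LENGTH_AA = 546


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons hold for this record: 1232650 - 1231010 + 1 = 1641 bp, and 1641 / 3 - 1 = 546 aa.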


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.968452
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCCCCG ACAAGTCCAA AAACATTCTC AAAAACATCA ATCTGTCCTT TTATCCCGGC 
GCCAAAATTG GGGTGGTGGG CTTGAACGGA TCCGGAAAAT CAACCTTACT CAAAATCATG
GCGGGAGTGG ATACCGAGTT TGACGGTACC GCCCGGCCCT TGCCCGGAGC TTCGATTGGC
TACCTTCCAC AGGAGCCGGC CCTTCCTTTT GCTACCGTCC AAGAATGCGT CGACGAAGCT
GTACAATCCT CGCAAGCCAT TCTTGATGAG TATAACCAAC TCAGCATGAA GCTAGCGGAT
CCTGATCTCA CCGATGATGA AATGAACAAG ATCATGACCA AGACGGAACA ACTGACCAAT
CAAATTGAAG CTGGAAATCT GTGGGAACTC GAGCGAATTG TGGAGCGCGC CATGGATTCC
TTGCGGGTGC CACCCGGGGA CGCCAAGACG GCCGTCCTGT CGGGTGGCGA AAAGCGCCGC
GTAGCGCTTT GTCGCTTGCT CCTGGCCAAT CACGATATGC TTCTACTAGA CGAACCCACG
AACCATTTGG ACGCCGAATC GATCGGATGG TTGGAGCAGT TCTTGGCACA GTTCAAGGGA
ACGGTGGTCT GCATCACCCA CGACCGATAC TTTCTCGAAA ATGTGGCCGA GTGGATTTTA
GAGCTAGACC GAGGAGAAGG CATCCCGCAC GAAGGCAACT ACTCAAGCTG GCTGGAGGCC
AAGAGTAAAC GTCTCGAGGA GGAAAAGAAA AAAGACACCG CGGCAGCCAA GGCTGTTGCA
GCCGAACTGG AATGGATTCG GAGCAACCCC AAGGCCAAGG GCAACAAAAG TAAGGCACGC
CTCAACCGCT ACGATGAGCT ACTGTCCGCT GCTGCTCCTA CGGAACTCCG GAACGCGGGA
CAAATCTACA TCCCCCCGGG TCCTCGGTTG GGCGATGTCG TGGTGGATAT CACCAACATG
CGCAAGTCGT TCGATGAGCG CTTGCTAATT AAGGATTTGA GCTTTTCCAT GCCCAAAGCT
GGTATTGTGG GCGTCATTGG CCCGAACGGT GCCGGCAAGT CGACACTCAT CAAAATGCTA
CTCGGCAAAG AGCAACCCGA CTCTGGTGAG GTCAAAATCG GTGAGACCGT GAACATCGTG
TCTGTTGGGC AGGAACGCAT GGATGAGTTG AACTCGGAAA AGACTGTGTT TGAGGAAATC
TCCGGAGGGC TCGATGAGCT CGAGCTGGGC ACCCAAACTG TGCAATCTCG TGCCTATCTT
TCCTGGTTTG GGTTTAAGGG AGGAATGCAG CAGGCCAAAG TGGGAAATCT ATCAGGTGGC
GAGCGCAATC GTGTCCAGCT CGCCAAGATT CTCAAGGCCG GTGGCAATAT GATTATTCTA
GATGAACCAT CGAACGACTT GGACGTCGAA GTCTTGCGCA GTCTGGAAGA AGCGCTGTTG
AATTTTGCGG GCTGTGCCAT GGTGGTGTCA CACGATAGGT ACATGTTGGA TCGCGTGGCG
ACCCACATTC TGGCCTGCGA GGGTGATTCG GAATGGTTCT TCTTCCCAGG CAACTATGCC
GAATATGAGG CCAACCGTCT GGAACGCAAG GGCCAAAGCA GCATTAAGCG CGTCGCCTAC
GCGCCTTTGC TGAACGCGTA G
 
Protein sequence
MLPDKSKNIL KNINLSFYPG AKIGVVGLNG SGKSTLLKIM AGVDTEFDGT ARPLPGASIG 
YLPQEPALPF ATVQECVDEA VQSSQAILDE YNQLSMKLAD PDLTDDEMNK IMTKTEQLTN
QIEAGNLWEL ERIVERAMDS LRVPPGDAKT AVLSGGEKRR VALCRLLLAN HDMLLLDEPT
NHLDAESIGW LEQFLAQFKG TVVCITHDRY FLENVAEWIL ELDRGEGIPH EGNYSSWLEA
KSKRLEEEKK KDTAAAKAVA AELEWIRSNP KAKGNKSKAR LNRYDELLSA AAPTELRNAG
QIYIPPGPRL GDVVVDITNM RKSFDERLLI KDLSFSMPKA GIVGVIGPNG AGKSTLIKML
LGKEQPDSGE VKIGETVNIV SVGQERMDEL NSEKTVFEEI SGGLDELELG TQTVQSRAYL
SWFGFKGGMQ QAKVGNLSGG ERNRVQLAKI LKAGGNMIIL DEPSNDLDVE VLRSLEEALL
NFAGCAMVVS HDRYMLDRVA THILACEGDS EWFFFPGNYA EYEANRLERK GQSSIKRVAY
APLLNA