Gene PHATRDRAFT_35856 details

Gene Information

Locus tag: PHATRDRAFT_35856
Symbol:
ID: 7201059
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011676
Strand:
Start bp: 911112
End bp: 912242
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002180344
Protein GI: 219119155
COG category:
COG ID:
TIGRFAM ID:
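
The length fields in this record follow from its coordinates. The sketch below shows the arithmetic, assuming 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS that ends in a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(911112, 912242)
print(length)                  # 1131
print(protein_length(length))  # 376
```

Both values agree with the Gene Length and Protein Length fields listed above.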


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.00746929
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTGTAC GCTCGTTTGG ATCGCTGGTG CGGTTGGTAG GTATAAGCTT GTCGGCGTTG 
GTCTTGGCGA CTCTCGGAAA GTTTCAGAGC GAGTACGCGC TTCTAAATGT TGATGAGATG
CTCGCCAATG TGTTGCAGCG ACAGTCGTCG TCGTCGGAAA CGTGGAGCAA TGTAGATACA
GAGGCCGAAG TCAATCTGAC AGTAGGCGCT GCCCTACCTC CTTCGACAGA AAATGTCGTA
GAGAAAGCAG CGCCAGCAAA CAAGTCCATC CAAGTCTCGC CATCGTCCGT ACCAGACCAC
GCATTTCCAT CTACGGTCCC GGCCAGTATC AACAATGACG ACTCCATGCA GAAGCGGAAG
AAAGGCATGT ACTCCAATGC TCGAACGGAC AGATCAGGGT CGGTAATCCA AGACATGTTG
GCTGCGCATT CGTATGCTTT CCACCATAAT ATGACGTATC CCGGCGCTTG TTGGACTGAT
TCCAAGGCTC CCAACCAAGC TCGCATCAAG ACAAATAAGC AGCTGTTTGC TGCTATTGGC
CTCGAAGACG AACTCACCTA CGCCTGCCCC ACGAACATCA GTGACATAGT CAAACGAGGT
CGTTACGCAA ACGTTGATCG TCGGTGGTCT CGAGAATGGC TAGCATTTAT TCGGTCCAGG
GTAAAGTATC CCGAGAAAAA TGTTTCCGCT GGTCACCAGA CGGCAGTGCA TATTCGCCGT
GGGGATGTCA TACCGTGCCC CAAAAATGGA TTGCTGAAAC GATACCGATA CTTGCCGAAC
TCGTATTACC ATGCTGTGAT TGATACGTAT GTTCCCTCCA ATAGCACCGT CACAATTTAC
TCGGAAGAGG AGTCTTACGA GCCCTGGGAC AATTTCAGGC AATACAACTT GCGCCTGAGT
GCAAGTTTGG TGGACACGTG GCGAGATATG ATGATGGCGG GCACTCTCAT CCTTTCCAGG
AGTACTTTTT CCCTTGTCCC TGCTCTTTTG AATAGGCACG GGACTGTATG GTATGCCCCG
TTTGGGACGC CCAAGGTTGA TGGCTGGGAA GTAGTCCCAG ATAATATCAC AGCAATGGCT
GACCGTGACT CTGCCAAACT GAGAAAAAAG GACGCCTGTG TCGCAAAATG A
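
A GC content figure such as the one in the gene information can be recomputed from a sequence printed in this layout. The following is a minimal sketch, assuming the sequence is pasted as text in space-separated 10-base blocks. The function name `gc_percent` is hypothetical, and the toy input is not taken from this record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and line breaks."""
    # Remove the spaces and newlines that separate the 10-base blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Toy example: 4 of the 8 bases are G or C.
print(gc_percent("ATGC GCTA"))  # 50.0
```

Passing the full gene sequence as a single string gives a value that can be compared with the rounded GC content field.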
 
Protein sequence
MGVRSFGSLV RLVGISLSAL VLATLGKFQS EYALLNVDEM LANVLQRQSS SSETWSNVDT 
EAEVNLTVGA ALPPSTENVV EKAAPANKSI QVSPSSVPDH AFPSTVPASI NNDDSMQKRK
KGMYSNARTD RSGSVIQDML AAHSYAFHHN MTYPGACWTD SKAPNQARIK TNKQLFAAIG
LEDELTYACP TNISDIVKRG RYANVDRRWS REWLAFIRSR VKYPEKNVSA GHQTAVHIRR
GDVIPCPKNG LLKRYRYLPN SYYHAVIDTY VPSNSTVTIY SEEESYEPWD NFRQYNLRLS
ASLVDTWRDM MMAGTLILSR STFSLVPALL NRHGTVWYAP FGTPKVDGWE VVPDNITAMA
DRDSAKLRKK DACVAK