Gene PHATRDRAFT_49751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49751 
Symbol 
ID7198340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011692 
Strand
Start bp164121 
End bp165699 
Gene Length1579 bp 
Protein Length451 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184576 
Protein GI219128766 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGTTGT TTACATCCCG GAGACTGAGC CTACGCTCAG CGCAGCAACT TCCAGGGATG 
CTGGCGATGG CAAATCCCGG GTACCCTAGC AACAATACCT CCAATTCCCA GGATGACAGG
GCCATCGAAA CTTGTCATCC GGGAACGCTT CGTCCGTCGC GACGCACGAG GATCGAATCT
TGGACAACGG TTCGAAGGAA CTACCACGTC TCGGCAAGTC GAAGCTTTCT GCTTCCCTTG
TTAGCGCTCG TTCCCGTTGC GGTGCCCGCC GTCTATTTTG GTCGAAAGTA CTTGCGAGCG
CAACAGTTAC GACGAGAAAG GGTTGAGGAT CAGCAATGGC ACGTCTTGCA AGAGCAGCGC
AAAGCAGACC AGACAAGCGG AAAGGAGTAC AACAACATCG TGCTGGACTT GGGAACAACC
GACTTGAAAA CTGTCGCATT CAATAACCCT ACCGTGCAGA GTGATTTCCA GCCGGACGCA
TTGTGGGCTC TTCAAAGCAG CACCACCACC AACGCCACCA CCAATAGCCT TAATTCGAAG
GCACTCTTTC GTCTCGTACG GCATTACTTG CAAAAGTCCA TGCAGCAACC AGTCGACTCA
CGCCGAGACC ACGTTCGTGT GGTGCTGACG GTACCACCCA AGCCAGAAAC TGCTGAGCGT
TATCGTCACG TATTGCAGGA TTCACTCCCG TCGGATTCCT CCGTGGTATG GCTTCCGGAA
CCGGTAGCGG CCATTTGGGG TGCGCAAGTC CTCCATCTCT TGCCCATCCC ACATGATCCT
CGAACGCTTG TGATTGACAT TGGGGGACGA ACGTCAACCG TGTCGCTTGT GGAAAAAGAT
CACGTCGTAG CAGCCACAGT CTTGCCCAAT ATTGGTGGAC AGGTCTTGAT TGACGCCTAC
CAACGGAACG AGCCGCATAG TACGTGGAAA GAGGCACAGC AGGCTGTCTT GGATTGCGAG
GTAGCAGCCC ATCTATACAG CGATGTGGAA GCTACTGTTC TACAGGAAGC GCTTCCGAAA
TTGTACCGAG GAGTTCATTT GTCGGACCAG CTGCCCAAAC CAACCACTCT GGAAACTCTC
TGGGAGTCTG CTGTTGCCCA TTTGCTGGAA ACTTTAGAAA GCTCCTCATC ATTGCGCTTC
ACTACCGTCG TGGTTGTAGG TGGGGCCTCC AACAATGAGA CGTATCAACA TTCCCTCCAA
GCAGCCGTCA CGAAGACAGT AGCTCCGCAA AGTGTCGACT GGATATTGCC TTCCAAAACG
GATCGATCTC AACTCGTTAC TCTCGGGGCC AGGTCCATGA TGGCCTCCTG TGACTATTCA
TTTGACAAGG GTCTGTGCCC AAAATCCAAT GTGTGAGAAG AAACAACTCA CCTGTCAGCA
ACGGGACTAT AGTAAGAAAG GTTGCGTTAT CAAACAGCGC TGTAGTCTCC GGACAGACCC
AGAAGAGCAG CTTCGTTCGG CTCGAGCGGG CAACAAATGG GATCTGGGAG CTCCTGTCAC
GACATTTGTG CCAGCTCTTT ATTGAGGATT TGAGATAGTC ATTAACAGCA TGTTACCAAG
GAAAGCGATT TATGCTGAA
 
Protein sequence
MLLFTSRRLS LRSAQQLPGM LAMANPGYPS NNTSNSQDDR AIETCHPGTL RPSRRTRIES 
WTTVRRNYHV SASRSFLLPL LALVPVAVPA VYFGRKYLRA QQLRRERVED QQWHVLQEQR
KADQTSGKEY NNIVLDLGTT DLKTVAFNNP TVQSDFQPDA LWALQSSTTT NATTNSLNSK
ALFRLVRHYL QKSMQQPVDS RRDHVRVVLT VPPKPETAER YRHVLQDSLP SDSSVVWLPE
PVAAIWGAQV LHLLPIPHDP RTLVIDIGGR TSTVSLVEKD HVVAATVLPN IGGQVLIDAY
QRNEPHSTWK EAQQAVLDCE VAAHLYSDVE ATVLQEALPK LYRGVHLSDQ LPKPTTLETL
WESAVAHLLE TLESSSSLRF TTVVVVGGAS NNETYQHSLQ AAVTKTVAPQ SVDWILPSKT
DRSQLVTLGA RSMMASCDYS FDKGLCPKSN V