Gene PHATRDRAFT_44994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44994 
Symbol 
ID7199512 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011673 
Strand
Start bp923123 
End bp925846 
Gene Length2724 bp 
Protein Length840 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179091 
Protein GI219116592 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0685164 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAAGC TTGTCTTTCT GGCGACCCTT GCAGCGTTTC TTACGTTGGC GGATGCCAGT 
GATGGTTCCA TTGTCGGTAC GTGCAGAAAC CCCCTCTTGA GTTTGTAGTT GTATATTCAC
TGTCACTCCA GATTAGAAAA CTGACGCATG AGGTCCTCAA TGTCCCTTCC ATTGGTCACC
ACAGACTCCA TCGAAGAACG AAAGTTAGCT GGGACAACGT GCACTACTAA GAATTTGGAC
TTTAGCGAAT TTGCTGCCGG AACCTACTTG AGTAACTTGG AGGCTGCCTA TGGGGTGACT
ATCACTGCCG TTTCCCGTAC AAGCAAGGGC TACACACCCA ACGGAGCCGC CCGTGTTTTC
GACACATCCA AGCCCACCGG TGCAACGGGA CAATCAATGT GCTCCTCTGG TGACGGTGAC
TCTGATCTCG GATCACCCAA CTCCGCTTGT CCCGGAGGTG GACCAGGTCA CGGACCTGGA
GGTGCACCAA AACTCGCCAA CGGACAGAAC AATCCTTATA AGAACTGCTC GCCCCAAGGC
AAAGTACTCA TCATTCAAGA AGGCAACAAA AATTGTCCCG ACGACAGTGC GGACGGTGGT
ACTATCCGCT TCGACTTCTC CAAAACAGTG GACCTCGAGT CGGTGACGTC CTTGGATATT
GACGAGGGCA GCACTCCCGA AATCACCGTC TCGTACGGCA ACGGCCAGGA GGCTTTTTAT
AAGCTTCAGG CTACGGGCGA TAACGGTGTT TTTACGCAAA CGATCAACAA GAGTGACGTC
AAGTGGTTCC AGATCAAGTT CTACGGCTCG GGATCCGTAT CAGGCTTCAA GTGGGATGAG
TGTGTCACAG CCCCAACGAA AGCCCCCACG AAAAGCCCCA CAAAGGCTCC GATACCGGCC
CAAACCAGAG ATGATACTTG TCCAACCAAG AACTTGGACT TCAGTGAATT TGCCACCGGA
ACCTACTTGA GCAACTTGGA GGCTGACTAT GGGGTGACTA TCACTGCCGT TTCCCGTACA
AACAAAGGGT ACACACCCAA TGGAGCTGCC CGTGTTTTCG ACACATCCAA GCCCACCGGT
GCAACGGGAC AGTCAATGTG CTCCTCCAGC GATGGGGACC CAGATCTCGG ATCACCCAAC
TCCGCTTGTC CCGGAGGTGG ACCAGGTCAC GGACCTGGAG GTGCGCCAAA ACTCTCAAAC
GGTCAAAACA ATCCTTACAA GAACTGCTCG CCCCAAGGCA AAGTACTCAT CATTCAAGAA
GGTAACAAAA ATTGTCCCGA CGACAGTGCG GACGGTGGTA CTATCCGCTT CGACTTCTCC
AAAACAGTGG ACCTCGAATC GGTGACGTCC TTGGATATTG ACGAGGGCAG CACTCCCGAA
ATCACCGTCT CGTACGGCAA CGGCCAGGAG GCTTTTTATA AGCTACCGGC TACGGGCGAC
AACGGCGTTT TCACGCAAAT GATCAACAAA GGTGACGTCA GGTGGTTCCA GATTAAGTTC
TACGGCTCAG GATCCGTATC AGGCTTCAAA TGGGCCGAGT GCGTCCCAGC CCCAACGAAA
GCCCCTGCCA AAGCTCCGAC AAAAGCTCCT GTCAAAACTC CGACGAAAGC CCCAGTAAAA
GCTCCAACGA AAGCCCCAGT AAAAGCTCCA ACGAAAGCCC CGACTAAAGC TCCAACGAAA
GCCCCAGTAA AAGCTCCAAC GAAAGCCCCA ACCAAAGCTC CCACGAAAGC CCCAACCAAA
GCTCCAGCGA ATGCTCCAAC GAAAGCTCCA ACAAAAGCCC CTGTAAAGGC TCCGACGAAA
GCTCCAGTGA CAGCTCCAAC AAAGGCTCCT GTCAAAGCGC CGACCAAGGC GCCAACCGGT
ACCCGCGATG AAATATGTGT CGACGAAGTC CTCGACTTTA CTGACTTTTC TACAGGCGAG
TACGTCCATG ACCTGGTACG ATCTCGCGGC GTTACAGTGA CAGCAATTGC ATCCGGAAGC
GACGGATACA CCCCCGGCGG TGCGGCTCGC ATTTTCGACA CTCGCTACCC TTCCGGCAGC
ACTGGACAAG CGCTCTGCGC CCAGAACGAA GGTGAAACAA CTCTCGGGTC ACCCAACCTT
TCGTGCCCCG GCGGTGGATC CGGATCGGGT AACGGAGGCA AAGTCAACAC GCCCTTCGCC
AACTGCGACG CTCGTGGTAA GGGTCTCATC ATTCAAGAAG GAAACGTGGC CTGTCCTGAA
CACGCTGGAC AAGGCGGAAA AATTGTGTTT GAGTTTGCGG TACCGGTTGA GCTCAACTAC
ATCGATTTGC TGGTTAGCAC CGACTCCAGT CCGGTAATTA CGGTGTACTA CGGCGTAGAC
CAATCCATTT CGTTTGATAT GCCGGTGATC GGCGCCAATG GCTACCGTCG ACAAGTGATC
GATCGATCGC AGGTTTACAA GGTCGAGGTG GGCTTCTGTA GTGGAGGTAC CGTCACTGCC
ATTGACTACA TTCGTTGCGA GCCTGAAGAG GAATGTCCAC CGAGCACTGG TTCAGTCAAA
CCCCTCCCTC CGATCGAAGT GCATCTTCCC CCGCCGAACA GCAAGCACAT GGTTTTTGAC
TTTGTTGTTA TGAAGAATCA AGAATCGTGT CCTCCGGAAT GGCTTCATTA ATTTGGAGCA
TGCGTTGTGA TCGACACGGG GTCGCGCATA GTGCAGACTG TCAGAAGGCA TTTCATAATT
TATGTAAACA CAGGTACTCA GCCT
 
Protein sequence
MMKLVFLATL AAFLTLADAS DGSIVDSIEE RKLAGTTCTT KNLDFSEFAA GTYLSNLEAA 
YGVTITAVSR TSKGYTPNGA ARVFDTSKPT GATGQSMCSS GDGDSDLGSP NSACPGGGPG
HGPGGAPKLA NGQNNPYKNC SPQGKVLIIQ EGNKNCPDDS ADGGTIRFDF SKTVDLESVT
SLDIDEGSTP EITVSYGNGQ EAFYKLQATG DNGVFTQTIN KSDVKWFQIK FYGSGSVSGF
KWDECVTAPT KAPTKSPTKA PIPAQTRDDT CPTKNLDFSE FATGTYLSNL EADYGVTITA
VSRTNKGYTP NGAARVFDTS KPTGATGQSM CSSSDGDPDL GSPNSACPGG GPGHGPGGAP
KLSNGQNNPY KNCSPQGKVL IIQEGNKNCP DDSADGGTIR FDFSKTVDLE SVTSLDIDEG
STPEITVSYG NGQEAFYKLP ATGDNGVFTQ MINKGDVRWF QIKFYGSGSV SGFKWAECVP
APTKAPAKAP TKAPVKTPTK APVKAPTKAP VKAPTKAPTK APTKAPVKAP TKAPTKAPTK
APTKAPANAP TKAPTKAPVK APTKAPVTAP TKAPVKAPTK APTGTRDEIC VDEVLDFTDF
STGEYVHDLV RSRGVTVTAI ASGSDGYTPG GAARIFDTRY PSGSTGQALC AQNEGETTLG
SPNLSCPGGG SGSGNGGKVN TPFANCDARG KGLIIQEGNV ACPEHAGQGG KIVFEFAVPV
ELNYIDLLVS TDSSPVITVY YGVDQSISFD MPVIGANGYR RQVIDRSQVY KVEVGFCSGG
TVTAIDYIRC EPEEECPPST GSVKPLPPIE VHLPPPNSKH MVFDFVVMKN QESCPPEWLH