Gene PHATR_36720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_36720 
SymbolVPS45 
ID7204608 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011679 
Strand
Start bp217495 
End bp219162 
Gene Length1668 bp 
Protein Length555 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185656 
Protein GI219120849 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTTT TGTTGCTAGA TGCCGTGACG ACGCAGGTTG TATCGTCAGT ATATTCACAA 
ACGGAAATAC TCAATCAACA AGTGTATCTT GTTTCGCGGC TGGATGAGAC AGGAAGCCAT
ACAAACGGGT CTGCATCTGT TTCGAAAAGT CATCTCAAGG CCGTCGTTTT TTGCCGGCCT
ACACAAAACA ATGTGAATTT GATCGCTAAA GAACTCAGCC AACGCCCACG ATTTTTGGAG
TACCACATCT TTTTTTCGGG AATTCTTCCC TCCGGACTCG TACGCGTGCT AGCTGAAAGC
GATAGAACCG AACGAGTTCG GCAGGTAAAA GAGATTTACG CCGATTTTTT ACCCGTAAAT
GAAGATCTAA CGAGTCTACA ATGTCGCAAC ACACTCGCTA TGACGGTCGC TGCCGGAACA
TCCTGGGCAC CGAAGTATGC GGCACAATAC GAGCGCAACA TACAAGGACT CCAGTCCATG
CTGCTGGCTT TGAAACGACA GCCGAGTTGT ATCCGTTACG CGGGGCATTC GGCATGCGCC
GAAGAGCTAG CGAAAGATAT GCACGATGCG ATTCAAGCTG ATGAAATTTT CCACTTTCGA
AGAAGTAATG CTGGTGGTTT GCTGCTTTTA GTGTTGGATC GTCGAGACGA CCCTGTTACG
CCACTGCTTA GCCAGTGGAC TTATCAGGCG ATGGTACATG AGTTGTTGGG TCTCAACAAT
CACCGTGTCA TTCTTCGCGG AGCTCCCAAC GTAACCAAAG ATCTCGAGGA AGTAGTTTTA
GCTGCATCGC AGGACGATTT TTTTCATAGA AACCGCCATA GCAATTTTGG AGAACTGGGC
GAAGCCATTC AAAAACTTTT GAAGGAATAC CAAAGTCAGA CGGCAAATCA AAGCTCGGCA
AGTCTCAATA CAATTGAAGA TATGCAGAAT TTTATGGATA AGTTTCCGGA ACTGCGCTCT
CGATCACACA ATGTGTCGAA ACACGTGGCC ATTATGGGTG AACTAGCTCG CCTAGTCGAA
GTTTGCTCGC TGATGGATGT GTCGCAGTTC GAACAAGAGT TGGCGTGTTC TGATGATCAT
AATACCCACT GGCGAGAGCT CATGGACAAG TTAGGGAGCA ATGCGGTAAA AGTCCCGGAC
AAGCTGAGGT TGGGACTGCT CTATGCCTTG CGCTACGAAA CATCAGCTAA TATACACATG
GTACAGTCAG CGATGGGTAA AGGCGGTGTA CCGCAGGATA TGGTGGATCT TGTGAATGTT
ATGCTACGAT ACGGTGGGGC AAAGTCAAGA GGACCAGGTT TATTCGGAAA CCACGATTTA
ATGAGCAAAA TGACCAAGAA TTTCATGACA AGTGTACAAG GCGTCGAGAA CGTGTATGCG
CAGCATGTTC CTCTTATCAT GGACACTGTC CAAACAGTTA TGAAGGGCAA GTTAGCGGCG
AGGACTCATC CTATTGTCCC TGGGTCTTGT ACAACTCGAC TACATGGTGA TACAGTCGTT
CCAGAAGAAA TTATTATTTT TATGGTGGGT GGTGTAACCT ATGAAGAAGG GACCAAGATT
GCCGAGTTCA ACATACAAAT GAAAGGACGT GTTCATGTGA TTCTCGGAGG TAGCACGGTG
CACAACAGCA CCAGCTTTCT GGACGAACTC AGGTCCACAT CGCTATAG
 
Protein sequence
MKVLLLDAVT TQVVSSVYSQ TEILNQQVYL VSRLDETGSH TNGSASVSKS HLKAVVFCRP 
TQNNVNLIAK ELSQRPRFLE YHIFFSGILP SGLVRVLAES DRTERVRQVK EIYADFLPVN
EDLTSLQCRN TLAMTVAAGT SWAPKYAAQY ERNIQGLQSM LLALKRQPSC IRYAGHSACA
EELAKDMHDA IQADEIFHFR RSNAGGLLLL VLDRRDDPVT PLLSQWTYQA MVHELLGLNN
HRVILRGAPN VTKDLEEVVL AASQDDFFHR NRHSNFGELG EAIQKLLKEY QSQTANQSSA
SLNTIEDMQN FMDKFPELRS RSHNVSKHVA IMGELARLVE VCSLMDVSQF EQELACSDDH
NTHWRELMDK LGSNAVKVPD KLRLGLLYAL RYETSANIHM VQSAMGKGGV PQDMVDLVNV
MLRYGGAKSR GPGLFGNHDL MSKMTKNFMT SVQGVENVYA QHVPLIMDTV QTVMKGKLAA
RTHPIVPGSC TTRLHGDTVV PEEIIIFMVG GVTYEEGTKI AEFNIQMKGR VHVILGGSTV
HNSTSFLDEL RSTSL