Gene PHATRDRAFT_31764 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_31764 
Symbol 
ID7196114 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp870442 
End bp872170 
Gene Length1729 bp 
Protein Length548 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176671 
Protein GI219109836 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACCGC CGGTTCCTAC TCGGCATCGA AAGGTGTGTC TTGAGCCGAG AAAATACGAA 
AATACAGGCG AACGTTTCTG TGCGGCTGGG GGAGAGTATT ATTCTCCTCC ACACACGACA
CGAATGGACT GCATCGGACC GGTGCAGCCT CCGGGATATC GCACGTATCG CGATTCATCG
CCGTATCCGT TTCCCCATCC AACTCATTCC ACACCATCTG GTTACTACAA CACAACCCCG
ACACTGGTAT CTAGTTGTCA CACGCGCACA CATACACATA TACAAACACA TACCCCTGAA
CTACGCGACG CAGCGTGGAG TGAAGTCGTG AGGTCGCAAT CGGTGCAATC GAATGAGAGA
ATGCCCCCAC CGCAGCATCT TCCGTACAGT GGCAGAGAGT ACGGTATCTT GCCCCCACGG
ATGCCTCCGC GCACCTCTTC CAGCCCGTGT CGTGCCGGAG TGCCTACGTA CCAGTCGACT
CCGGTCAGCA CCATGCCCGA GCACTACCGG GGCGAATATC ACGGGTTCAC AGCCGAAGAA
CGATCGGGGT ATTACACAGG CTACCGGCGA GCGTCAAATG CGTTGAATTG CCCTCCCACG
AAAACAGTGG TAGCGTCCAC CAGCAAAGTA ACGGATGATG GGGCCTCGTT CACGTCGGAA
TCGTGGGATG AGCAGCATCA GCTTTTCGTA AGGGCGTCCA TGAAAAAGAG AAGAGCTCCG
TCTACTACAT CCTTCCCGTC CAAATTACAC AAAATTATAT CCAACCCTCT CCACCGAGAA
TTTATAGACT GGCTGCCCCA TGGTAGAGCA TGGAGAATTT TGAAGCCAAA GATGTTTGAG
AAGGATGTGA TTCCTAAGTT CTTTCGTTCG GAGCGATACG CATCCTTTAT GCGACAGGTA
AGTCGATGGG GGGTCACATC GGTATCTTCC GGACCTGCAA ACACTGGCTC ATACTGATAT
TCTTCCCTTC CTTTATCAGG TAAACGGCTG GGGTTTCAAA CGTATCACGG AGGGTCCGGA
TCTTAACTCC TACTACCACG AGCTATTCCT ACGTGGGCTC CCTGATATCT GCCTTAAAAT
GCAGCGTGTC ACCTGCAAAG CGAAACCCAC TGACGGTGCG GAGTTTGGCG AGTGTCCAGA
CTTTTACAAA ATCAGTATGT TTGCTCCGCT ACCTGACCCT GACCTGCAAG ACGAAGAAAC
AGCAGCAGCA ACCCCGAAGA CAATTGTCAC CAAAGATTCT TCGACCAAGC TATACAAGCG
ACCAAGCTCT CCATCATCAC TAAGTACTAT GAGCGGATCT GCCGGGTTGC ACGATGACAT
GGCCATGACG AACGCTATGG AGCCTCTTAC ACCAGTTCGT TCCGTGCATT CCCCCGCTCC
AAATACGTCT CCTCTACCTT TCCATGCTTC TTTGGCCAAT TCTCCGTCGC TTGGATATAA
TAGTAGCAAC GTTAGCTTGA GTTCGTTCGG GGCAACGCAC GAAGAGCTGA TGTGGGGCAC
TGGACCTTTT TCATCCTTGC ATCATCGTGC CGAATCTCTC GGGGCTAGCG GTGGCCATGT
GACACCTCCG TCAAGTAGAC AATTCTATCA GTCAACGCGG TCTGAATCAC ACCACGAAGA
TACGAGCGGC CTTTCCGCAG CAGACTTATG CTATTTGACC CATCAGAACC GGGTTCTTTT
ACACCAAGCA AAAGGTTTCC GTAACGACAA TGAGTATCAG GGAGTTTGA
 
Protein sequence
MAPPVPTRHR KVCLEPRKYE NTGERFCAAG GEYYSPPHTT RMDCIGPVQP PGYRTYRDSS 
PYPFPHPTHS TPSGYYNTTP TLVSSCHTRT HTHIQTHTPE LRDAAWSEVV RSQSVQSNER
MPPPQHLPYS GREYGILPPR MPPRTSSSPC RAGVPTYQST PVSTMPEHYR GEYHGFTAEE
RSGYYTGYRR ASNALNCPPT KTVVASTSKV TDDGASFTSE SWDEQHQLFV RASMKKRRAP
STTSFPSKLH KIISNPLHRE FIDWLPHGRA WRILKPKMFE KDVIPKFFRS ERYASFMRQV
NGWGFKRITE GPDLNSYYHE LFLRGLPDIC LKMQRVTCKA KPTDGAEFGE CPDFYKISMF
APLPDPDLQD EETAAATPKT IVTKDSSTKL YKRPSSPSSL STMSGSAGLH DDMAMTNAME
PLTPVRSVHS PAPNTSPLPF HASLANSPSL GYNSSNVSLS SFGATHEELM WGTGPFSSLH
HRAESLGASG GHVTPPSSRQ FYQSTRSESH HEDTSGLSAA DLCYLTHQNR VLLHQAKGFR
NDNEYQGV