Gene PHATRDRAFT_31518 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_31518 
Symbol 
ID7196064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp272934 
End bp274322 
Gene Length1389 bp 
Protein Length462 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177053 
Protein GI219110603 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGCGT CGACGAAGAA AAAATCCAAG AATGGCCAAC GCACGAAGGC CACCAACAAG 
ACAAAGGTGC GTGAGATATC TCCAGGTACC AAGATCATCG AGGACCTTAC CAGCACCGAT
GTTTTGTTGG GAAGAGGCAA CGGAGTGGCA GGGTTTGTGG GCAACCAAAA CTTCCGCAAG
TTGGTCTGGT CGCAAAAAGA TGCCTACGCC AGCGCTTATC GTAACGAAAA GGGGGTTGTT
GCCGTTAGGG TCATGAGATT GGTTGCTCAA CAAGATCCTC CCGGTCGCTT CGTCGAACGA
ATTGGTCCCA ATCATTTCTT CGAGGTTGAC GAGTCGAAAG CCTTAGAAAA AACTTGCCAA
GCCCTTCGTG AGAAGAAAAA CAAGAGACCT CCTGGTTTAA TCATGACACA GCGTCCTCAT
GTCGTGAAGC CCAAAGAGCT ACGAGCTGCT AGCTGCCCTC AGACGGAAGG AAAGGTCTCG
ACAAACTCGA AACGAACGAA GCGCTCGACT GTGCGGAAAT CAGGTTCAAA CAAAATTGCG
GGAAAAGAAA CGAAGGCGAA GCTAGTGAAA AGAAAGACTA CAGGAGCAAA GGTAAAGCTG
TCTCCGAGGA TTCAAATTAA GGGTATTAGC AAGATAAGTG CTCCTCTACC TCCTCCACAG
CGGAAGTACC CAGCGAAGTC GCCACGCAAG ACCCCGTACA AGCCGAATGA GACTACTGCG
AGTATTTCTG AGGGAAGCAC TGGGAAGCAA ATGGAGATAC AATCTCCCAC TGAGCGGACG
TCTCATCAAG GCACTTGCAC GAACACGAAC CACAATGTTG TCGCAACGAT GCGCACCACG
TACGAAGGAA CACCCGGCGC GTGCTACAAA CCTCAAGATG CCAATAGTGA CAACGTGATG
AGTGAACAGG AATGCATTGC CTACACATCC CCCATTGCAA ATACTCTCAA ACGCGGCTCA
ACGAAGGATA TGGATTACGA GTTTGCTGCT CTTCCCCCAC ACCTGACTGC TTTTTTCAGT
GGAATTTATT CCAACCATTC CTGTTTTGGG GATGACGGGA CACAGTCAAA AGCTATTGCT
ATCACACCAA TCTATGAGGC TCCTCCAACC ACCACGTTGC CAGCTACATG GAGTCATCCA
AGCAATGAAC TTGCTAGCTT CACAAGTTTC TTGTGGGGTA ACGTGGGCAA AAACACCACA
TCCACATCTG CCCAGAAGTC ATCTTCAGAA TCGCCTCCAA CTGTTGTGGA CTTCGATTTC
ATAACTCCTC CCAGTTTCGG AGAGCCACAG CAATCCCTCT TGCTTGATGA TATCAATGAC
GGGACAAGTT TTTGTGACGA GCACTTTCCG TCTCTATCCG AAGAAGATTT TGCTATGTTT
ATGGTGTAA
 
Protein sequence
MPASTKKKSK NGQRTKATNK TKVREISPGT KIIEDLTSTD VLLGRGNGVA GFVGNQNFRK 
LVWSQKDAYA SAYRNEKGVV AVRVMRLVAQ QDPPGRFVER IGPNHFFEVD ESKALEKTCQ
ALREKKNKRP PGLIMTQRPH VVKPKELRAA SCPQTEGKVS TNSKRTKRST VRKSGSNKIA
GKETKAKLVK RKTTGAKVKL SPRIQIKGIS KISAPLPPPQ RKYPAKSPRK TPYKPNETTA
SISEGSTGKQ MEIQSPTERT SHQGTCTNTN HNVVATMRTT YEGTPGACYK PQDANSDNVM
SEQECIAYTS PIANTLKRGS TKDMDYEFAA LPPHLTAFFS GIYSNHSCFG DDGTQSKAIA
ITPIYEAPPT TTLPATWSHP SNELASFTSF LWGNVGKNTT STSAQKSSSE SPPTVVDFDF
ITPPSFGEPQ QSLLLDDIND GTSFCDEHFP SLSEEDFAMF MV