Gene PHATRDRAFT_48784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48784 
Symbol 
ID7195098 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011687 
Strand
Start bp231122 
End bp232880 
Gene Length1759 bp 
Protein Length532 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183324 
Protein GI219126145 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTGTACCGTT TGGTTCCAAA GTACTGGCGA TTTCAAAGAT CTTTCGAAGT CAATCAGCTG 
ACAGCGGTTC CTCGGAACGA TTTTCAGCTC AAGTCCCAAA ATGACCTCGA GCAGGACCAT
GAGCCAACTT TCGACAGAAG ACGTCAACGA AGAAACTGTG AGTAAAATAT TGGCATCCCT
TTGCGGCTCC GTCACGACAT GACGCCTTAT TGTCCAGCAA AAACGTTCTG AAGAAGACAT
TTGCAATATT TACCCAGTGA TTGTATTACC CCGGATACGC AGCGGACGGG TTACCTGGTT
TGCTGCATAT CAGCTTCGCA TGGTGGCAAT ACTACAGAAA CATTCGAGAA ATAGCACGCC
GAGCTCAAGG TTTCCTCGCG CGGTTTGGCC ATTCTTGTTT TGCGTGGTTA TAGGTCTAAT
GCTTTTGACA ACGCAATGGT GGCTATTACT CTCGCCCGTT GATAGCAGTC TGTCGGAGTA
TAGGGCTTAC CTATCTACAA AGGAATATCT TCGACCCACA AGCGCTTACT TGTCAGCCGT
CCGTGAAAGC CCCGATATCC TGTATTTTAC AAGTCGCCCT CCTGATCCTA TACCCTTGTT
TTCGGTCGAA TCGATTCCCA TCCTGCAACG TCATGCCTGT ATTGCCAAAA TACGTGAACA
GCATGATAAG GTATATGCCT CGTTCGAACG GATAGGAATG AGTGATCGAG CATTGTTGGT
AGATCCTGCC TATCACAGCA ATGTGGGAGA TCATATGATC ACGCTGGGGG AACTGGAAAT
GCTGAGGAAA TTAGGATACG GTACGCTTGA TGAACCTAGC GCAAAGGTGG CGCAATGCAG
CTTCTTACAG GCTGGGAACT ACGCGCCTCC CTGCCATCAT TTCAGCACTG GAGGGGCGCT
TTCCAACATC ATATCCATAT CAAACCCACC GCTGGCTTTT TGGCATGGAG GAGGTAACTG
GGGAGATATG TGGCCCGATA TACAAATTGC CAGGATGAAA TCCATGGAAC CATTGCTTAG
GAGCAATTAC ACGATCATTT CCATGCCACA AAGTTTGTAC TTCCAAAAGT CGAGCAGAGA
GCAAGAATTC ACCTCTATAC TGAAGAAGCA CATTGAGCTT GGTTTGGGAG CTAGTTCTAC
GTTGGTGGAC CCGCGCGAGG GTCGTAGGCA GACCGCTTCG CGGGTTGTGC TTTCGTGGCG
TGAGCACGAA AGCTACGACA TAGCCCAACG GCTGTACCCC TTCGCCACCC ACATTCTAGT
GCCTGATATT GCGTTCCAGC TCGGACCGTA CTCGCCGGTA GCATCGCAAG AAGACTTTTT
AAAGGTTGAT TTGGTCTTGC TCCTTCGAGA TGATCGCGAG TCCATGTACG CCACTCAACG
CAATCGGCGA GCCGTTCGTG ATATTCTGTC TGACCTACCG AACGGTCAGA GGCTATCGTT
TTCTATTGTG GACTGGACAG ACCGCTCGGA CCGATTTGCC TCTAAAGACA TCCTTTTCAC
GAGTTCCGCC ATTCAACTTT TGAGTATGGG GAAAGTGGTC GTGTGCGACC GACTTCACGC
GGCGATTCTA TCATACTTGT CCGGAATTCC GTTCGTCTAC CTTGAACAGC GGACGGGGAA
AATTACGAAA ACTCTTCAGG TTGCGTTTGA GCTGAACGAG ACTTGCTTGG ATGGATCGAA
AGCCATGTGG TCTCAAGCGT ACAATCTGAG TGATGCCGTT CGCCAAGGTG TAGAGTTCCT
TGATCGCTAT AGGCTTTAA
 
Protein sequence
MTSSRTMSQL STEDVNEETQ KRSEEDICNI YPVIVLPRIR SGRVTWFAAY QLRMVAILQK 
HSRNSTPSSR FPRAVWPFLF CVVIGLMLLT TQWWLLLSPV DSSLSEYRAY LSTKEYLRPT
SAYLSAVRES PDILYFTSRP PDPIPLFSVE SIPILQRHAC IAKIREQHDK VYASFERIGM
SDRALLVDPA YHSNVGDHMI TLGELEMLRK LGYGTLDEPS AKVAQCSFLQ AGNYAPPCHH
FSTGGALSNI ISISNPPLAF WHGGGNWGDM WPDIQIARMK SMEPLLRSNY TIISMPQSLY
FQKSSREQEF TSILKKHIEL GLGASSTLVD PREGRRQTAS RVVLSWREHE SYDIAQRLYP
FATHILVPDI AFQLGPYSPV ASQEDFLKVD LVLLLRDDRE SMYATQRNRR AVRDILSDLP
NGQRLSFSIV DWTDRSDRFA SKDILFTSSA IQLLSMGKVV VCDRLHAAIL SYLSGIPFVY
LEQRTGKITK TLQVAFELNE TCLDGSKAMW SQAYNLSDAV RQGVEFLDRY RL