Gene PHATRDRAFT_50623 details

Gene Information

Locus tag: PHATRDRAFT_50623
Symbol:
ID: 7199481
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011701
Strand:
Start bp: 4624
End bp: 5845
Gene Length: 1222 bp
Protein Length: 382 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002185591
Protein GI: 219130901
COG category:
COG ID:
TIGRFAM ID:
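
The reported gene length agrees with the start and end coordinates when both ends are counted (5845 - 4624 + 1 = 1222). A minimal sketch of that check follows, assuming the coordinates are 1-based and inclusive; the function name `span_length` is hypothetical and not part of the source database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be greater than or equal to start_bp")
    # Both endpoints are part of the feature, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = span_length(4624, 5845)
print(gene_length)  # 1222
```

The same function can be reused for any feature on the replicon, provided its coordinates follow the same inclusive convention.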


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000102021
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AAGTTGGTTC TCCCTCTCAT AGTGGAACAA AGACGACGAA AATGAGCTCA AAAACCAAAA 
TGCGACGCAA GAAGCAATCC ATATCCTTAG GTAGCGTTAT TTCTGCTGCA GCTGTCGCGT
ATGGAACATA CAAAGTAGCG GATTGGGCGT GGAATCGTTA TGTCACAAAA CGGAAGAAAA
ATGATTATCA AGTCAACGCC GCCATTGCTA CTTCTTTCAT GAACTTCTTA TGCTCGCAAA
CAAGTGTGGG CGCCCACGCG GAAGATGGAG TCGCTTCTCA TATTGATCAC ATCCCCGGCC
CAAACCGTCG CTTACGAATG CGTCGCCAAC GCATGACTCG TTGCAGGCAG GAAGCAGCGC
AGGCGCTTCG AGGTTTCTCA CCGGCACTCC GGTCGATTGT AGAGTTGCAT ACAAATACGG
CGCAGGCAAC CCGGCTGCTC AAGCAACTTC GGGCGAATCG AACCACAGAA AAGCATGCTA
CTTCTCGACG TTCTGAAGAA CAGGCGCTAT GGAAGGAAAT TCAACGGAAG ACGATGACCC
GTATGTTGAC AACTGCCTAT GCCCATACGA TTTTATTTCT TGTCCTTACC ACGCAAGTAA
ATCTATTGGG AGGACGATTA TTCGAGGAAT CTTTGCAGAA TACTTCCTTG TCTTCAAACG
TCTCGATGAG TAACGACAGT GTCGCCTCCG ATCGAATGGT GTCTTATCAA GAGTCCCATC
GTTTTGTCCT CCAGCATACA TATGATTATT TTCTGAACAA GGGTGTTCAC TCTCTGTTGT
CAACAGTCGA GCAGGCTGTC GATTCTGTTT TGGGAGGATG GAACGTCTTC GATAAAGCAT
GCCTACACAT TTCACGAGAA CAGTTTGACT GTGCGCTCGT GAAAATCCGA GGCTTGATAG
AAGGTGGCCT GAGGACAGAT GTGAGCAGGA CTTCTGGAAG GTCATCAAGA CGCGAAAGCA
TCCTTCGTTT TCTTATGCCC TCCTCAATCT TGGAGCATTC CATTCAAGAC GACCTAGCGA
GATCCATTCT CGACGAAACT TGGGATCTTG TAGAAAGCCC TGTGTTTTCG GATGCTCAAC
AGGAGTGTTT AAATGCCACT TTTGCATCTA TGCGGGATCG TTTTTGGGGC AAGATATTTG
ATGACAACGG ACTTTCTGGG ACAAAACCAT GGGCGCATGT CTTGACCCAA CTAAGAACGA
CGTCCAACAG TTTTTTCGTT GA
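
The sequence above is displayed in blocks of ten bases separated by spaces, so the spacing has to be removed before the sequence is measured. The sketch below shows one way to do that and to compute a GC fraction, assuming GC content means the share of G and C among all bases. The names `clean_sequence` and `gc_content` are hypothetical helpers, not part of the source database, and only the first displayed line is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed line of the gene sequence, used here only as sample input.
first_line = "AAGTTGGTTC TCCCTCTCAT AGTGGAACAA AGACGACGAA AATGAGCTCA AAAACCAAAA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_content(seq), 3))
```

Passing the full gene sequence through `clean_sequence` should give a string whose length can be compared with the Gene Length field, and `gc_content` can then be compared with the GC content field.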
 
Protein sequence
MSSKTKMRRK KQSISLGSVI SAAAVAYGTY KVADWAWNRY VTKRKKNDYQ VNAAIATSFM 
NFLCSQTSVG AHAEDGVASH IDHIPGPNRR LRMRRQRMTR CRQEAAQALR GFSPALRSIV
ELHTNTAQAT RLLKQLRANR TTEKHATSRR SEEQALWKEI QRKTMTRMLT TAYAHTILFL
VLTTQVNLLG GRLFEESLQN TSLSSNVSMS NDSVASDRMV SYQESHRFVL QHTYDYFLNK
GVHSLLSTVE QAVDSVLGGW NVFDKACLHI SREQFDCALV KIRGLIEGGL RTDVSRTSGR
SSRRESILRF LMPSSILEHS IQDDLARSIL DETWDLVESP VFSDAQQECL NATFASMRDR
FWGKIFDDNG LSGTKPWAHF FR