Gene PHATRDRAFT_48357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48357 
Symbol 
ID7203568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011685 
Strand
Start bp220664 
End bp222148 
Gene Length1485 bp 
Protein Length494 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182795 
Protein GI219125036 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGTCTT CGACTCTTTC TAGAATTGAT TTGGCGAAAA GTAGTAACTA CCACCACGAA 
ATGCGTGATC CTCTAAAACC AAGGCGTTTG ATATACCCAG AGGGAACTTC TGCTGCTTCT
ACGCCGAGTC GCACGAGGAG ACTAATACGG TACTGTAGAG CGATCTCAGT AACGCTCGGC
ATGTTGTCTA CGGCAAGCTT TGGACTAAAT GCGCTATTGC TATACAATTC AAATACACCG
GGATCGACTG AGCCCTCTTT CGAGAATATT TTGATATCGC TTCTAGAAAA TTCCAAGACC
ACTGTGGTCG GTGGTCGAGA AAGGGACAAA GCTGTCGCTA AAGGGGCCCT GGAGTTTTCT
TCCCATCGAG CAAGTAGTCA GATACCGCTT TGGAGGCAAA CAGGCAACAG CTCCACAGAA
GCTCTGTTGA CAACAACATT ACCAGTCTGG ATGCAAACTT ATGTGGATTG GCATGTGGAG
ACACGTGCGA ATTTAACCTC ACAGAACTGG AACTCTACCC GCTACATCTT CATTAGCTGT
CTTGCTAGCG ACACAAAATG CGGTGGAGCC AGTGACCGGC TACAGCTGTT ACCTTGGGCT
GTACTCATGG CAGCTCGCGG CAATCGCCTT TTACTGATAC GCTGGGAACG TCCTTGCGCA
CTCGAAGAAT TTTTGGTACC GAAAGATGTC GACTGGACTG TGCCTGAATG GCTGTGGCAA
AGTGTGCAAT TGTACAGTCC GCATCCCAAG CTTTTGATGT CGGGCGGCAA GCCTTCATTG
CGACATGCCC AAGCGGCCGA CCTCATTGTC GCTATTAGGC AGCAAGCCCA TGATCACGGC
AAAACATTTT ACGATGAATT GAAAGAAGAC AATGAAGCGG GGTTCTATGA AGTGTTTCAT
GACGTGTGGA AGGCCTTCTT TCAACCTTCC CCGATAGTTC AAATACAAAT CGAAAAGACC
ATGGACGACC TCGGTCTGAG ACCGCGACGG TATATTGCGG CACATGTTCG CCAGAAGTAC
CACCGAGACA AGACACACGA CACCGACCAC GTGGACAATG CCGTGAGGTG TGCCTACCAA
TCGCGACAAG GCGTTTCGAA CACTATATAC TTTGCCTCCG ATTCAACCGT GGCCACAAAG
CGGGCCGTGG ACTTTGGGCG ATACATTACG GCGTTAGCCA ATGATACCGT GCCGTCGAGC
ATCAATGTTG TTGCACGCAT CAATGTGTCG GAGCCGCTGC ACCTGGACCG CGGATCTGCG
TATTTGCAAA ACACGGATAG CTGGCAATCT TTTAAACCAG ATGACTTCTA CGATGTGTTT
GTTGATTTGT ATCTTTTGGC GTCGAGCACT TGTGTGGTAT ACGGTGTTGG CGGCTATGGT
CTTTGGGCAA GTCTATTGAC CACGAAACGG TGTTCGTTCC GGCATTCAAG TCGCCACTGC
GGCTGGGAAG TTCCGTCGAA CGGGAGCATT GCCCATATGT TGTAG
 
Protein sequence
MASSTLSRID LAKSSNYHHE MRDPLKPRRL IYPEGTSAAS TPSRTRRLIR YCRAISVTLG 
MLSTASFGLN ALLLYNSNTP GSTEPSFENI LISLLENSKT TVVGGRERDK AVAKGALEFS
SHRASSQIPL WRQTGNSSTE ALLTTTLPVW MQTYVDWHVE TRANLTSQNW NSTRYIFISC
LASDTKCGGA SDRLQLLPWA VLMAARGNRL LLIRWERPCA LEEFLVPKDV DWTVPEWLWQ
SVQLYSPHPK LLMSGGKPSL RHAQAADLIV AIRQQAHDHG KTFYDELKED NEAGFYEVFH
DVWKAFFQPS PIVQIQIEKT MDDLGLRPRR YIAAHVRQKY HRDKTHDTDH VDNAVRCAYQ
SRQGVSNTIY FASDSTVATK RAVDFGRYIT ALANDTVPSS INVVARINVS EPLHLDRGSA
YLQNTDSWQS FKPDDFYDVF VDLYLLASST CVVYGVGGYG LWASLLTTKR CSFRHSSRHC
GWEVPSNGSI AHML