Gene PHATRDRAFT_48735 details

Gene Information

Locus tag: PHATRDRAFT_48735
Symbol:
ID: 7195023
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011687
Strand:
Start bp: 103670
End bp: 106592
Gene Length: 2923 bp
Protein Length: 820 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002183421
Protein GI: 219126347
COG category:
COG ID:
TIGRFAM ID:
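
As a quick consistency check on the figures above, the following sketch recomputes the gene length from the start and end coordinates and compares it with the length implied by the protein. It assumes the coordinates are 1-based and inclusive, and that the coding sequence is three nucleotides per residue plus one stop codon; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 103670
end_bp = 106592
protein_length_aa = 820

# Assumption: Start bp and End bp are 1-based, inclusive coordinates.
gene_length_bp = end_bp - start_bp + 1

# Assumption: coding sequence = 3 nt per residue plus one stop codon.
coding_length_bp = (protein_length_aa + 1) * 3

# Bases inside the gene span that are not translated
# (introns and any other untranslated sequence).
untranslated_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, untranslated_bp)
```

Under these assumptions the gene span is 2923 bp, matching the reported gene length, while the coding portion accounts for 2463 bp. The remaining 460 bp are consistent with the gene being marked as spliced.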


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.60554
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTTTGC CCGTTACTGC CTATCTAACC ATCGAATCCT GTTTAAAATG TGCAAACGGA 
ACAATATTGG TACCCAAAAA ATCTGTCACT GGTTGTCATG AGTATCAAGG TTACAATGAT
GGATTCTTTT TTGTAATTAA CACATGGAAG TTATTGGGTC GCGTAACGTT TGGACGTATT
CTCAATCGGC AAGCGGAGTT CGTTCTTTCA GCGAGCCCGC CGATTCGTAA GTAGAACAAG
GTTGATGTTT GAACTGAGCT AGTCGTAGAG TAAGGTAACG AGTCCCCTCC AACGGTACCA
CAATACCTTG CGCATTTTCA CTGACAGTGA GGACGATTAA TAGATTGATA AGATCAAAAG
TCTTCAACAC GACTTTGAGA GTTCCGTTTG CTGCCTTTTG CTTTTTCGAG AAAGATGATA
ATGTTTCGTT CCGTCGTAGC CCTTCTTGCG TTGGGCATGA CATCAGGCCA AGCGCCGACC
TGTGACGTTG CGGACGCCGT GCCTAGCTTG CCGTTTCGAG TTTTGGAAGA TACGCAAGTG
GTGAGCCCAA CAGCGCAAGC CGGAGCCGAT GTGTGTGGAA TCTTGCCCGC TAGTAACGTA
CAAGGACATT GGTTCTCCTA CACAGCTCAA GCAGACGGCT GCTTGGACGC CGAAGTCATT
GGTCTCTCCG CAAGTGATGC TATGACGCCG CCTCTTGACC CCATCCTGTT GGTCTATACG
GGGACTTGCA GTGCGCTTAT GTGTACAGCC ATGGCCGACG ACATCTCCTT GCAGAACCTC
AATAGTCTCG TGGAACTTCA AGCCACTGCT GGAACAACCT ACTATTTCCT GGTCACTGGT
TTTGGGGGTG AAAGTGCTGG GCCCTTTTCT TTCACCATTG TGGTAAGTTG CGTGGTTTCG
CGTTGCTGAT TGACGCTCTT TTGTACCCTA CGTTGGAATT GTACCTAACC ATCTTCATCC
TTTTTGTCTC ATTTCAGCCT TCTGCGTCTA CTCAATGTGA AAATCCCTCT GCCAACCAAA
TCTGTCCCGT CTGTCCCAAC GGTGGCGAGC CGGCTGCCGG CGCCCTCTTC GATGAAGACG
TGCTGTGTTC CGATGCCGCG GTAGACGGCT CAATCCTCGA TGACGGTGGG GAAAGCTGCG
CCATTTTGCA AATTGCTGGA ACCACCATTT GCGGATGCCC CACCTCTCAA GAATCTTGCC
CACTTTGTCC CGGTGGAGAG GACGTTGCCA ACCCAGATCT GGTCGTTTTG GGCGATGGAA
GCCTGACATG CGGAGTCTTG AACAGTCTTG ACGGAGCCGA CACCTGTGGA GCCGTTACCG
CCGGGTTCGC AGCGGAATGT GGCTGCCCTG GGACAACCCC CTGTCGTCTT TGCGATGAAA
CGGCAACCAA TCCAAATCCT GAACGCCTCT TGTTCAACTT TCCACCGGAT AACGCTCCGT
ACACCTGTGC GGATGCCGAA GCGGATATTA GAGCCTCGGC TGTGCTCAAT CCCATTGAGC
AAGGCTGCAG TCCAAGCCTT GTGGACTTCG TCACCGGATT CGATGTCGTT GACTTTTGCT
GTTTCAACGG AGACTTCCCC GGTGATCTGA ACATTTCAAA TTGCGATTCT GCTAGCGTCA
TTACGGCCTT GCCGTTTACG GTCAGTGGAA ATACAGGAGA TGCCACTCCA GAAGTGAACG
CCCAGGCAGA ATCATGTGGC CTCCTGAACT ACGGAGAACA CCAAGGAGAA TGGTATACCT
ATACCGCTGA TGCAAACGGC TGTGTTACCG TTCGGACTTC CGGAAACTTG GACTCCATGT
TGTTTGTGTA CTCTGGAGAA TGTAGCGACT TAACGTGCGT CGCGATGAAC GACGACGCCG
TTTTCACAGT CTCCGGAAGT GAACTGACCT TTGACGCGGT TGCGGGGACG AGGTACTTCT
TTATGGTCAC GGGATCTTCC TCCGACGATG TTGACACCTA TACCCTAGAG ATTTCGGTAC
GTGCTCGAAT ACCAACTATG CAAACAATTA AATTCGCGCC AACATTAAAA AGGAGCCGGC
TTACCTAAAT ATGCCTCGTG TCTTACTACA TTCTTCCCCT CTGTTTTCGT TTCGACAGCA
AATTGCAGGA AACTGTCCGA GCCCGACCGG GGGCGAACTT TGTCCCGTAT GTCCAGACGG
TAGTGACCCC GATCCTACCG CATTTTACAA TGACGACCTT TTGTGTATTG ACGCAGCAGC
CGAATTTGGA GTTGTAGACG GAGACTCACA AGATTGTGTG CTATTCCAAA CTATCGGAGC
ACCCATTTGT GGTTGCGAAG TTGTTGCAGC AGACACGTGC AACTTGTGTC CGGATGGAGA
AGATGTCCCG GCTGTCGCGG CCGACAAGAG TATTCCTGAT GCCCTGACGA CCTGCTCGCA
GCTCAACAAT GTCGCAGGTA CCAGCACTTG CGGAGACGTC ACGGCTGGTG TAGCCAACTT
CTGCGAATGT CCAAGCTCCA GTCCTATTTG CACTTTATGT GACGCCAGCT CCACCATGTT
CAACCCTGAC CTTGTTCTGT TGGAAGATGG AAATTACACG TGTGGAAATG CCAACGAAGA
TACACAGTAC TATTACCTTT GGTACCCTCT TGACGCCGAG GGCTGCAACC CAAGCATTGC
GGTATCCTTC ATTGATAGTG GAATCAACGT GATTGACTAC TGTTGCAACG GCGGTCCTCT
GACGGGAACC CTCGGTCCCA CGGCTTCTCC GGCGACTGGT CCAACGGCTT CCTCGGACGT
CCCGACTACC GATTCCGGTG CCGGTGCCGG CTCCGGCAAT ACACCGGAGT CCACGTCGAC
TTCCGTTGCA GTTTCGTTTC GAGGTGGAAT CGCCACAGTG TCCTTGCTTT GTTTGTTTGT
CTTGCTCAAC TAATAGGTTA GACTATTCTT ACTGGATGTT TTA
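
The gene sequence above is printed in space-separated blocks of ten bases, sixty per line. A minimal sketch of how such a listing could be joined into one string, and how a GC fraction could be computed from it, is shown below. The helper names `clean_sequence` and `gc_fraction` are hypothetical and are not part of any database tool, and the result is not guaranteed to reproduce the exact method behind the reported GC content.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces and line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = (
    "ATGACTTTGC CCGTTACTGC CTATCTAACC ATCGAATCCT GTTTAAAATG TGCAAACGGA"
)

seq = clean_sequence(first_line)
print(len(seq), seq[:3], round(gc_fraction(seq), 2))
```

Pasting the full listing into `clean_sequence` should give a string whose length can be compared with the reported gene length of 2923 bp.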
 
Protein sequence
MTLPVTAYLT IESCLKCANG TILVPKKSVT GCHEYQGYND GFFFVINTWK LLGRVTFGRI 
LNRQAEFVLS ASPPIPLLAL GMTSGQAPTC DVADAVPSLP FRVLEDTQVV SPTAQAGADV
CGILPASNVQ GHWFSYTAQA DGCLDAEVIG LSASDAMTPP LDPILLVYTG TCSALMCTAM
ADDISLQNLN SLVELQATAG TTYYFLVTGF GGESAGPFSF TIVPSASTQC ENPSANQICP
VCPNGGEPAA GALFDEDVLC SDAAVDGSIL DDGGESCAIL QIAGTTICGC PTSQESCPLC
PGGEDVANPD LVVLGDGSLT CGVLNSLDGA DTCGAVTAGF AAECGCPGTT PCRLCDETAT
NPNPERLLFN FPPDNAPYTC ADAEADIRAS AVLNPIEQGC SPSLVDFVTG FDVVDFCCFN
GDFPGDLNIS NCDSASVITA LPFTVSGNTG DATPEVNAQA ESCGLLNYGE HQGEWYTYTA
DANGCVTVRT SGNLDSMLFV YSGECSDLTC VAMNDDAVFT VSGSELTFDA VAGTRYFFMV
TGSSSDDVDT YTLEISQIAG NCPSPTGGEL CPVCPDGSDP DPTAFYNDDL LCIDAAAEFG
VVDGDSQDCV LFQTIGAPIC GCEVVAADTC NLCPDGEDVP AVAADKSIPD ALTTCSQLNN
VAGTSTCGDV TAGVANFCEC PSSSPICTLC DASSTMFNPD LVLLEDGNYT CGNANEDTQY
YYLWYPLDAE GCNPSIAVSF IDSGINVIDY CCNGGPLTGT LGPTASPATG PTASSDVPTT
DSGAGAGSGN TPESTSTSVA VSFRGGIATV SLLCLFVLLN