Gene PHATRDRAFT_48046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48046 
Symbol 
ID7203385 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011684 
Strand
Start bp1897 
End bp3989 
Gene Length2093 bp 
Protein Length678 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182595 
Protein GI219124615 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0330655 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGATG CTGTGTTCTT CTCCGTCGTG GCTCTCGCCA AAGCGGGCAT TGAATGTTGT 
CGGAACGCTC AGATCTGCAA GGATGAGGCA GCCCGGATCG GTAAGCGTCT GACAATAGTG
GTCGCGCGAG CACACGAATG GGGAGCTGTT TGTGAAAGTA CGCGCCTTGC TCATTTTCAT
GAAGTTGTGG AGAATGTCTT CCTGCGCTTG CAAGCAACCA CATCGTCTGT AAACAGGCGC
TCCTTGTGGA ACAGAAAATT CAAGATTGCG TTACAACCCC AGACTATACT CTGTGAAATC
CTTAAAGCAG AGAGCCAGCT GAATACTGCC ATCAATGATC TTCAGATGGA GCAGTCCAAT
GCCATCTTTT CGCACCTCCT TGACGTCTCC AAAGGAGTTG CGGATTTGCT AGATCAGTTT
GGCGCTCTCG CTCTGAGCAA GTCGGATCCT TCTGCAACAA TCCAGCGGCA AGTTGAAAAG
GCCTTGGCGG AAACTCAAGT TCGAGCTCCC CAAGTTGCCG TTGCCAACCC CGATGACGTT
AGCCAAGATG GTGGACGAGG GCAAGATCCT ACCTTAGCTA CAAGCGGTAT GAAGGTTTCG
CACCAGTATA AGACAACCTT TTGTCCATCC AAGGATGATG TACTCGCTAT CTCGCTCCAG
CCATCCTTGC TGAAGTTTTG TGATGACCAC GAGAGCCTCC TCGGAGGTGG AGGTTTTGCG
GAAGTCTTTC GCGGAACATA CAACCTCCAA CCTGTCGCCA TCAAACGCCT CAAGGTATAT
CGGGGAGATG TAAATTCTCT CTCGAAACTT CAGATCGCCC GCGATGTGGA ACAACTAGCC
GCGGAAGCCC TTCTGACGCA CAAATGCGGT ACACATTCAA ACATCATTCA TGTTGTTGGA
TGCCTTAGCA TACTAAGCGA AGTCGAGAGA CCCTTGCTTG TCATGGAACT TATGCATACG
ACTCTCTTTG ATGTTATTCA CGATCGTGTT TTAGCAGATG CTCTGGCATT TTCCCGTCGT
CTCTATTTGT TGAAAGGTAT TGCAGGCGCG TTAGAGTTTC TTCATTTGCA AGGCATTGTC
CATCACGACA TCAAGTCTCT CAACATTTTG TTGAACAAAA AATTGACGGT TGCCAAGCTG
GCAGACTTTG GGAGTCAAAA GTGAAAGGCC TGAACACCAC AAAACTCCGT CTTGGAACAA
TCCTAGCAGC TACCAGCCAT CAAGGCAACC AGATTGCAGG TACAGCCGCC TACCAAGCGC
CCGAAATCCT ATCAGAAGAA GTCAAAGACA CATCGCGCGT TTGTGAGATG TATTCATTCG
GGGTGACAGT ATGGGAGTGT GTGACGAGCA AAATTCCACA TGGAGGGAAA AAGGAATCAT
CTATAGCGCT TTTGGCTGCA ACTAAGAAGT ACCTCCCCAT GCTTGCGGCG CCCTCCAGCC
CCCCAAAGGA TCTTTCAGAG ACAGAATCAG CTTCCTGGAA AGCGCTGAAT ATGGTTGCAG
CATCGTGTCT CTCTCGCGAC CGCTCAGTGA GACCCACTGC TTCGATAGTT GTTGCGCTCT
GGCACAAAGT CAAGTCTCCG GAAGTGGAAG TACCTTATTC TTTTTTTCAA GACTCGTCTT
TTACCAAAAT GGGCGATATT ACCGAAAGTC GAGTAACGGC CGGTCCGACG ACTCAGGGAA
CAGCAAAAGA CGACCTTGTC ATGGATGTTC CGGACGACAG AGAAAAGTCT AAATGCGGAG
CAGCTGCGAT GTCAAAAAAT CATCGCCGCA AATTATTGGT TGTTGCACTC GTTGCTGTCT
TGCTTACGCT TACAGTCACA GTGGTCGTTG TGCTGGTATC CAAATCGTCG CCAGATCTTG
CATCCCCAGC ATCAGGAGTT GATGTACCAG CTGATGCGCC AGTCTCTACG TCCCCGCCAT
CCCTACCGCC GACTGCAGTT CCGGTCTCCG TCCTCCCGCC GACTGCAGTT CCGGTCTCCG
TCCTCCCGCC AACCGGTACG CCAATGACCG TCCAATCTCC ATCTGGGGCG TGCCTCAACG
CCACAAGCAT AAATAGCCCA AAATGGGTTG TTTTTAACGC TTTTAAACTT TAA
 
Protein sequence
MTDAVFFSVV ALAKAGIECC RNAQICKDEA ARIGKRLTIV VARAHEWGAV CESTRLAHFH 
EVVENVFLRL QATTSSVNRR SLWNRKFKIA LQPQTILCEI LKAESQLNTA INDLQMEQSN
AIFSHLLDVS KGVADLLDQF GALALSKSDP SATIQRQVEK ALAETQVRAP QVAVANPDDV
SQDGGRGQDP TLATSGMKVS HQYKTTFCPS KDDVLAISLQ PSLLKFCDDH ESLLGGGGFA
EVFRGTYNLQ PVAIKRLKVY RGDVNSLSKL QIARDVEQLA AEALLTHKCG THSNIIHVVG
CLSILSEVER PLLVMELMHT TLFDVIHDRV LADALAFSRR LYLLKGIAGA LEFLHLQGIV
HHDIKSLNIL LNKKLTVAKL ADFGTTSHQG NQIAGTAAYQ APEILSEEVK DTSRVCEMYS
FGVTVWECVT SKIPHGGKKE SSIALLAATK KYLPMLAAPS SPPKDLSETE SASWKALNMV
AASCLSRDRS VRPTASIVVA LWHKVKSPEV EVPYSFFQDS SFTKMGDITE SRVTAGPTTQ
GTAKDDLVMD VPDDREKSKC GAAAMSKNHR RKLLVVALVA VLLTLTVTVV VVLVSKSSPD
LASPASGVDV PADAPVSTSP PSLPPTAVPV SVLPPTAVPV SVLPPTGTPM TVQSPSGACL
NATSINSPKW VVFNAFKL