Gene PHATRDRAFT_42467 details

Gene Information

Locus tag: PHATRDRAFT_42467
Symbol:
ID: 7196665
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 148666
End bp: 150977
Gene Length: 2312 bp
Protein Length: 679 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002177026
Protein GI: 219110549
COG category:
COG ID:
TIGRFAM ID:
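
As a quick consistency check on the coordinates in this record, the gene length follows from the start and end positions if they are read as 1-based, inclusive coordinates (an assumption; the record does not state the convention). A minimal sketch, with a hypothetical helper name:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Start and end values copied from the record.
print(span_length(148666, 150977))  # 2312, matching the listed gene length
```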


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.486955
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CCCATTTCTC CATCAGCTTC TCATCCATTG GGAACACCAC CATGATGAGT AACCATCACT 
GCAGCAACAC GAATCAGCAG CTCCAGGAAA AGAGATTGAA GATAAATGGT AGCAACAGTA
ACAATAGCAG TCCATCGAGA AGCTTGGCAC GGATGGCCTC ACATCAGCAC CGTGTGTTGC
TGTACGCAAC CTGTTACAAC GTCCTGGATG GGTGAGTAAA GGATTGGCCC ACCCTTCGCA
CTAATGCGTG CACGAAGAAA GCTGGCTGCA AGGCATTCTT TTCTCACTCA TCCTTTTCTC
ACTTATTTCT GTTGTTATAG AGTGACCTTG ACGATCCGCA AGCTTGAACA AGAGATCCTC
AACGCCGGGC ATCTTGTGTG CGTGCTGACT ACCCGTTCGG GAGATATGGC CAATACCAAT
CTAGACGGTT CTCATCCCAA TCGCAATATC ATCTTTCTCG ACAACGCAAA GCCTATTCCC
TTTTTGAACG ATCCAAATCA ACCCGATTTG GCGTATCAAC TCGGTTTTGG CCTTTCGGCT
ACCGTTCGGA GACAGCTGGA TGAATTTGAA CCTTCCATTG TACACATTAC CTGCCCCGAC
TGTACGGCCT TGCACGTCAT TCAGTACGCC CGACTCAAGG AAATACCCAT CATGGGAACC
TACCATTCCA ACATTCCCGA GTATATGGAG CATTACCCGG GCCTTTCCTG GCTCAAGCAC
ATTCTGAGCT GTTTCTTTCG CCATCAGTAC AATTTTTTGC AGGCCCTATA CGTACCAACG
CCCTACATTC AAAAACACCT CGAAGACAAC CACGAAATGG ACAAGGTCAC GTCACTCCAA
GTATGGGGTC GTGGCGTCGA TATCGACCGA TTCAACCCCA GTTTTCGCTC CCTCAAATAC
CGCCGAGATT TGGGGATTGG CGACGATACC GTGGTTATTT CGTGGGTAGG ACGATTGGTT
CCGGAGAAAA GGGTCGATAT TTTTGCCGAT ACTGTGCGCC GTCTCAGTGC CCAAGGACTT
AACGTGCACG CTCTCGTAGT TGGTGCCGGA CCAGCCGAAG AAGAGATCAA ATCTCTTCCC
AATACAACCT TCGCCGGTTG GATGAATGCC GACCAACTCG CCGTGGCTTA CGCCTCTTGT
GACGTCTTTC TCTTTCCATC GTCCGTTGAA ACCTTCGGCA ACGTTACTCT TGAAGCCATG
GCGTCGGGAT TGCCGGTCGT TGTGGAACAA GGATGCTCCG GTCATCTCGT GCGCCACGGG
GAAAACGGGT ACGCTTGTCA AGCAGGCGAT GCGGATGCCT TTTTTGAATG TACCCGAGAT
CTCGTGGTAG ACCAGACACG GCGCGAGGCC TTTCGCGAAA CGTCAAGGAA TATGAGTCTA
TCGTTGGAAA AACGTGCCGT TGTACGAAGG ATGCTTGATC ACTACACTCG AGTGACGGAC
GAATTTTATA TGGAATACGG TGGACACCAC GCCAACCGTG ATCAAGTCTA CCGCCACACG
GGGTCCTTTC GCGCGGGGAG TCACCCACGT CCCCTCATCC TCGTGTTTAT TGAATATTTG
TTCATTGTAC TTTTTCAAGT GATTTGGAAC ATGACGGAAA TGTTCCTGTA TATGCAACAG
GCCTTGGGGT CGGTGCGGCC GGTCGCCACT CCCCCTGTGT CACCGATCCG CGCGTCGGTC
AAGAAAGCGA GTTCAGAACC GTATCCAGAA AGGAACGACA CGGGTCTTTC GATTGTCAAT
CTCGAAACAA TTCGTCTCGT GGATGATGCG GAAGATTCCC TGCTGGGGGA AGAGTCCACC
GACGGTGATA CCCATACGAC GAACAGCTTT AGTCACGAGT CAGCGAGCGA CTCTCGATAC
GGATGCTGTG TGGGAGGTGA ACATCAAGCT TTTGGGGACT GTCAATTGTC GCACACACTG
TCCAAGTCGT TTATCCGGAC GGTTGAATTT CAATGTCGCA TGGAATCGCG CATTCGCAAC
GGTTGCAATA CGTGCTGCAC CATGTCTCTA TTGCCGTCGC GGAAGCGCAA GAACTCCATG
GACAGCTACC AAAACGACGA AATGCCTCTG CGCACGCGTT CGTCCGAACC GGAAACAGAA
GTGCCCGTGT GTGTCATGGA AGGATCGCAG GCGCTGCAGC CAAAACGAGT TATGCGACGC
AACCAAAATG TAACTATGGT GGAAGTCTGA CGAAGAAGCG TTTCATGGAG CTATCAATGT
GTATGATTGT CATGTTCTCG TCGACGTGGT GATTGTCTGC TGTGTACCAT AAAAAATTAA
ATGTTATCGT GTAGTTAGCG GCATTCACTG TG
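
The gene sequence is laid out in space-separated blocks of ten bases. A minimal sketch of how one might strip that layout and recompute the length and GC fraction; the function names are hypothetical, and only the first line of the sequence is pasted here for brevity:

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, as formatted on the page.
first_line = "CCCATTTCTC CATCAGCTTC TCATCCATTG GGAACACCAC CATGATGAGT AACCATCACT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.2f}")
```

Applied to the full sequence, the cleaned length can be compared against the listed gene length of 2312 bp and the GC fraction against the listed 51%.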
 
Protein sequence
MMSNHHCSNT NQQLQEKRLK INGSNSNNSS PSRSLARMAS HQHRVLLYAT CYNVLDGVTL 
TIRKLEQEIL NAGHLVCVLT TRSGDMANTN LDGSHPNRNI IFLDNAKPIP FLNDPNQPDL
AYQLGFGLSA TVRRQLDEFE PSIVHITCPD CTALHVIQYA RLKEIPIMGT YHSNIPEYME
HYPGLSWLKH ILSCFFRHQY NFLQALYVPT PYIQKHLEDN HEMDKVTSLQ VWGRGVDIDR
FNPSFRSLKY RRDLGIGDDT VVISWVGRLV PEKRVDIFAD TVRRLSAQGL NVHALVVGAG
PAEEEIKSLP NTTFAGWMNA DQLAVAYASC DVFLFPSSVE TFGNVTLEAM ASGLPVVVEQ
GCSGHLVRHG ENGYACQAGD ADAFFECTRD LVVDQTRREA FRETSRNMSL SLEKRAVVRR
MLDHYTRVTD EFYMEYGGHH ANRDQVYRHT GSFRAGSHPR PLILVFIEYL FIVLFQVIWN
MTEMFLYMQQ ALGSVRPVAT PPVSPIRASV KKASSEPYPE RNDTGLSIVN LETIRLVDDA
EDSLLGEEST DGDTHTTNSF SHESASDSRY GCCVGGEHQA FGDCQLSHTL SKSFIRTVEF
QCRMESRIRN GCNTCCTMSL LPSRKRKNSM DSYQNDEMPL RTRSSEPETE VPVCVMEGSQ
ALQPKRVMRR NQNVTMVEV
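
The record marks this gene as spliced, and the first residues of the protein appear to be encoded starting at the first ATG of the gene sequence rather than at its first base, so translating the whole gene sequence directly is not expected to reproduce the protein. The start can still be checked. A minimal sketch, assuming the standard genetic code (the Translation table field is blank in this record) and using hypothetical function names, that translates from the first ATG in the first line of the gene sequence:

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1, an assumption here),
# with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate_from_first_atg(seq: str) -> str:
    """Translate complete codons from the first ATG, stopping at a stop codon."""
    start = seq.find("ATG")
    if start == -1:
        return ""
    protein = []
    for i in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for codons with unknown bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, with the block spacing removed.
first_line = "".join(
    "CCCATTTCTC CATCAGCTTC TCATCCATTG GGAACACCAC CATGATGAGT AACCATCACT".split()
)
print(translate_from_first_atg(first_line))  # MMSNHH
```

The result matches the first six residues of the protein sequence (MMSNHH). Extending the check past the first exon would require the intron coordinates, which this record does not list.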