Gene PHATR_42051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_42051 
Symbol 
ID7204413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011679 
Strand
Start bp168869 
End bp170608 
Gene Length1740 bp 
Protein Length579 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185644 
Protein GI219120825 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGAGTG CGAAAGCCGA GTACAAATTC TTGACGGGGC AGGACTTTGG TCCTCCACCC 
AAGGAAGAAA AGAAAAAGAA ACAGGACAGT CCCGCTGAAG CAGACGAGGT TCCTTCCGAA
AAGAATAAGG AAAAGCGTGC AGCCAAAGCA GCAGCCAAGG CTGAAAAGGA GGCGAAAAAG
GTTGCACAGC GGGAAGAGCG AGCCCGTCGT GAAGCCGAGA AGACCGCCAA ACTCGCCGGC
ATTGGGCAGG ACAATTTTGG CGATGCGCCC TTGATCCAGT CGCAGATCAT CACGGACAAA
GTGTGGACTC CCATTCAGAA TTTGAAACCG TCCCTCGCGG GGGAAAGTGT TCTCGTTCGG
GGGCACCTGC AAACAGCTCG AGCCGTTGGG AAGGGAGCTT TTGTCCTCGT GCGTTCCAGT
CTTTACTCCG TACAAGGAGT CGCCTTTGAA TCGAAAGACG ACGATGGCGC CGTCAGTTCC
GCTATGATCA AGTACATGGC TGGACTGCCA GCAGAGTCGG TCGTGGACAT GCGAGGGATC
GTTACGGTCC CCGACCAGCC CGTAGATTCC GCGACGCAAA AAATGGTCGA AATTAAAATT
GAATCGTTTC ATTGCGTCGC CAAGGCCAAG AAAGCCTTGC CCTTTCAAAT GGAGGACGCC
TGTCGACCTG ATTCCGGAAA GGAAACCGAC ATTGGTGCCT ACAATGAAGA TGATCCGGAA
GTTGTCGACT CCGAAGACGG TCTCATCCAT ATCGGTCAAA AGATGCGTCT CGATTATCGT
TGGATCGATT TGCGGACCCC CGCTAACCAA TCCATCTTCC GCATCGAGAG TATGGTCGGC
TGCTTGTTCC GTGAGTTTTT GCTTCAACGC GGGTTTGTAG AAATTCACAC ACCTAAGCTC
ATTGGAGGAG CCTCCGAAGG TGGCTCCGAC GTCTTTACGC TAGACTATTT CGGACAGTCT
GCCTGTTTGG CCATGAGCCC GCAGCTCCAC AAACAGATGA CGGCCGCCTG CTCTGGCTTT
GAACGCGTTT TCGAAACCGG TCCTGTATTT CGGGCCGAAA ATTCGAATAC CCGTCGGCAC
CTTTGCGAAT TTACCGGACT CGATCTGGAA ATGGTCATTC ACGAGCATTA TGATGAAGTG
CTGGCGGTCA TGAGCGAGCT CTTTATTTAC ATATTCGATG GCGTTAACGA GCGCTGCAAG
CCGGAACTGG AACGTGTTCG GGAGCAGCAT CCGTTTGAGG ACTTGCAGTA CCTGAGCCCG
ACGCTCAAAC TGACCTTTGC CGAAGGCTGC GCCTTGCTCC GTGAAGCTGG CATCGATCAA
GACAATTACG AAGACTTGAG TACCGAAAAC GAAAAGAAAC TCGGCGACAT TGTCAAGCAA
AAGTACGGGA CGGACTTTTT CTTCTTGGAC AAGTTTCCTT TGGCGGTGCG GCCATTCTAC
ACCATGCCCG ACCCGAACGA CCCCAAACTT TCCAACAGCT ACGATTTTTT TATTCGTGGT
CAAGAGATTG TGTCCGGTGC CCAGCGTGTT CACGATCCTG ACTTGATTGA AGAGCGCGCC
AAGGCTTTGG GTATTGATGT CGAGAGCATC GCCGACTACG TTGAATCCTT TCGTCACGGT
GCCCTACCGC ACGGTGGCGG CGGTATCGGA CTGGAGCGTG TCGTCATGTT GTTTTTGGGC
CTACCCAATA TTCGCAAGGC GGCCTGGTTT CCCCGTGACC CAAAGCGTAT TTCTCCGTAA
 
Protein sequence
MLSAKAEYKF LTGQDFGPPP KEEKKKKQDS PAEADEVPSE KNKEKRAAKA AAKAEKEAKK 
VAQREERARR EAEKTAKLAG IGQDNFGDAP LIQSQIITDK VWTPIQNLKP SLAGESVLVR
GHLQTARAVG KGAFVLVRSS LYSVQGVAFE SKDDDGAVSS AMIKYMAGLP AESVVDMRGI
VTVPDQPVDS ATQKMVEIKI ESFHCVAKAK KALPFQMEDA CRPDSGKETD IGAYNEDDPE
VVDSEDGLIH IGQKMRLDYR WIDLRTPANQ SIFRIESMVG CLFREFLLQR GFVEIHTPKL
IGGASEGGSD VFTLDYFGQS ACLAMSPQLH KQMTAACSGF ERVFETGPVF RAENSNTRRH
LCEFTGLDLE MVIHEHYDEV LAVMSELFIY IFDGVNERCK PELERVREQH PFEDLQYLSP
TLKLTFAEGC ALLREAGIDQ DNYEDLSTEN EKKLGDIVKQ KYGTDFFFLD KFPLAVRPFY
TMPDPNDPKL SNSYDFFIRG QEIVSGAQRV HDPDLIEERA KALGIDVESI ADYVESFRHG
ALPHGGGGIG LERVVMLFLG LPNIRKAAWF PRDPKRISP