Gene PHATRDRAFT_42714 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42714 
Symbol 
ID7196115 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp873885 
End bp875829 
Gene Length1945 bp 
Protein Length580 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176672 
Protein GI219109838 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAAGG CGCTGACAGA CGCTGCTGAC TTGGATGAGA ACGAAGAGGC GTCTTCAACA 
ACCGTGATGC CTCTTGCTGC TGACGTCCAC AGTGTCTCTG GGCCTGTGGC CCCTCCTCCT
ACGCCAGTGA CTACAATACA ACAGCCACCA TCAACTGTTT TACCATTACA CCAAGGATCA
AAATCAGACA CAATTAACTT TAGCAACTTC AACCACCCAG CGTTGGATGG CAGCCTATCG
CCACAAAACC CCGCAATGGG TGTGTCAGGA GACAATCACA TCAATTTCCC CGAGGCAGTC
TCTTCTACCA CTGCGTCGTC AGATACAGGA GACCAACAAG CACAACTGAG AGCCATGTAT
CTAGCTGGCT TTCGTGCAGC TCAGGTGCAT AACGATCGTT TATCCTTAAA GGACAACTTT
GAAATAGCCA AGCATGACTC ACAACCAGGG ACTCTAACCA TGGAAGGAAC AAACCCTCTA
GCAGCGCCTG CGATTAATGC CGGCACGTTT CTCATGCCAG TAGCGACTGG AGAAGCAGCA
GGATTGGTTG CAGCGAGCCC CACGTCATCG AATTTTCCAC CGGGCAATGG GAGCACCATG
CTGACCCGCA GGCATAGCGA TCTTCCAGAC TCAGGGGTTG CGACTCGACG CATCACAAGA
ACGGCCTCGT CAACGAGCTC CATGGCAGCT TCCCCCGCCC TATCGGCAAC TGCCTCGCCC
AGTGGAGGGG GAAGTTCGGG CTCGAATCCG TTCCCGCGTA AGCTAATGGA CATGCTGCGC
AAAGAAGATT CATCCGTTGT TGCATGGCTC CCTAGTGGTG ATTCCTTCTC GGTACGAGAC
TCGGACCGTT TTGTGGCGGA TATTCTACCC AGATACTTTC GGCATACCAA ACTTACTTCG
TTTCAGCGTC AACTAAATTT ATACGGGTTT CGACGAATGA CAAAGGGTCC CGACGCCGGT
GCATATCGTC ACGACATGTT CAGGCGAGAC GATCCCGATC TGTGCCTACA GATGAAGCGA
ACCAAGCAAA AGGGATCAGC GTCTCCTCAA TTGAGACCGA ACGGACGAGG TGGTTCTAGC
TCGGTTACGT CGTCACCTCT TATGACTCCC GATCAAAGCC CTAGTCTATA TGCTTTGGAT
CCCGATGCTC TCAGCCGGAG TGCGCCCTCT ATACTATCTG CATCCGTGAT GGGACAGTAA
GTAACATTTG ATGGATTGAA ATGTGGGAAA TGTTGATTGA ATCTCACAGT TTTTTCTCTC
GCAAGCCCGA ATGAGCCTCC TCCGTTCAGC CTCAACCCTC CCAGTGAGCA CAGACGAGCT
GATTTTCGGA GTAATCCACC TGGCCATCCA GGGATTAACA TGGCGCAGAC AGGACTATCG
ATCCTTATGG GAGATAATAG TGTCCAACAT CAGTCTTCGT CTTCAGTTCC ACAAGGAAAG
TCACTGGGGA AATTGACTGC CGAGCAGCTA ACTCAGTATC AAGCCGATCT GATAGACAGG
GAACGGCAAG CCAGCGCCTT AGCAGCCGCA GGGATGGTGG CGGAAAGTGT CAATAAGACT
CAGGCCACCC ACGGTCACAG CATCGCGCAG GGCCTCGCAG CCCCGCCACA ACTGTCTCAT
GCCACTGCGA CTCCGACCCA GACCGCAAAT ATATCAGAGC TCGACAGCAT AAACTGGAAT
TTGATGGACA TAGGGGCGAT GCATCTTGAC GATATGGACA TGGATTTTGC TTCCCTTTTT
GACCCCGCTA ACGAAGCGGC AAGTATGGAA ACGGAAGGCA GCGGATGGCC AAATGTAGGA
AAGTCTGCCG CTTCTACCTC CAGCGATCCA AAGTAACTCA GGACACTTCT ACAGCGTCCG
CTTAGACGGT CAACAATTTT GAAAGGTATC TATACGACTT TATATTACTG ATAGCTCATA
ACTCTTCAGG GGCACTAGTC AAAGT
 
Protein sequence
MEKALTDAAD LDENEEASST TVMPLAADVH SVSGPVAPPP TPVTTIQQPP STVLPLHQGS 
KSDTINFSNF NHPALDGSLS PQNPAMGVSG DNHINFPEAV SSTTASSDTG DQQAQLRAMY
LAGFRAAQVH NDRLSLKDNF EIAKHDSQPG TLTMEGTNPL AAPAINAGTF LMPVATGEAA
GLVAASPTSS NFPPGNGSTM LTRRHSDLPD SGVATRRITR TASSTSSMAA SPALSATASP
SGGGSSGSNP FPRKLMDMLR KEDSSVVAWL PSGDSFSVRD SDRFVADILP RYFRHTKLTS
FQRQLNLYGF RRMTKGPDAG AYRHDMFRRD DPDLCLQMKR TKQKGSASPQ LRPNGRGGSS
SVTSSPLMTP DQSPSLYALD PDALSRSAPS ILSASVMGHL NPPSEHRRAD FRSNPPGHPG
INMAQTGLSI LMGDNSVQHQ SSSSVPQGKS LGKLTAEQLT QYQADLIDRE RQASALAAAG
MVAESVNKTQ ATHGHSIAQG LAAPPQLSHA TATPTQTANI SELDSINWNL MDIGAMHLDD
MDMDFASLFD PANEAASMET EGSGWPNVGK SAASTSSDPK