Gene PHATRDRAFT_34965 details


Gene Information

Locus tag: PHATRDRAFT_34965
Symbol:
ID: 7200157
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011674
Strand:
Start bp: 769402
End bp: 770814
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002179503
Protein GI: 219117417
COG category:
COG ID:
TIGRFAM ID:
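The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the gene length counts the terminal stop codon (the record lists the gene as unspliced). The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 769402
end_bp = 770814
gene_length_bp = 1413
protein_length_aa = 470

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the unspliced CDS includes its stop codon, so the protein
# has one residue fewer than the number of codons.
codons = span // 3
residues = codons - 1

print(span, span % 3, residues)  # 1413 0 470
```

Under these assumptions the span matches the listed gene length, divides evenly into codons, and yields the listed protein length.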


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGGTGG ACACCGAGCT GACGACCACC GCGGTAGCTG TCGAAGGTCT CAACAACATC 
ATCACGGGAG GGCCTTCCCC GGATCAGCCG AATGAACGTT ACAAGGCGAT CAGTCTTTCT
CATCTAGAAA AGGTTGCCGA AAAGCATCCA CAGCTCAAAC CACACTTGCG GGACATTCAG
CTGTCTGGCC TCGTCTTTCC TTTCAAGGCT TCTCCCTACT ACGTTGACGA ATTGATTGAC
TGGGAGTGCG AAGATGTCAG GGAGGACCCG TTTTACAAGC TTGTCTTTCC GACCATGGAT
ATGCTTATCG AAGAACACCG TGAGAAGCTC GAAAAGGCGC ACAAAGCAGG GGACCCGGTA
AAGCTGATCA AGACTGTAGC TGAGATTCGC GAAGATCTTA ACCCCCACCC GGCTGGTCAA
AAGGAGCTCA ACGCTCCCAA AGAAGATAAG CTTACAGGTG TCCAGCACAA GTACAGTGAG
ACAGTTTTGG TCTTTCCTGC CGCCGCTCAA ACATGCCACG CTTACTGCAC TTACTGCTTC
CGATGGGCGC AATTCATTGG AGACGACGAA CTCCGATTTG CTCAAAAGGA GGCTACCTCG
CTTTTTGAAT ATCTTGCCGA ACACGAGGAA GTCTCGGATA TACTCATGAC AGGAGGAGAT
CCTATGATCA TGAAGACCAA GTCGTTGGCG CAATACTTGG AGCCTTTGAC CGACCCCAAC
TTTCTGCCAC ACATCAAGAA CCTTCGGATC GGAACCCGAA GTCTTTCCTT CTGGCCCCAA
CGATTCACCA CGGATGACGA TGCCGACGAG TGCATTGAAC TCTTTCGACG GGTACGTGAG
CAAGGCAACC GTCACATTGC AATTATGGCT CATTTAGGAC ACGACCGTGA ACTCTCTACG
GACAAATTCC AGGATGCCGT CAATCGCATT CAGAAGGAGG CCTACGCCAC CATTCGTTCA
CAGAGTCCCA TTATGCGCGG AGTTAACGAC GATGCCGAAG TATGGGCCAG AAAGTGGCGC
AAAGAGGTGC AAATGGGAAT CATTCCCTAC TACATGTTCA TGGCACGTGA TACCGGTGCG
CAGCAGTACT TTGATGTACC TCTGGTTCGT GCCCACAAAC TTTACAGCGA CGCCATTCGC
AATTGTTCTG GTTTGATTCG TACGGCCCGT GGGCCCTCTA TGAGCTGCAC TCCCGGAAAG
GTGGAAGTCA CCGGCGTTGA AGAAATTATG GGACAAAAGG CCTTTGTTCT CCGGTTCTTA
CAGTGCCGTG ACGAGGCTTG GATTGGGCGT CCCTTCTTTG CCAAGTACGA CGAGAAAGCC
GTCTGGTTTG ACGACTTGGA GCCCCTTCCA GGGATGGAAT TGCCCTGGAA CGAGAAGGGC
CTCCCTCGTC CTATCTGGCC CAGTTTGAAT TAA
 
Protein sequence
MAVDTELTTT AVAVEGLNNI ITGGPSPDQP NERYKAISLS HLEKVAEKHP QLKPHLRDIQ 
LSGLVFPFKA SPYYVDELID WECEDVREDP FYKLVFPTMD MLIEEHREKL EKAHKAGDPV
KLIKTVAEIR EDLNPHPAGQ KELNAPKEDK LTGVQHKYSE TVLVFPAAAQ TCHAYCTYCF
RWAQFIGDDE LRFAQKEATS LFEYLAEHEE VSDILMTGGD PMIMKTKSLA QYLEPLTDPN
FLPHIKNLRI GTRSLSFWPQ RFTTDDDADE CIELFRRVRE QGNRHIAIMA HLGHDRELST
DKFQDAVNRI QKEAYATIRS QSPIMRGVND DAEVWARKWR KEVQMGIIPY YMFMARDTGA
QQYFDVPLVR AHKLYSDAIR NCSGLIRTAR GPSMSCTPGK VEVTGVEEIM GQKAFVLRFL
QCRDEAWIGR PFFAKYDEKA VWFDDLEPLP GMELPWNEKG LPRPIWPSLN
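The relationship between the gene sequence and the protein sequence can be spot-checked by translating codons. The following is a minimal sketch, not the database's own method: the record leaves the translation table field blank, so the standard genetic code (NCBI table 1) is assumed here. The names `CODON_TABLE`, `clean`, and `translate` are hypothetical helpers. Only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Assumption: standard genetic code (NCBI translation table 1),
# written in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spacing used in the record's 10-base groups."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
# Each full line holds 60 bases, so every line starts in frame.
first_line = "ATGGCGGTGG ACACCGAGCT GACGACCACC GCGGTAGCTG TCGAAGGTCT CAACAACATC"
last_line = "CTCCCTCGTC CTATCTGGCC CAGTTTGAAT TAA"

print(translate(first_line))  # MAVDTELTTTAVAVEGLNNI
print(translate(last_line))   # LPRPIWPSLN
```

The two outputs agree with the first 20 and last 10 residues of the listed protein sequence, which is consistent with the gene sequence being shown in its coding orientation.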