Gene PHATRDRAFT_49738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49738 
Symbol 
ID7198430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011692 
Strand
Start bp112249 
End bp114086 
Gene Length1838 bp 
Protein Length544 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184488 
Protein GI219128582 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GGCGGCACCG TCGTTTTCCA GTCCTTGGCG TTGCCAACAC ACTCTCGAAC AACCACCAAG 
CATCATTACA TATATATACA TATACCAATC AGTGAGGAGA TACTCGTTGT TTTGCCAGTG
CACATCAACG TGTTCGTCGT CGATAGTTTT CGTACTCGTT TCGCGACAGC ATGAGCACCG
TCCAGGCCAA CGGAATCCGG GACGACGCTG CGCTGTCGGA CATTGCGATT CCATTCTACC
GCAGTACGCC AATCCTCGGA TCGTTGGCGG TGATACTGCT CACTACGGTG GTCTACTTCG
TCTACCATCG GTACTATCGT TCCGCCCCGC CCTCCAAACG CGGAGCCCAC GTCCAGGTCT
CTTCCATCGA CGCACAAAAC TCCGTCAGTT TGGCAGACAT GGCCTACCTG GCGCACGTGC
TATCGCCCCA GTCGACCCAT ATGGACGTAC TGTGGGCTGC TATTTCGACA CCCGAAATGC
TGCAAACCAG TGAGGCCGAA GTGCACAAGG TGGAGCGGAT CCGACGAGAC CGACAAAGCC
GACGCGCGAC GCAAAAACCC AACATTCCCG ACGAATTGGA AGCCCTCGTG GAAGACGACG
ATGGATGGGG GGAAGACGAA GACGATGTGG ACGAAAGCGA CCAAGCGGCG GCACAAGTCC
GTGCCAAAGC CAAACAGGCC GAACAGGAAC GCCAGCAAGA TATGGAACGG CTTAAAGCCG
CCACCGGACA AACCAACGAA CCGCTCGAGG GAATCGATCC AGGAGTTATT GGACAAACTT
GGGTCGAAGC GACACTGGCC CAACACCAGT CCTGGCCACC GCCATTGACG GACGTTATCA
CGGCACATAC GTATACTTAT GAAGACCAGC CCGTTGCGAA CCCACTCGAC CACAAAGGTC
TGCGCCGTTA TCTCTGTATG ACCATGGGTC GTTTGAACGC ACAAATACTC AACACCAAAC
CCGAACTACT CCAAGCCGGA GCGCAAAAGC TCATCGACCA GACGTATTTT CGGGGCAGTT
TGGAATTCCG TGGTCGCGTC GGGACCTTGT TGGAAGCTAT CTTGCGACTC GGTACCGTGC
TGAAATCCCG GGCACTCGTT GCAACCACCA TTGAAGCCGT CGCGTCGTTC AAAGTCGGGT
GTTTGCCGGG AAAATCGACG ACTTGGTTCC TGCAAACCAT GCAACGCCAG TACGGATGTC
AACCCCATCT CGCGATACAC GAAAAGAAAG TCGAAGTCCC TCTTTACGAA GAAAGCAGCA
TCCTGGCCAC GGGAGACATG GCGGAATTTT TCCTCGACTT GGAACGCACC CACGCAGAAA
ATTTTCTGAA ACAAAAGATC GCCATGTGTC AAAAACAAGG CATCCCGCCG CAAGTCGCGC
TGCAAGCCTA CCGGGAAGGA TGGTGGTTCC TATTGCGGGC GGAAAACGTG AACGACCCAA
ATATTCGAGC GGAACCGATC ATGCGCGAGT CGCCCATTCT TTCCAAACTC GACAGTCAAG
ATCTGGACAA GTTTGAAGCC GCGACACCGG CCGCACAGCG GCTCATTACC GCATGGCCCA
TGATTGTGCA AAACTGTGCC CAAAAAGCGG GCAAGGTGCG GATTCAGTTT CCGGTACCGT
CCATTCCGGG CAAGTACCGG TTGGTGCTCG ACATCAAATC GCAAGATTTT TTGGGCGCCG
ATCAACAAAT CGTCATCGAA AAGGATGTTG TGGATGCCCA GACGATCCAG CGAACACTGA
AACCCAAGGA AGAACAAGCC ATCAAAAATG AAGAGTCCAA GGGGGAAACC AAAAAGGAAG
CGTAATTGCG TCTTATAATA TACTTACCGA TTCAAAGT
 
Protein sequence
MSTVQANGIR DDAALSDIAI PFYRSTPILG SLAVILLTTV VYFVYHRYYR SAPPSKRGAH 
VQVSSIDAQN SVSLADMAYL AHVLSPQSTH MDVLWAAIST PEMLQTSEAE VHKVERIRRD
RQSRRATQKP NIPDELEALV EDDDGWGEDE DDVDESDQAA AQVRAKAKQA EQERQQDMER
LKAATGQTNE PLEGIDPGVI GQTWVEATLA QHQSWPPPLT DVITAHTYTY EDQPVANPLD
HKGLRRYLCM TMGRLNAQIL NTKPELLQAG AQKLIDQTYF RGSLEFRGRV GTLLEAILRL
GTVLKSRALV ATTIEAVASF KVGCLPGKST TWFLQTMQRQ YGCQPHLAIH EKKVEVPLYE
ESSILATGDM AEFFLDLERT HAENFLKQKI AMCQKQGIPP QVALQAYREG WWFLLRAENV
NDPNIRAEPI MRESPILSKL DSQDLDKFEA ATPAAQRLIT AWPMIVQNCA QKAGKVRIQF
PVPSIPGKYR LVLDIKSQDF LGADQQIVIE KDVVDAQTIQ RTLKPKEEQA IKNEESKGET
KKEA