Gene PHATR_44026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_44026 
Symbol 
ID7203990 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp729608 
End bp732616 
Gene Length3009 bp 
Protein Length384 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186405 
Protein GI219113643 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAAAC TTCTTTACTT GCGATCAGGA CTGGTTTTTG CGGCCAGTGC TTTGTTAGGG 
GTTACGCCGG CCGATGGGTT TTCCCCGACA GCCACGCGGT CGCAGCTCCT TGCCGAGCTG
CAAAGGTGTC TCACTCCGAC CGACTTGCTC GACCGGGTCG GTGCACGTGT CTCGCGGAGT
ATCGATCCCG ACGGCAGCCT AGCGAGCCTC GTTCTCGTTC GTCTTTCTAA GCTTGTGATT
TCTTTGGATA ACCAAGACCG AGCTTTCGTG GCGGATGAAT CATCAACGAA GACTCTGAAT
GCCATCATCG AGAGTCTCCT CGATTCCGAC CCGTCATCAA ATTGTGAATC TATTGTCGAA
GCAACGAAAG CTTGTTCTAC TTTTTCGCGC ATTTCTCCAT ACTCACCAAA CTCCACTTAC
GAACGGCTTT TTAATTTTTG GAACGATTCA TCGGATATTG TCTCGCAACT AGAACCACAC
CACACTTCCG GTGTCAAGTG GGCTTTTGAT GTTCTTACAC TTCAGTCATC GGCACCCTCA
CCCACCGCGC TCGATCAGGC CTATGCCGAT CTCAATTTGC CATTCGCCAT CATACCTGGT
TGTCTTGCTG GATTGGAAAG CCTTTCCGTA GATCGACTGA CTTCACAGGT ACATTTCAGG
GTAGATGATA TTCGCACGAC TTCGAACAAA GTAGTTTCAG AACGCCGACA AACCGCTTGG
GAAGGAGATA CTGGCGTGGC GCCATTCACT TATAGTGGCA AGGCGATGCC TCGGAGCGAC
TGGTCACCAC TAGTTGTTCA AGTGCGCGAT GTCGTGCAAG CTCGTACAAA TCAGTACTAC
AATGGATGTC TTTTAAATCT CTATACGGAT GGCTGTAGTG GGATGCGGTA CCATATCGAT
CCAGACCAAG GGACTCTATG GGACTACGAC ACCGCGGTAG TTAGTGTGGG GGCATCGCGG
CGATTTGCGT TCCGTGCCAT GTCCTCTTCT ACTTTGCAGC CACACAATTT TGTTGTCATG
CACGGAGATT TGACGTACAT GTTTGGAGAG TGCCAGTCAC AGTTTCAGCA TACAGTGAAA
AAGGCGGACG ACAAAACCGA CACCACTCCG CGTGCGAGCT TGGTGTTCAA AAGAACCTGG
GAATGTAAGA AATGAAATTG CACATAAACT TATTCACTCT TTGTGGAATC AAAACGGAAG
TGATGGTCGT CTTTGTGGGA GCGCATAAGC TCAAGGAACA CCTTCTCGTA ACCACTGTGA
TAAAGGTGAT CCTCTATCAT GGTGCGAGAC ACATCGATTT CGGAAAAGAA AACATTGGGC
ACGATATTCC CTGAAATACC GCTCCCGTCT CTATACAGGT GAACGCAACG ACCCGGAGGA
TACAACTCCG GCTTAATCCG TTCGACCGGG TTCTGCTCGA TCGGAAGATC GTCCAGCAGT
GGCTCCACCA CACCACGGAT ATGCTGAGTG ACGTCTTTGG TAAATAAGAC TGGCTGGCGG
CGTGCCAACT CTTCAATCGC ATGATCTATA TCACGTTTTG CAAAGGGTAC CCAGTCAAAC
TCCATAATAT CCAATAGCAA GTTGGCCAAA TGGATGCCAC TCAGGCGAGG TACTATATCG
CAATCGTTTA TAACCGTCAA AATGTAGGAA GCTTGTTCAG CCAATTTCTT GCTGAGTAAA
GCCGGGCAAC CAAAGCCAAT CACATTCACA TCGAACTTGG GATGATCTCT GAGTTCCATC
CCTGCGATAG CTGCTGCACC AGCTCCTAGA GAGTGCCCAG TCAACGTTAG CTTGATGTTG
AACTTACCGG ATTTCTCAAG AAGCTCCTCC AAAAGACCTG TATGGGTATC AGCTATATGC
TGGCCGCTAC CCAAAATGAA CGAATGCGCT TTACCACCTC TGTACTCTAC CTCGTCGCAA
AGTAGGCCTG TCAAGGCATC TGCAACTGAT TTTGTCCCAC GGATAACAAT CAAGACTTCC
AAAAAGGGAT TAAAGGAGGA TTGATCTCGC TTGACCGCCA CAAAATTGGC GGGCTTTCCT
GGCTCGCTTC GAACCTCCGC GTATACAAGC TCGTAGGGTT CCTGTTGCCT GTGAAGACCT
TCACGAATAT CTTCAACACT GTCTGCGTAT GAGAGATACG CTAAGCTCAG GATGTTGTTC
AGGTCCTCAA CCTCTTTGAT GTTAATTCCG GGATAGAAGC GGTGCATCCG GCGCTTCCAG
CTTGGATTTT TAACTTCATC CTGGCCCTCC AAGTAGTAGA ACAAAGCACT AGGTGAAAGG
TGGGAAAAAT CGAGGGAGCC CATGTACTTC TCGGCAACGC TTTCTATGTG ATCCTTGTAA
TTATCCAGCA TTCGGAAGAG AGATTGAAAC GAGTGTGTGT CGACAACTTC GCCTTCGTCC
GATGGTTGCC TAGCTTTCTC TACGATATCT TTCAGATAGT CTTGTTTGCT GTCACCTCCT
GCAATGAGAC CTAAAAAGTC CTTTGCCATG GCCGAGAACG ATGGCTTTTC CTTATGCTGC
TTCTCAGGGC CGCTTTTTGT TTCTTCAGTG AATCCAATTG AATTTTCAGC TGCTTCGAAC
ATCGCCATAA GCTTATTGCC TACTGCAGCT GGCCGTAGGG CCACAGTTTT TTTTTGATCG
GCGTCATGGA CCGATTTTGA TGTGGAAGAA GTTGAATCGT CCGTCAGAAT TCTTTTCCAC
GCGTCTTCAG CTTTTTCAGT CAAATCTACT GTCATGTCAT AAATGGAGTC ATCTTTTTTT
GAATGCTTGC CCTCTGATTT TTCCTTCTCT GTACCCTCTA GGAGCGCGTT CTCATCCGAA
TGCTGGAAGG ACGTTATAGT CAGGAAAAAA GCTGCAATAA GTGTTGACGA CAGTACTCTT
CGATTTACGG AAAATGATTT GAAGCGTTGG AAAACCTGTG CATTGAAAGG AGACATGGTT
TCAAACTTCG CATCATTCGC AAATACCCAA ATCGACTCTC ATCAGCACTT GCTGTTACGG
TGAAAGGAT
 
Protein sequence
MGKLLYLRSG LVFAASALLG VTPADGFSPT ATRSQLLAEL QRCLTPTDLL DRVGARVSRS 
IDPDGSLASL VLVRLSKLVI SLDNQDRAFV ADESSTKTLN AIIESLLDSD PSSNCESIVE
ATKACSTFSR ISPYSPNSTY ERLFNFWNDS SDIVSQLEPH HTSGVKWAFD VLTLQSSAPS
PTALDQAYAD LNLPFAIIPG CLAGLESLSV DRLTSQVHFR VDDIRTTSNK VVSERRQTAW
EGDTGVAPFT YSGKAMPRSD WSPLVVQVRD VVQARTNQYY NGCLLNLYTD GCSGMRYHID
PDQGTLWDYD TAVVSVGASR RFAFRAMSSS TLQPHNFVVM HGDLTYMFGE CQSQFQHTVK
KADDKTDTTP RASLVFKRTW ECKK