Gene PHATRDRAFT_44809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44809 
Symbol 
ID7199760 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011673 
Strand
Start bp295480 
End bp298914 
Gene Length3435 bp 
Protein Length385 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178972 
Protein GI219116354 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AAGGTTCATG TTTGCCATGG ACAGCCATGC CCCTTTAGCT AGCTAGTAGA GTTGACTAAC 
TGTAAGAGAA TCGAAAGCGC AAAAAGGCTT CAAAGTTTGC AATTATTGGC CGGCAGCGAG
TGACCCATCG GTATGACACT CACAGTACTT ATTTGCTCTC GTACGGAGGA CCTTCGATCC
CTGACATATC GTGGGATAAA GTTAGGGGTT CCGAATAGCC GCTGTCCTGC GCTCTATGAT
GTAGACCATG GCACAAATAC TGGGATTGCC AGTGTCTACT AGTGCGTCAG TTGTTCCGTC
CTGGATCGTC GGCCGTGCTC GCTACGTTTG ATCAGCACTC AGGATCATTT GCTTTATCCT
AGTCCAGCCG GACGACGGCA AACATGTGTG GCCGAACCGC ACAAACAGGA GATGCCGTCC
GAGCTGCCGC GAAAAGTTTA GGAATAGCTC TCGAGACTAC GTCAAAGTCG AACGCTGGTA
GTACGACGAA TTTTTCGGGG CCAGAGAAGC AACAGCAACA CATGGAATCT ACTGATGCTC
AAAGAAGCTC TTCATCCTCA AAGCCATTAT CTGTGGTCAA GGAATCCATG TTTTCTTCGT
TGCATGACAA CGAGTCTTCT GAGTTGCGCA ACAATTTTAA TCTCAGCCCT GGGATGGATG
CAGTCGTCTT TTGGAAAGAC AAATTTGACA CTGTTCGCGC AACGCGAAAG ACTTGGGGCT
TAATAACACG GGGTGGAAGC GAAGAGAAGC CGATCGAGGA CGGCATGGGC AAGCATTTTA
GCAATCTTAT GTTCAACGCT CGTTCTGATA CCCTTTATAC CAAGCCGACC TTTGGACGAC
TCGCGACATC AGGCAAGACC TGCTTGATTG CGGTAGACGG ATTTTTTGAA TGGAAAACTG
TCGTGGGAAA GAAACAGCCT TATTTCGTTT ACCGCAAGCA ACACGAAAAC CAAAAAGCAG
AAGAAAATAG GCAACGGGGA CTTCCTACTG ACTGTAAAGC ATCCTCTCGG CCGTATCTTT
TATTGGCGGG CTTATGGACA AGTGTTCCTA CTGGACTGGC GGACGGCGAT ACGCTCGATA
CTTTCACAAT AGTGACCACG GAAGCTTGTC CGCCACTACA ATGGCTACAT ACTCGTATGC
CAGTTTGTGT ATGGGAGGAC GCTTTGGCCT GGGAATGGTT GCGACATCCA ACGCAAAGAT
GCCACCGCAA GTTAGAAGAC GCCTCTCGAA ATACAAAAGA CAACCTTTTG GCTTGGCATG
CCGTGACGTC GGAAATGTCG AAACCCAAAT TTCGCTCGTC GGAAGCTATT AAGGCACTGC
CGCAGCCAAA ATCGATTCAA TCGTTTTTCG CCGTGATGGA AAATGACAAG AAATTGGATT
CTTCACCGTC TAGATCCCCG TCGCGCAAGA AACCACCTAC CTCTCTGTCG AAACGGAATG
CGAAGAGTCA TCAAAACAGC CAAACTCCGA CGAAAAAGAC CAAAATCAAA ACTAGTGGAA
TTGCATCCTT CTTTAAGCCA AAGCCATCGC CTCCATCTTG ATGGTAGCGA TGGCAGAACC
GTGCCCTTGA CAAAGGGGGC GCATTCGATT TAATCTCGAT ACGACGTTCA AAGCACAACC
CCATGTAAAT TGAGGTGAAT ACGCCTACTT ACCTACCTTC ATTCGATACA GTCAATGCAC
GAAGGATTTT TCATAGAAAA ATTTCACAGT CTAAACAATG ACGTAATCCG GGAAAAACGT
TTCGAAAAAA TCTATATCAG GATTTGCAGG CACGTGGCCC AAGAGTCTTT CTAACCTGCC
CGAACTGTGC TTCAATGACA CAAATTGTTT ACTTTCTAGC TTATTGGCAC GCTGTTTAAA
ATACTGACCT TGTACTAATG AGCGAAGATG ATTTGTAGCT AAAAGGAGAA ACAGAACCGG
CTCAGCTTGA GCGGCTTTCG GGAATAGCAG ACTGACACAA AATGCGCAAC CCATAGTGGC
TAAGTCCGCT CGCAATTACT ATAGCCCTGG GCTTTCTGTG GATAAAGTTT GCAAGTTCCC
GTTGCTGGAA TAGCTCCAAG AATTCTGCTG CTTCTGCTGT TGTGGCTGCG GATTGTCCGT
TGGCACGGCT AAGAGTGCCG ATTGCATTGG AATGTGATCG ATAGGACACT TGGGCGGCCC
TACTGGAGTT TGCAGGCTCG GTTTAAGGCA GCGCTCGTGA AACATGTGTC CACAGGGACG
CAATTGAGTG TCGACATTGG TCTGTGAGCA CAGCGCACAT ATCGTCTCCG ACGGAACTTG
GACATGACCA TCCCCTTCAC TCGGATCAAC TGAAGATGCG CCGTCCTTTC CATAATGCAG
CTTATACTGA CTACTTCGTT CTTCCGGCAT TTCAACAGCG TGTTGTCCTG GATGCTGATG
CGAGGTCTCT CCTTCTTGGC CCAAACCTAC TGGAGCCGAA GCTGGTGGAT AGGAATACTG
GCCGTATTGT TGATGTTTTT CGCGATTGAT GAACTCCTGG GATAGTGTAC GCTGAATCTT
CGCTGGTAAC ACCAAATCAG TAAGTTTTTC ACAGCTTCTG CTTACGGACA TGAGACCACG
GGCTGCCAAA GCATCATCAT ACGCGCGTTT CAGAGTTCCC GTATCGGTAT TCGACGTTTC
TGACCCCATA CTCGACAAAG CAGCCAGCCT AGGAGCCCCC GACACTGACG CAGGCAACGA
ATATTCCTTA CGCATTGACT GCGAAGGCGA TTCGTCTTCC AAAGACAAGC CAATCTCGTT
ATCTTCATCG CCTGAAGGTT TGGTAGCTCC TTTTTGATCC CGACCTCCCA TTGCATTAAG
ATTGTTTGTT GAATTACGGT CACCACCAAA GAACACTTCC AGAAACGCAT ACGAGTCATT
TTTCCCTATG GAATCCATAC TCTTCACTTG AGCCAGATTA CCCATTTTGT TCAGATTCCC
CTGGCTCGCT AAGTCCGCTC CACTGAGTGA ACTGACGCTC TTTACACCCG GAATAGTTCC
GAGATTACTC AACGATGTGA TGCTTGAGAA ATGCGGCCAC GAGTTCAGCG TTGGCACGTT
TGGAAGAGCA TTGTGAGACG GTGCTTTCAT GGGATTTTGA TTTGCGCCTT GTTGCTGTGA
AGCAATGCTT GTTGTTTGAG GATGTTGCTT GGCTTCTGCC TTCAGACGCG CTGACTCGGT
TGCCTGCCGT ACGAGATCTT GATCACGGGC GAACAATTCC ATGAGCACAG CATGACTGTC
GGTTTTGCTC AATTGCGTAC TTGAGGACAG CTGAGGCAGG GTGCCGATCG AGCTGCTCGT
TGACAGTTGA GGGTTTGATT GCATACCCTG ATCTGACGGG AGTGTGGAGG AGTAGTAGTG
TGCATCGTTC GTTTGTTCGT TAGAGGGCGC CGACGAGTAG ATCGAATTAC CGTATATGTC
GGTAGATCCC GACGA
 
Protein sequence
MCGRTAQTGD AVRAAAKSLG IALETTSKSN AGSTTNFSGP EKQQQHMEST DAQRSSSSSK 
PLSVVKESMF SSLHDNESSE LRNNFNLSPG MDAVVFWKDK FDTVRATRKT WGLITRGGSE
EKPIEDGMGK HFSNLMFNAR SDTLYTKPTF GRLATSGKTC LIAVDGFFEW KTVVGKKQPY
FVYRKQHENQ KAEENRQRGL PTDCKASSRP YLLLAGLWTS VPTGLADGDT LDTFTIVTTE
ACPPLQWLHT RMPVCVWEDA LAWEWLRHPT QRCHRKLEDA SRNTKDNLLA WHAVTSEMSK
PKFRSSEAIK ALPQPKSIQS FFAVMENDKK LDSSPSRSPS RKKPPTSLSK RNAKSHQNSQ
TPTKKTKIKT SGIASFFKPK PSPPS