Gene PHATRDRAFT_40749 details

Gene Information

Locus tag: PHATRDRAFT_40749
Symbol:
ID: 7198621
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011693
Strand:
Start bp: 299077
End bp: 300597
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002184775
Protein GI: 219129183
COG category:
COG ID:
TIGRFAM ID:
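
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the source record. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the unspliced CDS is a stop codon. The variable names are made up for the example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 299077
end_bp = 300597
gene_length_bp = 1521
protein_length_aa = 506

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
computed_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS is a whole number of codons and the last one is a stop.
codons, remainder = divmod(computed_length, 3)
computed_protein_length = codons - 1

print(computed_length, remainder, computed_protein_length)  # 1521 0 506
```

Under those assumptions the computed values agree with the stated Gene Length and Protein Length.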


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTTCCA TTGAAGTCAT TCGCTTTACA CGGAATCAAA GAATTTTGAG TGCATTCGTG 
GTACTTCTCA ATATTGCTGT CGCTCTGTAT CTCCTATTTG TCGACTCCCA GAGGAGTCCA
ACGCATGACC GCTCGTTTTC ATTTCCGCAC GCCCTAGCAA CATTGCCTCC CAACGCTTCT
CAGTCGGAAT CTATTGAGGT TGGGTCGTCG AGTAGGACGA GGGAGATAGC TACATTAGCA
TCCGCTATGG CTGAGATCAT ACGTTTACTT ATACCCGAGC CATCAACAGC CAACGAGTAT
CTGGCACAGC ATGAACCAAA AACTTTTCGA TTTTATGTAT ACGACAATTT ATCTCATGAG
TACACGTGGC AATATAGCGC TAGTTGTATG AAGGCGAAGC GCCGTCTGAG TGATACATGT
GATTGGGGTG AATCAGTCTG TGGGGAAAAG CGGCTTACTC GCAGCCCGTA TTCCAAGCGC
AGATTAAACC GCAACGGCGA TTTGGTTTTG AGCAAGGCTT TTTCGTCCTA TCAAGGAATC
CTACGAACTT ACGATCCGAT TGATGCGGAT TTGTTCGTGG TCCCGTATCC AAGTCAGGCA
CACTTTCACT GCAATCAAAC GTCCCACGAG GACGTGGAAA CGCGTTTACT GGATCGACTT
GCTTACTTCA ACAAAAAGAC CCGTAGAAAA CATTTGTTTT TTTCTTCCGC GGTACGGTCT
GCCTCCAACA AATTTATGGG TTCGCTGCCG CTCTTGGTAA CAATCGGGCC AGTTGACCGA
CAGTGCAGAA TAGGTCGGAA TTGCGGTCAA ATTGTGATGC CGTACGTAAA CACCAATCCG
GAGTATCAAC CAATGGTCGT TCAAAAGAAC CTCCGTTCCC TAAAGGACCG AAAGTTCGCC
ATGGTGGCAA AATTCAACGC CTATATATCC GGTAACAGCA TGCCTCGTAG CGATTTTCTC
AAGGTTGTCG GTAACGTGAC GGCGATCGCT GGATTCCCTG TGCTGATTTC GGCGCTGGGT
CGGCGCCGAA CCATGCCCAA CGAGCGCAGC GTACTAGAAG ACTACCGCAA CGCAATCTTT
TGCCCTTGTT TGCGAGGCGA CGAACCTCCG CAGAAAAGGC TGTTCGACGT CATGATGTCG
GGATGTATTC CGGTAGTATT GGACTTTCCA TCGAAAGACC CAGGCTACCG GTCACATTTT
GCGTCTATGG CAACGTCAAC GCGCGGGGCC TATCCTTTTG CCAAGGGTTC TTTCCACGGT
TGGCCAGAAA TGGGTTTGGA CTACAACGAG TTCATGGTTA CTGTGAATGG TACTTGTGGT
GTATCATGCA TTGTTCCGAC TCTGGAAGAT TTGCTCTTGA ACCATCGTGA TCGATTGGTA
AACATGCAGG AGCGGCTAGC AAAAGTTATC AAAGTGTTCA GTTATGGGAT GGAGCACAAT
ACATTACAAC ACGCAGACGC GATATCAGCG ATTCTCGTGC AAGTAAAGCA CTACGTCGAT
AGTCTCGGTC AAGTTTCATA G
 
Protein sequence
MLSIEVIRFT RNQRILSAFV VLLNIAVALY LLFVDSQRSP THDRSFSFPH ALATLPPNAS 
QSESIEVGSS SRTREIATLA SAMAEIIRLL IPEPSTANEY LAQHEPKTFR FYVYDNLSHE
YTWQYSASCM KAKRRLSDTC DWGESVCGEK RLTRSPYSKR RLNRNGDLVL SKAFSSYQGI
LRTYDPIDAD LFVVPYPSQA HFHCNQTSHE DVETRLLDRL AYFNKKTRRK HLFFSSAVRS
ASNKFMGSLP LLVTIGPVDR QCRIGRNCGQ IVMPYVNTNP EYQPMVVQKN LRSLKDRKFA
MVAKFNAYIS GNSMPRSDFL KVVGNVTAIA GFPVLISALG RRRTMPNERS VLEDYRNAIF
CPCLRGDEPP QKRLFDVMMS GCIPVVLDFP SKDPGYRSHF ASMATSTRGA YPFAKGSFHG
WPEMGLDYNE FMVTVNGTCG VSCIVPTLED LLLNHRDRLV NMQERLAKVI KVFSYGMEHN
TLQHADAISA ILVQVKHYVD SLGQVS
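
Both sequences are printed in space-separated blocks of ten residues, so they need cleaning before use. The following sketch strips the grouping and translates the first and last lines of the gene sequence as a spot check against the protein sequence. It is not part of the source record. It assumes the standard genetic code (NCBI translation table 1), because the Translation table field is blank, and the helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Assumption: standard genetic code (NCBI translation table 1); the record
# leaves "Translation table" blank. Codons are ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocks: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of ten."""
    return "".join(blocks.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from this record.
# Each full line holds 60 bases, so every line starts in frame.
first_line = "ATGCTTTCCA TTGAAGTCAT TCGCTTTACA CGGAATCAAA GAATTTTGAG TGCATTCGTG"
last_line = "AGTCTCGGTC AAGTTTCATA G"

print(translate(clean(first_line)))  # MLSIEVIRFTRNQRILSAFV
print(translate(clean(last_line)))   # SLGQVS*
```

The two results match the first twenty and last six residues of the protein sequence, and the trailing `*` corresponds to the terminal TAG stop codon.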