Gene PHATRDRAFT_41365 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_41365 
Symbol 
ID7199167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011697 
Strand
Start bp282682 
End bp284643 
Gene Length1962 bp 
Protein Length653 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185355 
Protein GI219130401 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATCTC TCTCGCAGCT TCTCCGGCGT ACCATCCGCG AAGGTGTACC GTTGTACAAC 
GACGGGCAAA CGCAGGAATG CTACGAAGTC TACCTCATGG CCGCCCAGCA CTCCCTCGAA
TCGCCTCTGG TGGTACAAAG CCGCGCTGGG ATCAGTCTCA AGGACGCGTG TCGCCAGGCA
GAAGCCAAGG CTACATCGCC TTATCTGAAC TTCTCAGAGG CAGCCTGGAT CTTGCGCCGC
TGCTTTGACG AGATTTTGTC TTCGCAAGCG TCTCGTCAGC TCAACCTCGC ATCTTCGTCT
ATAGCCAGCC ATGCAGATAG TTCCGACAAC GAAGTTGGCT ACTGGCAAGA TGAGCGGGTA
CTGAACTCCA GCGATGCTGA GAAAGGACAG GCATTGGCCG ATATTGGCTT GATTCTTCAT
ACTCGTCTCA AACCCGCTCC ACAGGAGCAG GTGGAAACGA CTGTCCCTTA TGTTGTTCCA
GGGTCGGAAA TAGTCGAGAC TCTGACCTAC TTGGGCCTGG CAGGAAACAG CGCTTTGGCA
ACTGAAAAAT GCAATCTTTT GGTGCGATCA GGTCTATTAG TCCCTGTCTC GTCCGAGGAC
GCCACTCGTT TTCAGGACGG CACCCACTTG TATGCGTTTC CGGGTCGCAA CGAGCTGGAA
GCCTCCCTTG AAAGGATTAT TGCTACCGAC CGCTCCGCAA TTGACGGTAC GGAAGAAAGT
GTTCTGCGAA TTGCTCTGGA AACTTCTCTG AAAGGTACAA ATGTGGATGT GGAATTAAGA
CCTGTCCGGG AACATGGACA TCGCTCGTCG GCTTTCGTGA TCGCAGAAGA CTTGGCCGGT
GCCAAGATTT CCGATTCGCT GGAGGCTTTG CTGCCCCATT TGGTCATTGC CGATCGACGG
TACAATCTCA AGAAATATGA GGCTTGCTTT TTAGGCAATC AAGCCGTAAC GGCTGTGTTG
GATACAAAAT TGGCCGACTC GCGCAACGAA GCACTCGTGA TCCTAAATGA CATGCTTAAT
GTTGGATTAA TCCATCACGT CACCCACGAC CATTTGGTGG AGGACAAGGT GTTATTTTAC
CGCGTCACAT CGATTGGTGA TATTAAAACT GCGCTCGACA ATACTTGCGC TTTGCCTGAG
GAACCTAAGA CCGGCTTCGA ACGGCTCCGT CACTCCGCGC TCTTGCGACG GTACAATCAG
TTTGCCAGTC TCAATATTCC CGATATTTTG AACGCTTTTT ACGGATGCGA TAGCGAATCC
GGCTGGGATG AGGTAGACTT GCAGAATTGG CGGACGAACA TGAAGCGCTG GGGCTTTGGG
AGACGAGAAG ACCAGGATGA TGACATGGTC CATCGTTTGT CGCCGTTGCT GTTGAGTATT
GATCCGGAAA CTTGGGACGT GACGGACGAT GAAGAGTGGG AATCCCCATT TGGTATTATA
GCTCAAATCG CCATTTTCGA TCAAGTCTCA CGTTCCGCAT TCCGCGGGAC GGCCGACGCC
TTCAAGTGGG ACAAAATCGC CATTCGTGCC ACGAAAGTAG CCATCGCCAA GGGATACTTT
GAAACGGCCT ACAAGTCAAC CTTAAATCAG TTTCTTATTC TTTTACCTTT GGAGCATTCC
GAATCGTGGG AAGACCAAAA GCTAGGCGTG CGCTTGCTCT TAAAGCTCCT CAGTACCGTC
GCTGTTCAGG ATGAGGGCTT TTCCGATTAC GAAATTGTGA AACGCCTTGA ATTTTCCAAA
CGTTTATCGA CGGCTTTCTT GGAGCACGCC CAGGTAGTGG TTAAATTCCG ACGGTATCCG
CACCGCAACC GAGTCCACGG GCGGAGTACT ACGTTGGAAG AGCGAATATG GTTGGCCTCT
GATCTGGTGC CGCGCTGGGC CAAGTCGCAG AATCCCGAAG ACGCCCACAA TCTAATAAAG
TTGCCCATCA TTCCACTAAA GCGTCTCACG AAAGGACGTT GA
 
Protein sequence
MESLSQLLRR TIREGVPLYN DGQTQECYEV YLMAAQHSLE SPLVVQSRAG ISLKDACRQA 
EAKATSPYLN FSEAAWILRR CFDEILSSQA SRQLNLASSS IASHADSSDN EVGYWQDERV
LNSSDAEKGQ ALADIGLILH TRLKPAPQEQ VETTVPYVVP GSEIVETLTY LGLAGNSALA
TEKCNLLVRS GLLVPVSSED ATRFQDGTHL YAFPGRNELE ASLERIIATD RSAIDGTEES
VLRIALETSL KGTNVDVELR PVREHGHRSS AFVIAEDLAG AKISDSLEAL LPHLVIADRR
YNLKKYEACF LGNQAVTAVL DTKLADSRNE ALVILNDMLN VGLIHHVTHD HLVEDKVLFY
RVTSIGDIKT ALDNTCALPE EPKTGFERLR HSALLRRYNQ FASLNIPDIL NAFYGCDSES
GWDEVDLQNW RTNMKRWGFG RREDQDDDMV HRLSPLLLSI DPETWDVTDD EEWESPFGII
AQIAIFDQVS RSAFRGTADA FKWDKIAIRA TKVAIAKGYF ETAYKSTLNQ FLILLPLEHS
ESWEDQKLGV RLLLKLLSTV AVQDEGFSDY EIVKRLEFSK RLSTAFLEHA QVVVKFRRYP
HRNRVHGRST TLEERIWLAS DLVPRWAKSQ NPEDAHNLIK LPIIPLKRLT KGR