Gene PHATRDRAFT_36120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_36120 
Symbol 
ID7201178 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011677 
Strand
Start bp609471 
End bp611553 
Gene Length2083 bp 
Protein Length626 aa 
Translation table 
GC content45% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180672 
Protein GI219119841 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00433417 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGAAC CTCCGTCTGC ACCTACCCAA AGATCAACCA GTACAACGAC TACAACTACA 
AATTCTAGTC GCTCTGCTAT TGCCGATTTC AAACGAGGAG TAAAACGAGA CAAGACGCAC
TATCCAGTTC TTAAAGATGA CCGTTATTGG GACAACTTCT ACCGTACTTT TGTCGTTACC
GCAGTATCGC ACAATGTAGT GATAAAGTAT AGCCACGCAA TACTGCAATT TGCCACTAAG
GTGAGCTATT GTCATAGCTA ATGTCAATTG TGAGCGACCG TATATTCCTC GTGAGATGTG
GGATAAACTG TCCGACGATG CAAAGGAGAT TCTCCGTGGT ATGTCTTCTT CTAAGAAGGA
AACGCCTCGG CCAACAGCAA GTCATCATCT GCTTTTCATG CCAACTCCCA CTCTTTAACC
GATACGGGAC ACCCCTCATC AACGGACGAA TTGTTGCACG AAAACGGCAA CGGTAAATTC
CATGAGTGCG GGAACGACAC GGAACTGCTT GCACACCTTA CTGATTGCTC AAGTAATATG
GCAAATGGAG ACATTTGCAA GGTCCTTGCT TCAGCTTCCT CCTATAAGCA AAATTCAAAG
AACTCCCTGC TGTCAAATAT GCTCGAGTAC AGTATTTCCC GACACTCCGT TGCTGGGACT
ACATCCTCCC TCATCAACAG AGGCGCAAAC GGCGGACTTG CAGGAAGCGA TGTTAAAATC
CTTAACAAAA GAGGCCGTTC TGCAAGCATC ACGGGTATTA ATGACCATAC TTTGCCTGAT
TTGGACATTG TCACCGCCGC CGGCCTTGTT GAATCCCAAA ATGGACCCAT CATTGTCATA
CTTCACCAGT ACGCACACCA TGGAAAAGGT AAAACAATTC ATTCTAGTGC GCAACTTGAG
TATTACAAGA ATATTGTTGA GGACCATTCC CGTGTTTTAG GAGGTAAACA ATGTATCATA
ACTCGAGATG ATTATGTTAT TCCTCTACAT GTTTGTCAAG GACTAGCTTA TATGGACATG
CGACCTCCTT CCGATACGGA ATTTGACACG TTACCCCACG TTGTACTTAC TTCCGATGTC
GACTGGGACA CGTCCATTAT TGACAACGAA ATTGACCTTG TCACAGATTG GGATGATGCC
GTCCAGGACC TTCCCAGCGA CGTACGTGGA ACCCTGTTTC AATTCAACTG GTGAAAACCG
ACACAGGCAC GTTGCGAACT TTGACATTTT TTCGTCACCT GACTTTGTTG ATCGGTCCAC
GGCTATCAAT AATATACTCT TGTCAAATCA ACATGACATG ACCCCCAATC CACACAATTA
CGAAGCCTTG CGTCCTTGTC TTGGCTGGAT CTCCGCCGAC ACAGTCCAGA AAACCATTAT
GGCCACTACG CAATTCGCTC GTGAAGTCTA TAATGCACCC ATGCGTAAAC ATTTCAAGTC
TCGTTTTCCG GCACTTAACG TTCACTGGCG CAACGAAGCT GTAGCTACTG ATACCATTTG
GTCGGACACG CCTGCTGTTG ATGATGGCGC TAAATTTGCG CAATTATTTG TCGGTAGACA
ATCGCTTGTC ACCGACATTT ACCCTATGAA AACAGACAAA GAGTTTGTTA ATGCTCTCGA
AGACAATATT CGTCATCGTG GCGCCATGGA TAAACTCATC AGTGATCATG CTAAAGCCGA
GATCAGCAAG AAAGTTTCTG ATATTACCTG CGCTTACCAC ATTGATCAAT GGCAAAGCGA
GCCTAATCAC CAGCACCAAA ATTATGCCAA ACGCCGAATT GCAACTGTCG AAGCAAATGC
GAATAAAATT CTAAACAAAA CTGGTGCACC CAATTCTACA TGGTTATTGT GTGTTTCCTA
CATTTGTTAT TTGTTTAATC ATTTGGCACA TGAGTCTTTG CACAATTGCA CTCCTCTTGA
AATTCTTAAT GGTAGTACTC CTGATATTCG CGTACTCCTT CAATTCCATT TCTGGGAACC
AAACTACTAC CAACTTGAAG ACCCTACTTT TCCTTCCGAT GGAACTGAAA AGAAAGGCCA
TTTTGTTGGA ATTGCAGATT CCGTTGGTGA TGCCCTTACC TAA
 
Protein sequence
MPEPPSAPTQ RSTSTTTTTT NSSRSAIADF KRGVKRDKTH YPVLKDDRYW DNFYRTFVVT 
AVSHNVVINK SSSAFHANSH SLTDTGHPSS TDELLHENGN GKFHECGNDT ELLAHLTDCS
SNMANGDICK VLASASSYKQ NSKNSLLSNM LEYSISRHSV AGTTSSLINR GANGGLAGSD
VKILNKRGRS ASITGINDHT LPDLDIVTAA GLVESQNGPI IVILHQYAHH GKGKTIHSSA
QLEYYKNIVE DHSRVLGGKQ CIITRDDYVI PLHVCQGLAY MDMRPPSDTE FDTLPHVVLT
SDVDWDTSII DNEIDLVTDW DDAVQDLPSD VRGTLHVANF DIFSSPDFVD RSTAINNILL
SNQHDMTPNP HNYEALRPCL GWISADTVQK TIMATTQFAR EVYNAPMRKH FKSRFPALNV
HWRNEAVATD TIWSDTPAVD DGAKFAQLFV GRQSLVTDIY PMKTDKEFVN ALEDNIRHRG
AMDKLISDHA KAEISKKVSD ITCAYHIDQW QSEPNHQHQN YAKRRIATVE ANANKILNKT
GAPNSTWLLC VSYICYLFNH LAHESLHNCT PLEILNGSTP DIRVLLQFHF WEPNYYQLED
PTFPSDGTEK KGHFVGIADS VGDALT