Gene PHATRDRAFT_42953 details

Gene Information

Locus tag: PHATRDRAFT_42953
Symbol:
ID: 7196782
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 1616609
End bp: 1618699
Gene Length: 2091 bp
Protein Length: 557 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002176818
Protein GI: 219110133
COG category:
COG ID:
TIGRFAM ID:
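
The gene length listed above is consistent with the start and end coordinates if both are treated as 1-based and inclusive. The sketch below shows that arithmetic; the function name `gene_length` is hypothetical, and the inclusive-coordinate convention is an assumption rather than something this record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints count as part of the feature, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(gene_length(1616609, 1618699))  # 2091
```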


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.677298
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGCTCT TTCGTCTCGC CATTTTCGCC GTCTTTGCGA CCTCTGTGCA AGGCAAAACC 
AACGAAGATC AAAAGGCAAC TAGTAAGCTT ATGGTGCATG TAAGTTGCGT AAGATTTTGA
GCTGTTCTTC ATCTATCGGG ATTCACTTTT TGAATCAACT ACTATTTTCG CTACGTTGAG
GAAGGCGAGA AATGTAATCA CGTTGATGTC TGAAGCTGCC TTTAGGAATC AAATTGGTCA
AAATTGTAAC CTTTCTGTAT CAGGATGAGT TGTAATAAGC TTTTGATGGT TCCAGACCTT
CTTTTCAGTG CATGAAACTG TTTTTCGACA AGTTTTTTCA AATCCCGTTG TCCAGTGTGT
ACGTGACAAA TACACTCACT TACTCTGTAT TGCTTATGCT TTCAGATTCC CCACATGCTT
TACAAATCCG CAGGATACGA TCACCGCGAA GCTCTATTTG GCATGCCAGC GTATGGTGGT
TCGATCTCTC AGAATGTGTA CTACGCCGAC AGCGACCTCT GTGATCCGTC CGAAGAAATT
GAAGGCTATC CACAAACCGA CTCGGATGGC GACGACGACG ATGTAGCACC ATTTCCAGCG
CCCTATATTC TCATGGTTAA CCGCGGAGGA TGTACCTTTG TGCAAAAGGT ACGGAAAAGA
CGACGATAAA AATGTCCTTG AACCGGCGAA GTTCTCACCT TTTTCTCATC CACAATGTTC
TCGGCGACAG GTGCGCAACG CACAGCACAT TGGAGCATCA GGTGTTTTGA TCGCCGACGA
CACCTGTCTC TGTTCGGATA AAGTCTGTAT GGCCAACAGT GAAGACGACG AGGACGCCTG
CCAAGTCAGC GAACCCATCA TGTCCGATGA CGGTTCGGGT GCAGACATAT CAATCCCGTC
TTTCTTAATG TTCAAGATGG ACTCGGAGCG AATCATTGAA GAAGTCAAGA GCAATCGACC
TGTTCAGGTT GAAATGGCTT GGAGTCTTCC GAACCCTGAC GATCGTGTAG AATATGATCT
GTACACGTCC CCGACCGACT CCATTAGCAA ATCTTTTATC CAAAGCTTTA AACAGCTTGC
GGTGGCTCTT GGAGGCCGCG CGTACTTTAC GCCGCATATG TACATTTTTG ACGGCATAAA
ATCACAATGC CACGGATCGG ATGGGGAGAG TCATTGCCAT ACTCTCTGCA CCAACAACGG
ACGGTACGCC ATATACGCCT CCAACCTAAG CTTGAGGCGA CAAGAATTGG ACACGCTTCT
CACCCTTTCC TTTATCCTTT CCTATAGATA CTGTGCCACC GACCCTGATG GTGACTTGGA
ACGTGGTATC TCGGGTGCTG ACGTTGTCAC CGAAAGCTTG CGTCGTATCT GCATCTGGAA
TCACTATGGC GCCCCCAACG GTATCGGAGA AATCTGGTGG GACTACGTAA TCGAATTTGA
ACAGCGCTGT GCCGCTTCAG ACTACTTTTC TGACACAGCC TGTATCCAAG AAGTGTACCA
CCGCGCCCAG GTTGACGGTG ACATGGTCGA GCGGTGCATG ACGGATAGTG GTGGAACTAT
AGCGGATGGG GCCAACACCA AGCTTGACTT TGAACTAAAC GCCCAGACGG ACCGGGGAGT
AGTTATTCTG CCAACTACTT TCGTCAACAC GGCTGCTATC CATGGTGCCT TGACCCCGTC
GAATGTTTTT AACGCAGTGT GTGCGGGTTT CGCCGATGGC ACAGCCCCCG AAAGTTGCAA
CACGTGCAGT TCCTGTAAAG ACACGATTTT CTGTGTCGGT CAGGGGTACT GCAAAGCGAA
CGATTCGTCC GGTGGCCCCG CGGAAAGCGG AGTATCTGGA CATGCCTTCG CGACTTCCAT
GCTGATTGTG ATCGGGTGTT TCTCCACCTT GGGTGCGTGG TACTACAAAC GCACCAAGGA
CGAGCTGCGA GACCACGTTC GTGGCATCAT GGCCGAATAC ATGCCTCTCG ACGACAACGA
AGGAGACCTC GGAAATCCAA TGGATTTTTC AAACAATGGC GATGCGACCA CTTCACTCAT
GATGGGCCCG GACTCGATTT AGAAAACCTC GTGCAGCGTA GACGATACGT C
 
Protein sequence
MTLFRLAIFA VFATSVQGKT NEDQKATSKL MVHIPHMLYK SAGYDHREAL FGMPAYGGSI 
SQNVYYADSD LCDPSEEIEG YPQTDSDGDD DDVAPFPAPY ILMVNRGGCT FVQKVRNAQH
IGASGVLIAD DTCLCSDKVC MANSEDDEDA CQVSEPIMSD DGSGADISIP SFLMFKMDSE
RIIEEVKSNR PVQVEMAWSL PNPDDRVEYD LYTSPTDSIS KSFIQSFKQL AVALGGRAYF
TPHMYIFDGI KSQCHGSDGE SHCHTLCTNN GRYAIYASNL SLRRQELDTL LTLSFILSYR
YCATDPDGDL ERGISGADVV TESLRRICIW NHYGAPNGIG EIWWDYVIEF EQRCAASDYF
SDTACIQEVY HRAQVDGDMV ERCMTDSGGT IADGANTKLD FELNAQTDRG VVILPTTFVN
TAAIHGALTP SNVFNAVCAG FADGTAPESC NTCSSCKDTI FCVGQGYCKA NDSSGGPAES
GVSGHAFATS MLIVIGCFST LGAWYYKRTK DELRDHVRGI MAEYMPLDDN EGDLGNPMDF
SNNGDATTSL MMGPDSI
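
The sequences above are displayed in space-separated blocks of 10 residues, so they need to be joined before any length or composition check. The following is a minimal sketch of one way to do that; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any tool associated with this record.

```python
def clean_sequence(text: str) -> str:
    """Strip the spaces and line breaks used for display blocks; uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (display whitespace is ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First display line of the gene sequence, copied from the record above.
first_line = "ATGACGCTCT TTCGTCTCGC CATTTTCGCC GTCTTTGCGA CCTCTGTGCA AGGCAAAACC"
print(len(clean_sequence(first_line)))  # 60
```

Passing the full gene sequence to `clean_sequence` and taking `len` of the result gives a value to compare against the listed gene length, and `gc_fraction` can be compared against the listed GC content in the same way.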