Gene PHATRDRAFT_40785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40785 
Symbol 
ID7198642 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011693 
Strand
Start bp392477 
End bp393941 
Gene Length1465 bp 
Protein Length393 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184796 
Protein GI219129227 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.432323 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTTTGC GGACTCTTCC GGTCCTTTCG TCGTCGCGAC TCGTCAGCAT GCTCATGCTT 
TTGAACGTTG TCACAATATT ACATTATTTC CAGGCACAAA CGCACCTTTA CGATGCGGAG
GCCTCCCGGT ACTTTACCGA GTCCTCCGGG ACGCTGCAAA CCCACAGTGC TCCTTCGGAC
GGCATCACCA ACGCCAGACT GACAGTAAGA CCACGGCACG GACGTTTCCG GGATGATACC
ACCGCGACTG CTCCTCTGTA TGGTAGTACT GTTTCCAGGG ACTCTTCGGG CAACGCTTCT
TCTTCTTCAT CGTTGTCCGC GACCTGGACA TTCTGGCTCT GTGAAGAATG CCTACAGGTT
GTAAATTACG CCCCGCGCGT CAAACTCACC AAGCCCGTCC AGACACGCTC GGATGAACGG
AAATTCCTGA GTTACTTTTA CCTGAAAGAA GCGGGCAAGC ATCCCTTCCA AGGTGCCCTG
GATGCCCAAG GCCGGTCTGG TTTCCACTAC GACGTCACCA GTTTACGACG GAGCCCGCCC
TCCTTCGTAG ACAGCTTTCC CAATCTCACG GCCGAGTGTC TCCGGCGCGA TGACGAATAC
TACGCACTCC AAAGACTTCG GATTCATTCG CCGTCACCAG AACAATCAAC TACCGCCACT
CGACGACTAT CGCAATCGTC TACTCCGGCG AGAATACTCT GTGTCGTCTA CAGCAGCGAG
CCCTTTCACC ACAAGCTGCA GGCCGCTCGA CAGACCTGGG CTCCCAAGTG TGACGGCTTC
TTCGCTGCGT CCAACGTAAC CGATCCCACC TTTGATGCGG TCAACATTGT CCACAATGGT
CCGGAACAGT ACAACAATAT GTGGCAGAAG GTACGCTCCA TTTGGGCTAC GCTGTACGAG
TTGTACTATG AGGATTTTGA CTGGTTCCAT CTTGGCGGTG ACGATATGTG GCTTCTAGTC
GAGAATTTGC GTATGTATTT GGAAAGTGAC GAGATTCAAG CCGCTGCCAA CGGAGGCTTT
TCCGACACGT TACCACTAGG GGTGCAATCG GGCAACAATA ACAGTACTCG GATACAACCA
GACCAGGTGC CCTTGTATCT GGGGAGCCGT CTTGCCTTTC GGAAGAATAT ACGAACCTTG
TACAACACGG GCGGACCGGG ATACACTCTC AACAAGGCGG CGTTGAAGCT CCTCGTGACG
GAAGGATTGC CCGTCATGCA CAGTCAGCTA CGAACCTCTG CCGAGGATTT GCGAGTAGCC
GAGGTTTTCC GACGATTCCG CGTCTTGCCG TACCCGACAC ACGATCGCGA CGGAGGTGAG
CGGTATCACC ACTTTACTCC GGGTTTGCAT CAGCTATCGG CCATGCCCGA ACAATATAAA
TGGTTCGACA AGTGGGCTTC ACCAATGGGA TGGAAAGGAG GGTGGAATCA TTCTTCTGTG
TACAGCGTCG CGTTCCATGG TATAA
 
Protein sequence
MGLRTLPVLS SSRLVSMLML LNVVTILHYF QAQTHLYDAE ASRYFTESSG TLQTHSAPSD 
GITNARLTVV NYAPRVKLTK PVQTRSDERK FLSYFYLKEA GKHPFQGALD AQGRSGFHYD
VTSLRRSPPS FVDSFPNLTA ECLRRDDEYY ALQRLRIHSP SPEQSTTATR RLSQSSTPAR
ILCVVYSSEP FHHKLQAARQ TWAPKCDGFF AASNVTDPTF DAVNIVHNGP EQYNNMWQKV
RSIWATLYEL YYEDFDWFHL GGDDMWLLVE NLRMYLESDE IQAAANGGFS DTLPLGVQSG
NNNSTRIQPD QVPLYLGSRL AFRKNIRTLY NTGGPGYTLN KAALKLLVTE GLPVMHSQLR
TSAEDLRVAE VFRRFRVLPY PTHDRDGASR SMV