Gene PHATRDRAFT_40662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40662 
Symbol 
ID7198578 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011693 
Strand
Start bp89887 
End bp91221 
Gene Length1335 bp 
Protein Length444 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184644 
Protein GI219128910 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGTGTA CTACTTCAGG AATAGACGAT ACATTATTGA GAAGTAGAAA TAGCACATTG 
GATCTGTGGC TAGACTGCAA CGCACTCGAG GGCGTCAATT TCGCCGATGC GCTGCCTTTC
CTGCTAGCCT TGAGGCAGAA CACGCACGTG ACCGACGTAA ATCTGGCTAT CGTTCGATCA
CCTCATGCAG CGGAGACGAT ATTGGCCGTC AGTCAGATTC CCAATCTGCA AAGCTTAAAG
GTCTCATCCA TCTTTGACTG TTCCGGGGAT TTCGGGCTGT CGCTACCGAC ACTGACACAG
GCTCTGGGTC AGGCACGACG ACTGGAAGCT CTATCTCTGG ACTGGATTGG CATCTTGGGA
GATGCCGACA ACAATACGCT GCGGGAACAA CATACCGCAT TAGAGCGGAC ACTAGAGTCG
CACCCAAGAC TCTGTGAGAT TGTTTTGACC AATTTTTACT TTCCGAGCCA GACGAGTCCA
AATGATTGGA TTCAGGCGGA TCGATTCGTG CGTGTCATCC TCGCATTGCC CAACTTAACT
ACGTTACGAA TGGATGCCGT CACAAGATTC TACGGGCGAC CACTTTTGAC CTCGACTACG
ATCAAGCTCT TGTTTTCCCA CGACTGCCTC CAGACGATTC TTCTCAAAAA CATATGTGTG
TGGAGCACCC GGTGCGACCC GCAGGTCAGC AAGGCGTTGC GAAGGAACGA GCGCCTACTG
GAACTATCAC TTACATCATG TTACCTGGCT AGTCACGGTA GTGTTCTGGC TGGTTTGGAC
GAGAATCGAT CAGTCCGGGA TCTAGACGTG AGCGACTCGT CCCTGTTATT GGAAGCGCTT
TCGTTGGGTC GCGCCCTCGG AATGAATCGC GGGTTGCAAA AGTTGACACT CTGTCGGAGT
CGTCTGGATG TGAACCCTGC GACCCATCAC GACTATGCGC TCGCTCTGTT GCAAGCTCTG
GCGAATCATC CCACAGTCAA GCAATTTCGG ATGAGTATCC TGTACGATTG CACGCTCCTT
GCCGAACGTC CGTCCTTTCA ATCTGTGGAG GAAGTTTTAC AAACAGCATT GCGCGTGCTA
GAATCAAATC ATGTACTACA AGAGTTGTGC TTGGATGGAA TGTGCCAAGA TGAACCATTT
TGGGCATTGT CGGAAGCTAT TCGGCTACGC TTGGGTCTAA ACAAAGCTGG ATTTTGGGGA
CTATCGCAGA CTACAAGTAA AGCGTCGGAA TGGGCTGACG CCTTGGGTGC CGTACGATAT
GATGTGGGAT GTCTGTACCA TGTGCTCCGC GACAATCCTC TCTTGGTAGC GACGACAGCG
GTGTCGGTCA AGTAA
 
Protein sequence
MECTTSGIDD TLLRSRNSTL DLWLDCNALE GVNFADALPF LLALRQNTHV TDVNLAIVRS 
PHAAETILAV SQIPNLQSLK VSSIFDCSGD FGLSLPTLTQ ALGQARRLEA LSLDWIGILG
DADNNTLREQ HTALERTLES HPRLCEIVLT NFYFPSQTSP NDWIQADRFV RVILALPNLT
TLRMDAVTRF YGRPLLTSTT IKLLFSHDCL QTILLKNICV WSTRCDPQVS KALRRNERLL
ELSLTSCYLA SHGSVLAGLD ENRSVRDLDV SDSSLLLEAL SLGRALGMNR GLQKLTLCRS
RLDVNPATHH DYALALLQAL ANHPTVKQFR MSILYDCTLL AERPSFQSVE EVLQTALRVL
ESNHVLQELC LDGMCQDEPF WALSEAIRLR LGLNKAGFWG LSQTTSKASE WADALGAVRY
DVGCLYHVLR DNPLLVATTA VSVK