Gene PHATRDRAFT_35869 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_35869 
Symbol 
ID7200864 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011676 
Strand
Start bp933837 
End bp935441 
Gene Length1605 bp 
Protein Length534 aa 
Translation table 
GC content57% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180350 
Protein GI219119167 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.597228 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGATC TCGTGGTGCA TCGGTCGGAC GGTGCCCACG GGGATCATCG TCGCCCTCCT 
CCCGTATATC TCTGCTACGA CGAACGATTA CTCCGTCACC ATCCGGTACA TTGGCAACCT
CCCGCAGTCT ATCCAACACA AGACGACGTA CGGATAAAAG CGTTTCCAGC CGAGTACGTC
TACGAAAACC CCGAACGCAT CCGTTGCGTC TACGAACATT TGCAAAGCGT CTTTCCGCCC
GACACCTTTC TGCCGCTCCC GTGCCGTCTA GCCACACGTG CCGAAATCAC CGCCGTCCAC
GACGAGGCGC ACTACGATCG TCTCGCCGCA ACAGCCTGTA TGACTGCGGA AGAACTCGTC
GAACAAAGCC AGCTCGACAG TGATTTGTAC TGGAACCAGG AGACCTTTGC TGCCGCACGC
TTGGCTTGTG GCGGCCTGTT GAATTGTGTC GACGCCGTGT GCCGATTAAA CGCGACTCCC
AACACCGACG CGACACCACC GGCAATTCAC GCCGTGGCCT TGATTCGGCC TCCGGGACAT
CACGCTTGTC AACATCGCGA AATGGGCTTT TGCTTTTTGA ATTCCGTCGC CGTGGCCGCG
CGGTACGCGA TTGAGCAAGG CCACGCCACC AAAGTTCTCA TTCTCGATTG GGACATACAC
CACGGTAACG GCACACAAGA TCTCACCTAT AACGACGAAC GTATACTCTT CGTGTCAATG
CACCGCTACA CGGGCAGCAA CGTCGCCAAG CATTTCTTTC CCGCCACCGG AAAACCCACC
GAGACCGGAC GGAACGCCAC CAACGTCAAC CTGGCCTGGA CGCAAGGACA CATGGGTGAC
GTGGAATACG CCGCCGCCTT CTCGGAACTC GTGTTGCCCA TCGTTGTCGC CTTCCAACCC
GAATTGGTGT TGATTTCCTG TGGACTGGAT GCCGCTCGGG GAGATTTGAT TGGGGACAAT
TCCGTGTCAC CACTGGGGTT TCGGGCCTTG ACGCACAGTG TTGTCCGCGC CGTAGGCACC
CACACCACAC CCGTGGTAGT TGCCCTGGAA GGCGGCTACA GTATGGACGC TTTACCCATT
TGTATGGAAC ACGTCGTCAG GGGACTATCG GCCGCCAACG ACACCAGTTT AGACTGGGAC
GTGGAGAATC TTCCACAGGC ATGGGCGAGT GATAGCTTGG AGTACGCTCA CCAAGCTTTG
GCAATGTACT GGGACTCCAA CCGTCGAGTA GCGGCCGACA ACCAGCCCGC GATACAACCT
TCGGCAATCA GTAATATCAA TCAAACCGTT ACAGCCTTGC AAAAGTGTTC CTCACGCTGG
AACGAATGTG GTTTAACGAA ACTCCTAAAG CCACCGGGGC CGTCTGCAGT CTCGACTCGC
GCGTCACGGC GTTTGGTAAA GTCCTCTCCC AACACTTTTC TACCAAATGG GTCGAATGCC
ACACCGCAGC CAAAAGCCTC CAAGGTTCTG GCTGTATCGT TGGCGAAGGA TCCCGTCAAG
GAAACATCGC TCCCCGCTGC CAAAGCGGAC GAGAACGTCG ACGGTGGGGA TGGTGATGCG
TTGATTGCGG CTTTGCAGTT GCTGTCCTTG TCCAATGGCA AGTAG
 
Protein sequence
MSDLVVHRSD GAHGDHRRPP PVYLCYDERL LRHHPVHWQP PAVYPTQDDV RIKAFPAEYV 
YENPERIRCV YEHLQSVFPP DTFLPLPCRL ATRAEITAVH DEAHYDRLAA TACMTAEELV
EQSQLDSDLY WNQETFAAAR LACGGLLNCV DAVCRLNATP NTDATPPAIH AVALIRPPGH
HACQHREMGF CFLNSVAVAA RYAIEQGHAT KVLILDWDIH HGNGTQDLTY NDERILFVSM
HRYTGSNVAK HFFPATGKPT ETGRNATNVN LAWTQGHMGD VEYAAAFSEL VLPIVVAFQP
ELVLISCGLD AARGDLIGDN SVSPLGFRAL THSVVRAVGT HTTPVVVALE GGYSMDALPI
CMEHVVRGLS AANDTSLDWD VENLPQAWAS DSLEYAHQAL AMYWDSNRRV AADNQPAIQP
SAISNINQTV TALQKCSSRW NECGLTKLLK PPGPSAVSTR ASRRLVKSSP NTFLPNGSNA
TPQPKASKVL AVSLAKDPVK ETSLPAAKAD ENVDGGDGDA LIAALQLLSL SNGK