Gene PHATRDRAFT_54869 details


Gene Information

Locus tag: PHATRDRAFT_54869
Symbol:
ID: 7203505
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011684
Strand:
Start bp: 414626
End bp: 416132
Gene Length: 1507 bp
Protein Length: 445 aa
Translation table:
GC content: 57%
IMG OID:
Product: predicted protein
Protein accession: XP_002182681
Protein GI: 219124795
COG category:
COG ID:
TIGRFAM ID:
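The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the record itself. It assumes that Start bp and End bp are 1-based and inclusive (the record does not state the convention, though the listed values are consistent with it), and that the protein is followed by a single stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 414626
END_BP = 416132
GENE_LENGTH_BP = 1507
PROTEIN_LENGTH_AA = 445


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_nucleotides(protein_length: int) -> int:
    """Nucleotides needed for the protein plus one stop codon (assumed)."""
    return (protein_length + 1) * 3


if __name__ == "__main__":
    # Does the coordinate span agree with the reported gene length?
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)  # True
    # Nucleotides in the gene span not accounted for by the coding region.
    print(GENE_LENGTH_BP - coding_nucleotides(PROTEIN_LENGTH_AA))  # 169
```

Under these assumptions the span matches the reported 1507 bp, and 169 nucleotides of the gene span lie outside the 1338 coding nucleotides. The record does not say how those 169 nucleotides are annotated.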


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTACCAAC CGAACAGTAG TGCATCCATT CCCAATCTAG CCCACTTGCC TTCCAGTTAC 
AAACAAATCC TTTAGCGGCA CCGTGCAGTC CCCCATTTGT GTACCTTGCC TTAATACCTA
CTTACCTACC TACCTACGTA TCTCTTGCGC ACACTCCCAA AACCTCCACA TGACCGTGCA
CGTGACGCTC CACACGTCCG GACCCCGCAA CGTCGTACAG GTCGCCGACG CCGAATCGAT
CCTCCAAAAC TTCCAGGCGA CGCTCGCCCG CTACACGACC GACGATGGTT CCGCCCTAGT
CGACAAGTTG GACTTGTCCG GTCGCGCCTG GCCCTTGTCC TCACTCCAAG TCCTCGAGGC
CTTTTTCGAA GCGCACGTCG TCGACACCGT CCGCGTCCTC AAAATCGACG ACATTATTGC
CTCTCTCCCC ACCGTAGATG GCCTCGCGTC TCTCCGTTGG TTTGCCCGGG TCTTTCAACA
CGCCCCCGTC GCCGTACTCA ACCTCAACGA CAACGCCTTG GGAACGCGCG GAATGGCAGA
AATACGACCG TTGTTGTCCA ATCCGCACAT CCGACACCTT GCTCTCGATA ACGTTGGAAT
CAGTGAAGCG GTCGTTGCTA CACTGGCCAC GATACTCTCC CATTCTTCCG GTGACGACCC
GGATACACCC GGACCACTCC AACTACAGAG TCTCTCGTTG GGCCGCAACC AAATCGGGAT
AGAAGGGGCG CGGAGTGTGG GGGAGCTCTT GGCTCTTTGC CCTCATCTGG AGTCCTTCTC
CTACGCTTCT TCCAGGCCGC AATTGGCCGG AACCTTGGCC TTGGTCCAGG GTTTGGAGAA
AAGCGAGGTC ACATCCTTGC GGTACTTGAA CTTTGAAGAT TGCGTCTTCC GTGGCGGTGA
CGGGGAGGAT CCAACCCAGG TCCTCAAGAC TGTGCTCTGT CGCTCTCCGA AACTCCACAC
CCTGCTACTC CCCGACTGCG AACTGGGTCC GGCTGGACTC CTTCTCGTCA TCCTCGGAAT
CCGGTACGCC AAGCCACCCT TGACCGTCCT GGACTTGAGC GCCAACAATG CTGGTCCCGG
GGGCACCTAC GCTCTTGGTG AGTTATTGGA ATACACCGAT GCACCACAGA CCCTCCAGTC
ACTCGCCTTT GACAATAACG AGGTGGAAAC CGCCGACGTC CTGAATTGTG TACTCCGTCC
CGTGATCCGG TCACCCCGGG TCGCCCTCCG CGAACTACGC TTGGAGGGCA ACGAACTCGA
GGGCCTAGCA CTCCAATTGC TAATCGATAA CCCCATTCCC TCCCTCCGTG CCCTGCACCT
CGAAGAAAAT ATGGACATTG GACGCCAACG CGCTCAACGA CTCCAAGCTC TTTACCAAAC
TATGGGTTGT GTGGTGTACG TGGACGAGAA TTTGGAAATC GACGAAAACG ACGACCATGA
TGAGGAGCAA AAGGATGACG ACGCGGTCGA TGCGTTGGTG GACAGCGTGA AAGGGTTGGC
TGTTTAG
 
Protein sequence
MTVHVTLHTS GPRNVVQVAD AESILQNFQA TLARYTTDDG SALVDKLDLS GRAWPLSSLQ 
VLEAFFEAHV VDTVRVLKID DIIASLPTVD GLASLRWFAR VFQHAPVAVL NLNDNALGTR
GMAEIRPLLS NPHIRHLALD NVGISEAVVA TLATILSHSS GDDPDTPGPL QLQSLSLGRN
QIGIEGARSV GELLALCPHL ESFSYASSRP QLAGTLALVQ GLEKSEVTSL RYLNFEDCVF
RGGDGEDPTQ VLKTVLCRSP KLHTLLLPDC ELGPAGLLLV ILGIRYAKPP LTVLDLSANN
AGPGGTYALG ELLEYTDAPQ TLQSLAFDNN EVETADVLNC VLRPVIRSPR VALRELRLEG
NELEGLALQL LIDNPIPSLR ALHLEENMDI GRQRAQRLQA LYQTMGCVVY VDENLEIDEN
DDHDEEQKDD DAVDALVDSV KGLAV
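
The sequences above are printed in blocks of ten characters, six blocks per line. A minimal sketch of how such a listing might be turned back into a contiguous string, and how a GC fraction could be computed from it, is shown here. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the demonstration uses only the first line of the gene sequence, so its GC value will generally differ from the 57% reported for the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty string."""
    if not dna:
        return 0.0
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


if __name__ == "__main__":
    # First line of the gene sequence, copied from the listing above.
    first_line = (
        "GTGTACCAAC CGAACAGTAG TGCATCCATT "
        "CCCAATCTAG CCCACTTGCC TTCCAGTTAC"
    )
    dna = clean_sequence(first_line)
    print(len(dna))  # 60
    print(f"{gc_fraction(dna):.0%}")
```

Pasting the complete gene sequence into `clean_sequence` should, if the listing is intact, give a string whose length equals the reported gene length.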