Gene PHATRDRAFT_49046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49046 
Symbol 
ID7195297 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011688 
Strand
Start bp424921 
End bp426461 
Gene Length1541 bp 
Protein Length454 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183606 
Protein GI219126736 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.208062 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ACGACCGAAC GTGTGTGAAC GGAATGTAAC TGCTAGTACA TGCCTCCACT CTTTCTCGCC 
AGGTGTTGTA GCTAGGAGTA GAAGATAGCA AATGACAGCT TTGCATGGCT GGAAAACTAG
CTGGCGGGGG GTAATACTTC ACGTAGCTCT GCTGTTGGAA AGCAACGCCT GGTTGGCAAC
GTACTCGACC CGTTGTCAAA ACCGAGCCAT TGCCTGCTTC AACCGCGACT GTAAGTCTCA
ATCGCGTATT TGCACTCAAT CTGTTGGCTC CTGGAGTATT TTAGGGGAAA AGTCAACCTC
CATAGACCAA GTTTCTTGGG TAACGAACGA GCACGTCTCT AGCAGCGGAA CTGCTCCTGG
TACTCCTCTC GAATGGTACA ATGATCTTAT TGATCGCCGG TTGGAAGAGG CTGATGAATT
TCCGATTGAG CCGCAAGAGT CGCTCGAAGA TGGAGAAAAT GTTCCTTTAA TTGGATTTCT
TGATCTCTTA CACATGGCCT TTGCGGCGGC GACATCGGCC ACAATGCGGG CTAAACGCCG
AGATTTTGTT AGCACGGTAC AGAACGCGGG TGGATACTGC ATTGTTAAAT TGGAGGATCC
GGAACTGTCA GTTGTCGAAG GAATGTGGGA TGGCATTGAC GAAATTTTTG CGAGGCCACG
GCACAAAGAT ACGGCAGTGA CAGGAGCTAC GCAACTCGAA TTACGCCATC AAACACTTAC
TCGCGAGGAC ACGACTGAAT TGCACCAAAA CAGTGGGTAC AAATTCGTTC AAATTTCCCT
GATAGACAAC AGTATTCCCT ACTTGGCGGA CAGCGTTGGA AAGCAATCCG CCGAGCAAGC
TGGACGGGTA TATCAGCTTT TTTCGCTGCT GGCCAAGGCA TTTGCGTCGG TATCCTACGC
TGGATCATCC ACGGAATCAG AGCACGTTGC AAATAAGGGC GATCCCAAGC AGGCGTCCAA
TTTGCTTACA AAAATGCTAG ACGACCCAGG CAAGCCTTTC AGCGGGACTT TCCATCGTTT
GGCTAAGTAC GTACCGGTCC TGGAAGAGGA AGAATGGAGC GAATCCCTCC GATCCCATTG
CGATTGGACG CTCGCGACTC CCATTCCAGT GTCGGCGACG GCTGGACTGG AAATTTTCAA
TCCGACTAGT CAAACTTGGA TTCGACCCGA GCAAACAGCT AAATCTCTAT GGGAGCACGA
AAACGGAGAG CATAGACCCA CAGACGATAA ACGATGGCAC AGTCGCTACG TGATTGTAAT
GACCGGGAAG TGGCTTGAGT TGATCAGCAA GGGAGAGATC TCGTCTTGTA TTCACAGAGT
AGTGTCGGTA CGCGGAGAAA ATTCTCGTTT GAGTGCTCCT TTTTTCATGA GACCAAGGCC
ACAAGTTTTT TCGGATGCCG AGGCCGTTCA ATTCAAAAAT AGGACATCTG GTAATATCGA
GTCGATGCAA GCCATTGGTG AGTATTTGTT GCAAAAGTAC GGCACTGACA GTGAGTATAT
TGAAAGATGA AAGCAAGTAT TCAAACGAAG GTTTTTCTAC G
 
Protein sequence
MTALHGWKTS WRGVILHVAL LLESNAWLAT YSTRCQNRAI ACFNRDWEKS TSIDQVSWVT 
NEHVSSSGTA PGTPLEWYND LIDRRLEEAD EFPIEPQESL EDGENVPLIG FLDLLHMAFA
AATSATMRAK RRDFVSTVQN AGGYCIVKLE DPELSVVEGM WDGIDEIFAR PRHKDTAVTG
ATQLELRHQT LTREDTTELH QNSGYKFVQI SLIDNSIPYL ADSVGKQSAE QAGRVYQLFS
LLAKAFASVS YAGSSTESEH VANKGDPKQA SNLLTKMLDD PGKPFSGTFH RLAKYVPVLE
EEEWSESLRS HCDWTLATPI PVSATAGLEI FNPTSQTWIR PEQTAKSLWE HENGEHRPTD
DKRWHSRYVI VMTGKWLELI SKGEISSCIH RVVSVRGENS RLSAPFFMRP RPQVFSDAEA
VQFKNRTSGN IESMQAIGEY LLQKYGTDSE YIER