Gene PHATRDRAFT_47956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47956 
Symbol 
ID7203203 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp549581 
End bp551553 
Gene Length1973 bp 
Protein Length541 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182416 
Protein GI219124239 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TGTCTCCATC GCAACGTTCG GTTGTTTCGT GCCGGGCTCG CCAATCCTTT TTTTTCGCGG 
ACGAGCTCTA TCTCACAATA GTTGCTATTG CAAACCTTAA GATGAAGAGT TCTCTCGTCC
TTGTTGCTGT TGTGCTCACC TTTTCGGCCG AGGCCTTTAT ACCACATGTG CGGAGACCAG
CGTTGCTTCG TCCCGTTGCC GTGTCGTCGT CCATCAAGAT TTTTACGGCC AAAAACGCCA
GTGAAATCGC CTTTGAAGAA GTAGAAAGCT ATCGGGATGG CATGAGCATC ACTCGATCCG
GTACCAACGA AAACAAGGTA CGTAGCAAAA GTCTATAATC CGGCAAATAT AAGGCCCCAC
GGCTCCGACT CCCTTGCTCA CCTCGAACCG TACGACTTTC ATGGATTCGT AGGTGATGGA
CGTTGTCATG AAATTTGGTG GTAGTTCCTT GGCAGACAAG GATCGAATCG ACCATGTAGC
AAACTTGATT AAGAATCAGA TTGAAGCGGG GTACCGACCT CGGGCCGTGG TTTGCTCGGC
GATGGGCAAG ACGACCAATT CTCTGCTGAG TGCGGGGGAA TTTGCCTTGG GTACGTCGGC
TTTGCTCTTT TTCACGTTGC GAGCGCATGT GCGGTGATGT TTGGCACTGC ACAATCGCAA
AGTGTCTCAC TTTCAATGGT CGTACTTGTC TCTGTAGAGG GCCGTGTCAA CGTTGATGCG
ATTCGTACCT TGCATCAGTC CACTATGAAT CATTTTGAAT ACTCTCAACA CATCATAGAC
GACGTCAATG CACTCTTGGA CGAATGCCAG GACATGCTGA ATGGTGTGCG GATGATACAG
GAGCTTAGCC CGAAGTCTCT AGATCAGCTT GTCTCCTACG GGGAACGATG CTCAGTTCGT
ATTATGGCGG CCCGTTTGAA CCAGCTTGGT GTACCCGCCC AAGCGTTCGA TGCTTGGGAT
GTCGGTATGA TTACGGACAG CGAATTCGGG GATGCCAAAA TTCTTGCCGA GTCCGAAGAT
GCCATTCGAA ATGCCTTTGA CCGGATCGAC CCGAACATTG TCAGTGTAGT GACTGGCTTT
ATCGGCCACG ACCCCAATAA GCGTATCACG ACACTGGGTC GAGGAGGGTC GGATTTGACG
GCAACGCAAA TCGGCGCTGC TTTGAAACTG GACGAGATTC AGGTCTGGAA AGATGTGGAC
GGTATTTTGA CTAGCGATCC TCGGTTGGTG CCTAATGCTG TCCCGGTGGG CGACGTGAGT
TACGAGGAAG CTAGCGAATT GGCTTACTTT GGCGCGCAAG TGCTGCATCC GATCGCAATG
CAGCCAGCCA TGAAACACAA TGTTCCCGTA CGGGTCAAGA ATTCGTACAA TCCATCAGCC
GTGGGAACAA TTATTCGTAA CAGAAAGGAA ACCGAACGGT TAGTGACCGC CATTACCTAC
AAGCGTGATA TAAAATTGAT GGATATTGAA TCGACACAGA TGTTGGGAGC GTACGGTTTC
TTGGCACGCG TATTTGGAGA ATTCGAGAAG CACAAACTCT CGGTTGACGT GCTCGCTTCG
TCCGAAGTCT CTGTGTCTCT GACCTTGGAC AAGAAACAAA AGGATGCCGA AATTGACGGT
CTCATGCGGG ATTTGGGCAG CTGCGCGAAG GTCACGTGCC ACAAGGACCG ATCCATCCTG
ACACTCATTA CTGACGTTGG TCGCAGTTCG GAAGTACTCG CTACTGTTTT CCGTGTTTTT
TCGACTTGCG GCATTAAAGT TGAAATGATG AGTCAGGGAG CCTCGAAGGT AAACATTTCC
TTCATCGTCA AGGACGAAAG CCTGGAACGA GCTATCCTGG AGCTTCACAA ATGCTTCTTT
GAAGAGACCT GCTCAGTGGA GCCTTTCAAA CCAGAGGCTG GCAGGAATAA GACATTGCTC
GTTGTGTAGA GAAGCGGGTA GATATTATTT TCTGGTATTG CTTTGATTTC GCC
 
Protein sequence
MKSSLVLVAV VLTFSAEAFI PHVRRPALLR PVAVSSSIKI FTAKNASEIA FEEVESYRDG 
MSITRSGTNE NKVMDVVMKF GGSSLADKDR IDHVANLIKN QIEAGYRPRA VVCSAMGKTT
NSLLSAGEFA LEGRVNVDAI RTLHQSTMNH FEYSQHIIDD VNALLDECQD MLNGVRMIQE
LSPKSLDQLV SYGERCSVRI MAARLNQLGV PAQAFDAWDV GMITDSEFGD AKILAESEDA
IRNAFDRIDP NIVSVVTGFI GHDPNKRITT LGRGGSDLTA TQIGAALKLD EIQVWKDVDG
ILTSDPRLVP NAVPVGDVSY EEASELAYFG AQVLHPIAMQ PAMKHNVPVR VKNSYNPSAV
GTIIRNRKET ERLVTAITYK RDIKLMDIES TQMLGAYGFL ARVFGEFEKH KLSVDVLASS
EVSVSLTLDK KQKDAEIDGL MRDLGSCAKV TCHKDRSILT LITDVGRSSE VLATVFRVFS
TCGIKVEMMS QGASKVNISF IVKDESLERA ILELHKCFFE ETCSVEPFKP EAGRNKTLLV
V