Gene PHATRDRAFT_43112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43112 
Symbol 
ID7196886 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp2065496 
End bp2067756 
Gene Length2261 bp 
Protein Length708 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176900 
Protein GI219110297 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AACGATAAGG CTATGAAAAG AATCGAGAAA CCGATATGAA ATATATTAGT AGGACCAGGT 
CAAGCGAATG AAGCTCCTTC GAAAAGATCG AGGCCGTGAA AGGCACCCGC TTTAAATTCG
GTTTGGGTTG TTCAATGATG TGCTCGTTTG GCATTGCTTC AGCGTCGATA CCGAAGCCGG
CCTTGTCGCC TTGGACGACT ACCTCCAAGA AGTTCTTGAC ACAACCTGCA CCATGGAACA
GTGGGTGTGT CTCGACGAAA TTCCCAAGCA AATTACACCG ACCCTTTATT CAATTTCAAA
GATATCATAT AGCTGCATGC TACAAGCAAC ATATTTGTCG TCAGCGTTTG ACCTGCTTCC
AACGCTCATT GATTTTTACA AACTGCGGGG GCTTTAAATA TTGGCATCTA CCTGGTTTCC
TCGACGATAC CTCAATCGAA GAAAAGGAGG ATTGGCTGCA AAACCTACTT CAAGATAAAG
AGGATCTGTC TGATACCGAG GCTTACCTAG TTGTTTTACG GTCGTTGGCA ACGTCCACCC
AACCAGATGC ACCAATGAAA GCTGAGCGAT GGCTACGACG TTTGGAAGCT CGTTCAGTAT
CGAATCCGGA TGCACCTCAG CCCACCACGG AGTGCTATCA AAGAGTAATC GAAGCTTGGG
GTCAAGCCAC TAACGAAGAC CCAAACCTTT TGATTACCCG CACCCAGCGA TGGCTTATGA
AGCACCTGCG AAATGACAAC ATTGATTTGC GACCCGACAC TGCCTGCTTC AACTCTTTCC
TGGACTTATG TTCGAAAGGT CGCGCTTTGA AACGGGCAAA GGCAACAGAC GGAAACCTGG
TGAGGGATCA CGCTTTAAAA GCAGAACAAA CGCTGCGTTT GATGATTTTC AAGAGGAGGA
AAGAAGGGGA AGATTCTTCC ATGGCCCCAA ATGTTGATTC GTTCAATTTT GCTATTCGAG
CATGGACGCG TTGTCGTAGA AGTCCTGACA TCGCGGACCG ATCAATCTCA GTATTGCATT
TGCTGGAAAA TTATGAAAAA ACGTTGGACT CGTCGGTACG TCCCAATGTC AAATCGTATG
CCATGGTGAT GGATTCCATT GCTGTGGTCG CTAGACTTAA AGTGAAACGA TGCCAAAGTA
TGCCGAAAAC CGTGGAAAAC CCATCGACTA ACGGTTTGAA CGAGATCAAT TTGCTTCAAG
AAGTAGTCTC ATATATGAGA AATCAAGCCA GCCTTGGAAA ACATCACTTG GCGCCGAACG
GAGTCATTTT CAACACTCTT ATTTCTTGTT GGAGTTCTTT GGCTAAAATT CATTCTCATG
CCCCAAACGA AAGCGAGAAA ATACTGCAAA GCATGATACG CATGAAAGAC ATGGGGGAGA
ATCACACGGC TCCCGATGCT ACATCTTATC TGATGGTGAT GCGGACATGG CTTAATTCAC
AACAGAGTAT TCGTGCCGAA CGCATATCAT GGTGGTTGTC AAAGCAATGG AAGGATTATG
ACTTCGAGGG CGACGAGGGG CTTCGGCCAA ACACTACTAC ATACAACCTT GTCATGCGCG
CCTGGGCGGA AAAGGGAGAG CCAAAGCGTA CGGAAGCGCT CCTCGCTGAG CTCATTGGTC
ATTCAGAAAA AGACCGAGCT GGCAACCTGT TCCCTACATC CGAATCCTAC ACGCTGGTCA
TTCGTGCGTG GCTCGTTTTG GCGAATAGGG GTGATAAATC AGGCTTTGAA ACAGCTGCTT
ATTGGTTTTA TTGCTTGGAA GCACGCGAGA GAGACGAGAG CGGATTGGTG GCTCCTAGCG
AATTTTATAC TTTGTTATTG GCTGCCGGTC GAAAGTGTGC CTCTCAGCAC CCTGACATTC
TCGAAACTGC TGTAAAGATC TTTGATCTGT TACGAGAATC TCACCATCGT GTCGACTGTT
TACACTACTC GAGTTTGCTA CAGATAGGAC TACTAGCCCT TTCGCGAGCA GAACAAAACA
AAGTACGACA GGCGTTTATT GATGAAATTT TCAAAAATTG CTGTGAGGAC GGTCTCGTCA
GTAGCCATTT TCTACAGGCT CTCGCGAACG GCCCCGTCTA CTACGATGGT TGGACGGTTG
AGGAAAGCCA GCGCACTCTA AAGCGTATCA TTCCCTGTTG GCCTCTTCCA TATACATGGA
CGAGAAATAT TAGACAAAAA GGCTTCTTCC CGCAGCGACA AGGATTGAGA AGAAGTAACT
TTGTTTGCTC ACCGCACGGA AAGGACCCAT ACAAGACCTA A
 
Protein sequence
MMCSFGIASA SIPKPALSPW TTTSKKFLTQ PAPWNSGCVS TKFPSKLHRP FIQFQRYHIA 
ACYKQHICRQ RLTCFQRSLI FTNCGGFKYW HLPGFLDDTS IEEKEDWLQN LLQDKEDLSD
TEAYLVVLRS LATSTQPDAP MKAERWLRRL EARSVSNPDA PQPTTECYQR VIEAWGQATN
EDPNLLITRT QRWLMKHLRN DNIDLRPDTA CFNSFLDLCS KGRALKRAKA TDGNLVRDHA
LKAEQTLRLM IFKRRKEGED SSMAPNVDSF NFAIRAWTRC RRSPDIADRS ISVLHLLENY
EKTLDSSVRP NVKSYAMVMD SIAVVARLKV KRCQSMPKTV ENPSTNGLNE INLLQEVVSY
MRNQASLGKH HLAPNGVIFN TLISCWSSLA KIHSHAPNES EKILQSMIRM KDMGENHTAP
DATSYLMVMR TWLNSQQSIR AERISWWLSK QWKDYDFEGD EGLRPNTTTY NLVMRAWAEK
GEPKRTEALL AELIGHSEKD RAGNLFPTSE SYTLVIRAWL VLANRGDKSG FETAAYWFYC
LEARERDESG LVAPSEFYTL LLAAGRKCAS QHPDILETAV KIFDLLRESH HRVDCLHYSS
LLQIGLLALS RAEQNKVRQA FIDEIFKNCC EDGLVSSHFL QALANGPVYY DGWTVEESQR
TLKRIIPCWP LPYTWTRNIR QKGFFPQRQG LRRSNFVCSP HGKDPYKT