Gene PHATRDRAFT_47544 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47544 
Symbol 
ID7202781 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp70783 
End bp73859 
Gene Length3077 bp 
Protein Length935 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181993 
Protein GI219123358 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCACCTTTCC CTTTCCCATC GCGTCCCCCA TTGTCATTCG ATATAGACTC GTGGCCGAAA 
CGAGCGAGAA TCATCCTAGT CATAAGAGAG AGCGCGAGCA ATTGCGGTAA GAAGCTATTG
TAGAGGGGGA AGTGAACCCA ACCTTTGTTT CGATTCACAC GTCCAAGCCG AGCAAAAAGC
TAGACCTAGG CTTACGGCAA TTATCTTTTC CCGACTTTTT TTCAAAAGGA CCACTATCAT
GCGTATGCAA GTGCTTTTGC TCGTGAGCGC CATTAGTGGA ACAGCGGCTA GGCTCGGTGG
AATCGCACCG ACGTCTATAC AGGAGAGTGC GACCGCTTCG GTTTCGACGG CCAGAACATC
CGGCATTCGT TCGCACTATC CCATCAAGCA ACGTCGTGAG CTCGAAACGG AGCAGCTTTC
GTGGGCCGAC TTCGAGGAGT GGCGCATCAG CTACTCAGAA GATAGTCGAC CCAAAGTCCT
GCCTGCTTTT TACGTCGATT TGTTCGAGAC TAGTGGTGAC TTGTCGGCGG ATTCTCTTGC
CCTCTACGAG ACCGCTATGG AGCAATTTCT GACCGCCGAG CTGGGTGCCG TCTACAATGA
ACGCCCTCGC GTTGCAAGTG TTCGTGCCAA AGTGCTTTCC CAACGCGTCC TTTCACAGTC
CGTCACGCGC CAACGCCGTC TTGACGACAC TCTTGGTGAT GGTGGTACCA CCGGAACAGA
CGGTGTCATG ATAGGGACGC AGTTCGAAAC GCAAGTTAAC GTGACTTTTG TCAATGATCC
ATCACCCAGT GTGTACGCCA TGCGTACCCT CATAAGTAGT ATCATGGATC GCGACATGTC
AGGCTTTGTT GGAAATCTAT CGGCTGCCGC GGGCGACGAT CCCGCTCTTA CCCCTACTGA
GGCTAGGTAC CGTGGCGCAG AGGGGGATGG TGTCAGCATC ACAACCGACG ATCTCGTTGG
AGGATCTGAC CGGGATGACG ATATCACCGA TGACGGTGTT GCTGGGATCA TTGATGGTGG
TAACAGGCCG GTCGAGGAGG ATGATCCGAA TCTCACTTTT ATTGTTCCGA TTCTCGCTGC
CGCTTGCATC CTCATTGCGC TCTTGGCGCT ACTGGTGACC CGTCGTAGAA AGCAAAACAG
CAGCACGCAT CTAGATGCCG TTGATGATGA CATGGATATT TACACCGTGG ATGGGAGGGA
TGTTTTCAGC GAGGAAGACG TTGTCAGTCC CAAAGAACAC AATCGTGAAA CCGAAACAAA
GGTTGCACCT TCGTCTCCCG TGCTAGATTC TACACAAGGC GACAAGGGGG CTCTGGAAGA
CGGCGACGAC GTCTTTGCTG ATTTGCCAGA CTCTCCCCAT CGCCAAACCG GGTTCGGATC
CGTCTTTTCT TTCTGGTCAA ATTTTTCAAG GGCCTCTACT ATATTGGCTT CGAATCGTAA
CAAGCCCGCC AAAGATTCGG AAGGGTCCTC TAAGGAGGCT GCCGCGATTG GTACTACGGC
AGGTGTTGGA GCCATTGTCG TTGCAAGATC CAGCCGTCGG CAAAGATCGC CGGAGAACAC
TCCAGGCTCG CACATGTCTT CGCTCTATAC ATCTGATGAG GAGGACGGAG TTCCAGACAC
GTCTGGGAGC GATATAAATT CGACGTTTGA AACGAATGCC ACTACCGACT CCGTTCTTAT
GTCCCCCCAA TTTGAGAAAA AGCGTTCTTT CGAAGAGCCC TCCTTCCACA AATCCCAGAT
TCAGGCACCA ACAGCTATAG AAACGACCAA GGAAGTTGAT TCCACTCAGT TTGTATCCCA
AGTTCTAAAG CAGACGCAAG ATGTCTCTGT CGCGCCTCAA GACGAGGACT CGGCGAAGGC
GTCTTCGCTT GCTTTGGGCC TTGTCGGCGC GACGGCTCTT TCCGAGTCCC CAGATGTTGA
GGTCCTTTCA CCTGAAGCTG CTGCAAACGA TCCAGCCGAC GAGAAAAGCA ATCAGGTCCC
ACTCGGAGGG GCGGCGGTCG TCCTCTCTGC TGCCGATCAC GAGTCCGTCA AGTCAAGACC
TCAAGAGGAT CAACCAGGAA GTCCTAGAGC AAACAAACGG AACTCAGTGA AATTGGAGTG
GGACAGTCCA GAAAGAAGTA GCAAAAACAT GGCGATAGCT AAATCCAATG TCGAAAAATT
CTCTCCGAAA GCAGCTAGAA GCTCTGGCGG TCCTTCACTC ACCTCGACTC CCCGACGAAG
TGGGCGGCGG CATGCTAAAT CCACAACGGG AGATGGCACT ATGGATTATC AGGCACAGAC
GATGAATGCT GGAGAAGCCC AGACTTCTTT TGACGTTTCA CTTAGCGATA CAGAAACACA
TGGTCCTTCC GAAAGCCACG GAATCACTCG CAGCCTGTTT TTCAATACCT CCAAGATTAA
GAGTCCGAAA TCACCCAGCG AGGCACATAA CGTTGCGAGC CCATCTTCAC CAGCTTCTAT
GAGCTCCTTG GGATCCCGTA ACAGCGTCTC CGTGAAATCC GGGGCATCGG AGCAGTCCGC
TAGTCGCAAA GTGATAGCGG ATTTGGTCTG GCTGGAAAAG AAGATTGCGG ATGCTAGTCG
TAGAATTGCT ACATCACCAC AGTCAACGAA CGCCCCTGGC AAACCTAAAA CGTCTGCCAT
AGACCAGTCT AGCGATTCAC TTTCCTTTGC ATCTAAGGAA GGAGTAGTTT CTGCCTCCAC
TTCATGCGAT TCAACCTTTG AAGTTGGTTC TCCCCAGAGC GGTTCCGGGA TGCCTCAAGC
CGAGGAGCTC ATTGTATGCC GCGACTGCTT TGCACCTCCG GGTAAACTTA AGATAATCAT
CCATTCCACC AAAGATGGAC CGGCAGTTCA CACGGTTAAA GACGGCAGCA GTCTCACGGG
TCACGTGTTC GCCGGAGACC TGATCATCAG CGTCGACGAT ATCGACACGA GGTCATTTAC
GGCGGAGCAA GTTATGAAGA TGATGACAAG CAGGACCAAA TTTGAACGAA AGATTACCGT
ATTGCATTTC GAAGCTGCCG TGGCGAAGCA GGAAGTAACG CTGTGAAGAA ATTTATTAGG
GACAGGTTAT ATTTCGT
 
Protein sequence
MRMQVLLLVS AISGTAARLG GIAPTSIQES ATASVSTART SGIRSHYPIK QRRELETEQL 
SWADFEEWRI SYSEDSRPKV LPAFYVDLFE TSGDLSADSL ALYETAMEQF LTAELGAVYN
ERPRVASVRA KVLSQRVLSQ SVTRQRRLDD TLGDGGTTGT DGVMIGTQFE TQVNVTFVND
PSPSVYAMRT LISSIMDRDM SGFVGNLSAA AGDDPALTPT EARYRGAEGD GVSITTDDLV
GGSDRDDDIT DDGVAGIIDG GNRPVEEDDP NLTFIVPILA AACILIALLA LLVTRRRKQN
SSTHLDAVDD DMDIYTVDGR DVFSEEDVVS PKEHNRETET KVAPSSPVLD STQGDKGALE
DGDDVFADLP DSPHRQTGFG SVFSFWSNFS RASTILASNR NKPAKDSEGS SKEAAAIGTT
AGVGAIVVAR SSRRQRSPEN TPGSHMSSLY TSDEEDGVPD TSGSDINSTF ETNATTDSVL
MSPQFEKKRS FEEPSFHKSQ IQAPTAIETT KEVDSTQFVS QVLKQTQDVS VAPQDEDSAK
ASSLALGLVG ATALSESPDV EVLSPEAAAN DPADEKSNQV PLGGAAVVLS AADHESVKSR
PQEDQPGSPR ANKRNSVKLE WDSPERSSKN MAIAKSNVEK FSPKAARSSG GPSLTSTPRR
SGRRHAKSTT GDGTMDYQAQ TMNAGEAQTS FDVSLSDTET HGPSESHGIT RSLFFNTSKI
KSPKSPSEAH NVASPSSPAS MSSLGSRNSV SVKSGASEQS ASRKVIADLV WLEKKIADAS
RRIATSPQST NAPGKPKTSA IDQSSDSLSF ASKEGVVSAS TSCDSTFEVG SPQSGSGMPQ
AEELIVCRDC FAPPGKLKII IHSTKDGPAV HTVKDGSSLT GHVFAGDLII SVDDIDTRSF
TAEQVMKMMT SRTKFERKIT VLHFEAAVAK QEVTL