Gene PHATRDRAFT_44908 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44908 
Symbol 
ID7199603 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011673 
Strand
Start bp609454 
End bp612506 
Gene Length3053 bp 
Protein Length877 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179032 
Protein GI219116474 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TGCTTTTCCG GCACTCACAT TCAAAGAATG ATATATGATG CGCGAGTCCG TGCTACTGTT 
GCATTTCCTC TTATTGGGTG CTACCGATCT CTGGACGGCT GAAGCTGCTA TCGCAACTAG
CGACACCGTT ATCTCGTCAC GGCGGATCGA CACGGCGGTT GCCGCCGAAT CGGCATTGAT
TGATAAGGAA ACCGGCCGAA TTTATTGGGA AGGAGGTTCG CAGACGACTT TGTCGGCAAC
ATTGGGGCCA ACGTTAGCTC AGTTTGGAAT CCAACACGTA AAGGCGACTC TATTGTCGTT
ACTGCTGGTT TTTGCCGTCG TGGGCGTTCT CTTAGGTTGG CTGCGACACA AAGTAGAAAC
GTCACTATTG TTCCAATCCG ATCATCGGAG AGTTTACATT TCGATAGTTT ATCATCTGCT
ACAATGGACG ATATTGAGAA CGCCACGGCT GCCCCCAAAA CTGGTTACGG CGATTGTGTT
ACTGTACTTC CTGGAAGCCT TCCAATGCAG CACTCGGACG TACTTGGCTA ACGCAATTTG
CAGTCCCGAA GAAGTTGAGC GCTACATCGA AAATCTAAGG AGTCAGGATC CCAAAATTCA
ATGGACCGTT CGGTCTTTTC ACTATGAGCC CTTCTATACG GCGCTGTTAC GAATATTTCA
GCGGCAACGT AAATCTACGA GCGAAATCGA TTGTGATGCT ACTGAAGGCA CTACTTGTTT
TTCAAAATTT ACAGGCCCAA ATACTGACAC AGATGAAAAT ATTCGGGAAT CAAGGAGAAA
AGGGCCGACT TTGGATTCCA ATCACTGGTG GGTGCGCAAA CTCATTACTC ATAACGCCAC
TGGCACATAT AATTACCAGC AAGTGACAGA TCTCACGACA GCCGGAGTTT GGCGACGTGC
TCCGGCCTCT CCCATTGCGC CATTCTCCAA ATTAATCCTG AGCAAGCATG TTGTCCTATT
GGACGGCAAG ACAAGGGGCG ATTATCTTTC CCAGCAAGCC GACTTTGCTA CAAAGCACGG
CCAAGAGGAC CGGATGGCGG AATACGCCAC CAACCTTGCC GTCGAAGGCT TTCAATCCCG
TGTCCTGGCC GTGAGAGCGA ACAGCAACGA TTTGACAGGG GAAGGTTGCT GGTGGACTAC
CCGCTTTTTT CAGTCTCACA TGTTTTGGCT GGCTACGGCC TTCGGGCTCA CAGTCCCGTA
TCGGTATTGG TTTGCGCGAC ATTGCGACGA AATTCGTATC CGCGTTGTTA AAGAGATATC
AGCCGCTCCA GTTCCAGCAC CATCCTGGTC TTGGTTTGGA CCATCGAAGA ACAACGTGGC
TGATAGCAAG ACTTGTCGAA CGTCAAAAAT GGGCAACGGA GATAGTGACG AAAACTACCG
TTCACTCATG CAGACACTGA GGTTGTACGG CACAACCAGC GTTAAAGACA TGAAGCCGTC
TTCAACAACC AAACTAGTAG AACCAACAAA AATCTCGGGA GAGAATAACA ATGTGACTGT
TGCTGAGCTG CAGCGCGAAG TGGATGATGC CAAAGAAGCT GCTTCGCTGT TTTCCGATTT
GTTGGCGTCT GACACGGAAA CAGCTTCGGA GGCACTACAC GAAGAGGCTG CTCCACCGGG
GGGCACAAAA GACAATTCTG TCTCTGAAAA AGATGCTAGC TCTATCACTT TGGCAGAGGA
CCTGTCGGCT AGTTCTACAA ATACAACAAC AACTAGCAAA AAAGACCAGT AGTATAACTC
ACAGTCTGTT TTGAGCGAGT AGTTTCCACG TGTCACATTG CCATCAAAAA GGGAATCTGG
ATTTGCCCTG CAAAACAATT GAAGATCCGA ACTTGAAGTT CCTTTGGTCA GGCTATGCAA
CCCTTTCCAA CTCATTGGAA CGAACCACAT ATTGTTGGCG ATCACGATGA CGATGATGCG
AGTGGGTTGC CTTTTTCTCT TGGTCGCAGG GTTTGCAGGC GCCTTTACTC CGCTCCAACC
ATTCAACTAT GGATCATCCA CGTCGGTGGT TAAGAGCCGT CGTTCGGCCT TGAGCGCGAT
GCCCGATGGT GGTGTCGTGA TTACTGGTAT GTGCAACTCG AGGGGTGTGT TCCAATTTCA
ATTCTACCGT GCAAGCACGT GTCACACAGT CAGTGCAACT CTGCTGGGGC TTTCTATGGA
TCTGTCTTCA TTTACTTACA CAATGTAATT TAGGCGCCGC GGGCGGGGTC GGCTTTGCGT
ACGCTGGGGA ATTCATGCAG CGAGGCTACG ATGTTGTAAT TTGTGACGTA CGGGATTGTT
CGTCGGCCGC CAAGGCCCTG GAATCACGAC ACCCCGAAGG AGGAAAAATA CATCACGTTA
AATGTGATGT GTCTTCCCAA AAAGACGTCC TAAACTTGGG AAAATTTGCC AAGGAAAAGC
TCGGAACAAT CGGATATTGG ATCAACAATG CGGGCATAAA TGGTGGACGA CGGGATTTAC
GGGAAGTGTC GATGGACCAA GTAGAGATGG TTGTCCGGGT GAACCTAGTC GGTGTGCTAC
TTTGTACTAA AATAGCAATG GAAATTATGG GCGAACAGGA AGAAGTGGTC GGGCACATTT
TCAATACAGT CGGATCAGGG GTCAAAGGTG GCGGTACACC AGGGTACGCC TGTTATGGTG
CCACCAAACG TGGTTTGCCA CAATTGACTG CCACACTCGT CAAAGAACTC GATGAAGGAG
TACAGGGTTA CGAAAAGAAA AAGACCAAGG GAACAATTCA GGTTCACTCG CTATCGCCTG
GTATGGTTTT TACTAAATTA CTGCTGGACG ACTCAACTCC CGAACTACGC AAGTTCCCCT
TTGGAGTTCT GGCCGCCCAA CCCGAAGAAG TGGCAGCAGA TTTAGTACCC AAAATTTTGG
CCCAAAAGAG CAACGGTGGG TCGGTCGAGT TTTTGACGAC TGATCGTATC CTGAATAAGT
TCTTTGAAAG ATTCATTTTA CAAAAGAAGT CCGCTTACAT TGATGATGAC GGTAACGTCA
TCAAAATGCC GGGCGAACAG TACGACGAAA CCGGTGCACG AGCATTATAC TAA
 
Protein sequence
MMRESVLLLH FLLLGATDLW TAEAAIATSD TVISSRRIDT AVAAESALID KETGRIYWEG 
GSQTTLSATL GPTLAQFGIQ HVKATLLSLL LVFAVVGVLL GWLRHKVETS LLFQSDHRRV
YISIVYHLLQ WTILRTPRLP PKLVTAIVLL YFLEAFQCST RTYLANAICS PEEVERYIEN
LRSQDPKIQW TVRSFHYEPF YTALLRIFQR QRKSTSEIDC DATEGTTCFS KFTGPNTDTD
ENIRESRRKG PTLDSNHWWV RKLITHNATG TYNYQQVTDL TTAGVWRRAP ASPIAPFSKL
ILSKHVVLLD GKTRGDYLSQ QADFATKHGQ EDRMAEYATN LAVEGFQSRV LAVRANSNDL
TGEGCWWTTR FFQSHMFWLA TAFGLTVPYR YWFARHCDEI RIRVVKEISA APVPAPSWSW
FGPSKNNVAD SKTCRTSKMG NGDSDENYRS LMQTLRLYGT TSVKDMKPSS TTKLVEPTKI
SGENNNVTVA ELQREVDDAK EAASLFSDLL ASDTETASEA LHEEAAPPGG TKDNSVSEKD
ASSITLAEDL SARFAGAFTP LQPFNYGSST SVVKSRRSAL SAMPDGGVVI TGAAGGVGFA
YAGEFMQRGY DVVICDVRDC SSAAKALESR HPEGGKIHHV KCDVSSQKDV LNLGKFAKEK
LGTIGYWINN AGINGGRRDL REVSMDQVEM VVRVNLVGVL LCTKIAMEIM GEQEEVVGHI
FNTVGSGVKG GGTPGYACYG ATKRGLPQLT ATLVKELDEG VQGYEKKKTK GTIQVHSLSP
GMVFTKLLLD DSTPELRKFP FGVLAAQPEE VAADLVPKIL AQKSNGGSVE FLTTDRILNK
FFERFILQKK SAYIDDDGNV IKMPGEQYDE TGARALY