Gene PHATR_44066 details


Gene Information

Locus tag: PHATR_44066
Symbol:
ID: 7204015
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011671
Strand:
Start bp: 867046
End bp: 869032
Gene Length: 1987 bp
Protein Length: 629 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002186144
Protein GI: 219113121
COG category:
COG ID:
TIGRFAM ID:
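
The Start bp, End bp, and Gene Length fields can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the 1987 bp reported above but is not stated by the record itself. The function name `span_length` is illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = span_length(867046, 869032)
print(gene_length)  # 1987

# Nucleotides needed to encode the 629 aa protein (stop codon not counted).
coding_nt = 629 * 3
print(coding_nt)  # 1887
```

The coding portion (1887 nt) is shorter than the reported gene length, so the gene sequence listed in this record appears to include some bases outside the protein-coding region.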


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TGGTCACCGA ATTCTTTCTT GCCTTCCTTG ACGTAAACAA AATTTGCTAG TACTGGGTCG 
CTGTGAATTT CGTCGAACTA ACAACCACTC CCATCCGCCA TGCCGACGTG GGTCGGAATC
GAAGACGGAT CAGCCCAAGC CTCGAAGGTG TGGCTTGCCC GCCAAGGTGA TCATCAACAG
CAGCAGGAGG ATTGGTGGAA GCCCTTACGC AAATTGGATT GCGAGGCTTT GAACGTTTCG
CCCGAGAAGC CTATTTATAT AGAAGAAGGT CGATGCACCG CCGACCCGGT TAGCGGTATC
GTCTTTTACA ACTTTTTCCG GGGCCACCAG AGACAATTGT GTTCGGCTAT TTGGTTCCTT
CGGGAAGAAA CCTCCAACAA GGACTTTCGG CTTGTGCCTA TCACCAATGA TACGGACTCA
GCAGCCATTG AAGCACTCTA TCGAGCTGCC GTTCAAGCGA CGTCTTCATT AGGAAAAGGT
ATCGCGAGTG TTCTTGATCA GAGAGTCGAC TTGGAACAAG GAAACGCCAA GGTTAAAGTT
GTGCAAACCG GAAATACTCT CACTATGAAG CAGTTCCCGA GCAACAGCTG GTTCATTGGT
GGTGGCCAAG ACCTTCAGCG AGGATACGGT CCTTATGTAG TTGAAGGGGA AGAAGACGAG
ACAGTATTGG GACCTGTCCG TCATCCAGTT TTTGTCGTGC ACGGGATTGG AGAGGCCTTC
TTTGCGCGGG ACGATGTAAA GATACCCTCC TTGATTAACC AGATGAACGC GACACGTATC
CACGTGCAAC AAAAACAAGT CTTGTTGTGG AAAATCGCGT GCCAAAAGGC CAAAAAGACT
GGCCAAGCGC TACCACATCC TCCAAATCGG ATTGAGTTTA TACCCATTGA ATGGTTCAAT
CGCCTGCACG ACAGCAGTAC AGCACTAATG AAGTCGCTTA AAGCCACAAC ACTGCAATCT
ATTCCTGCGT TACGAGCTAT TGCGAACGAT GTAATTTTTG ACGTGCTTAT GTACCTGACA
CCCAACTTTT GTGAATCTGT TTTGGAATGC GTCACGACGC AAGTCAACGA ACTGTACGGG
GCCTTTGCCA AGGTGCATCC TGGTTTTTTG CCACACGGTG GCAAATGCTC CTTTATTGGG
CACTCCCTGG GGTCAGTGAT CGTTTGGGAC TTACTGTCGA TACTAAAGGA TCGCGACGAA
GCCAAAGCCG GAACAACGGC TTCGATGAAT ACACAAGGTG TAGCAATCGC ATCCCCGGAA
AAAGGAGGGT CTGACTTGGG ATATCAAGCG TATGCCAAAG AGACGGGGGC GAACCAAGCC
TGTAACGGTA CTTGGGGTCC ATCATTAACC AAACCTATGA CCCGCACGAT TCCTTTTATA
CCCGAGTGTA CCGTGTTCCT GGGGTCACCA CTGGGGATGT TCTTGACATT GCGAGGTGCT
CACGCTGTGT TCGACGAGTT ACGCGACGTC GCCATCAAAC AGGCCACTCA CCGGGTAGGG
GAAGGAAAAA GTGGTAACGA CGAGCACGTA GATGTCCCGG TCACATCTCC GTTCTCACTA
CCGACTGGAC GCTTGTACAA TATTTTCAAT CCAAGCGATC CGGTGGCCTA TCGGATCGAA
CCCTTGTTGC TACCGCAAGA TCTGGACTCG GCGGAGTTGC CGGCGCCGCG TTACTTGACA
GCGCCTGGAA AGGACTTGAA GTTCCACGTC AAGGCTAAAC AGATTACTGA TGATCTCCGC
AAATCAATTA TGGATCAAAA GATGGTGTGG GGCTCATTGA TTGAATCCGC GGTTTCCGCT
TTGTCCACCG ACGTGGCAAC GACCAAAGCA GCTGGGTTAG CCAAACCTCA TATCCACGGC
GGAGCTTTGA ACTTTCCTTT AGGCGGCAAA AGTGATCGGG TTGATTATTC CTTTCAACCG
GCCGTAATTG ACAACGAATA CATTAGCTCC GTTTTGGCTC ATTCGACGGC CACATATTTC
GGGAACA
 
Protein sequence
MPTWVGIEDG SAQASKVWLA RQGDHQQQQE DWWKPLRKLD CEALNVSPEK PIYIEEGRCT 
ADPVSGIVFY NFFRGHQRQL CSAIWFLREE TSNKDFRLVP ITNDTDSAAI EALYRAAVQA
TSSLGKGIAS VLDQRVDLEQ GNAKVKVVQT GNTLTMKQFP SNSWFIGGGQ DLQRGYGPYV
VEGEEDETVL GPVRHPVFVV HGIGEAFFAR DDVKIPSLIN QMNATRIHVQ QKQVLLWKIA
CQKAKKTGQA LPHPPNRIEF IPIEWFNRLH DSSTALMKSL KATTLQSIPA LRAIANDVIF
DVLMYLTPNF CESVLECVTT QVNELYGAFA KVHPGFLPHG GKCSFIGHSL GSVIVWDLLS
ILKDRDEAKA GTTASMNTQG VAIASPEKGG SDLGYQAYAK ETGANQACNG TWGPSLTKPM
TRTIPFIPEC TVFLGSPLGM FLTLRGAHAV FDELRDVAIK QATHRVGEGK SGNDEHVDVP
VTSPFSLPTG RLYNIFNPSD PVAYRIEPLL LPQDLDSAEL PAPRYLTAPG KDLKFHVKAK
QITDDLRKSI MDQKMVWGSL IESAVSALST DVATTKAAGL AKPHIHGGAL NFPLGGKSDR
VDYSFQPAVI DNEYISSVLA HSTATYFGN
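
The sequences above are printed in blocks of 10 residues, 60 per line. Before they are used in other tools, the spaces and line breaks usually need to be removed. The following is a minimal sketch of that cleanup, along with a simple GC fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (input must be non-empty)."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First line of the gene sequence from this record, as printed.
first_line = "TGGTCACCGA ATTCTTTCTT GCCTTCCTTG ACGTAAACAA AATTTGCTAG TACTGGGTCG"

seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.2f}")
```

Applying the same two functions to the full gene sequence gives a way to compare against the Gene Length and GC content fields of the record.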