Gene PHATRDRAFT_44959 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44959 
Symbol 
ID7199486 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011673 
Strand
Start bp796899 
End bp798629 
Gene Length1731 bp 
Protein Length500 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178845 
Protein GI219116100 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00935633 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AGTAGTCGAC CACAACCGAG AATCAGCACA ATCCTCCATT CCAAAGAAGC AAACGGACCG 
ACACAACTCG AGTCGTTCCG TCCCATCACT GCGGTCGCCT ACCTCTAGTA ATTGATTGAC
TATGAGGACA ATTCCTTTTC AGTTCTGGAC GCGAATGCTG CTGGCGACGG CGTTGTGCCA
GCATCCCTCC GTGTCGTCGC TGGTGCACGC GGCAGCGGAC GACACTGGCA CGGTGGGTCT
ATCAGCGGGT AAACTGCGAT CCAACGCGGA AGAAGCCATG GCGGTGGGGG ACTACACCAC
CGCTGTGCAG TATTTGCAGG AGGCGATCAC GTTGGAACCG GAAAGTGCGG TGAATCACTA
CAAACTGTAC CGGATTCGAC ACCGAAAACG TCACTACCTC GAAGCCCTCA GGGATATTTC
CCAAGCCGTT GAGTTGGAAT CGTCGGCTTC TTATCGCAAG CTCAAGGCCA AACTCTTGGT
AACGTTGGGA CAATGTGATC GGGCCGTGGC GGAACTGGAT TTGTTGGCAC CCAACGATCA
GGACAATGCT CAGTATGAAA CAGCCAAGAT GTGCCACGAA ACAATACAAC TGGCGGAGTA
CCATTTTCTC AATCAAGAGT ACGAGCTTGC TGCAGAATAT TTTCAGCAGG CCATGTCGTT
TGTTGAGATT GCATCAGATC TTGTTTGGCC CAAAGCCAAG TCCCTGTTCG AAACGGGGGA
CTACTACGGC GTTATCTCCG ATACCGGTAT GTTGTTGAAA CAGCACCCGC ATCACGTCGA
AGCGTACTGT TTGAGAGGGT CTGCATATCA TCGCCTGGGC GAACACGATC AAGCGGTGCT
ACATTTTCGG GAAGGACTCA AGTTGGATCC CGAGCAGGCG GACTGTAAAA AGGGACACAA
GAGTGTCAAA GCGCTCGAAA AGAAGAAAGC GAAGGGCGAC GAAGCCTACG CTGCCGGTGA
TTTTGAAAGC GCATCGGGGC ACTACGAAAG GGCTATGATG CTGGATCCGA CTCACCATGC
CTTCAATCGT CCCGTCCAAC TCCAACTCGT ACAAACATAT TCCAAACTAG GCCAACACAA
AAAGGCCATG GACACAGCAC AGAAGTATGT GGAAGAGCTA GAGTCACTAG AGGGACTCTG
GGCTCTGGCC AACGCCCAAC AAGCTGCAGA CAGCTACGAA GATGCCGTGC GTACATTTCA
GAGGGCAGTC GAGGTTGCCC CAGATGGTAG CGAGCAGGAA CGGGAAGCGA ATCAAAAATT
GAAGAACGCT CAAGTTGCGT TGAAGCAAAG TAAAGAGAAG AACTACTATA AAATATTGGG
TGTGTCCCGA TCAGCCACAG CAAAGGAAAT TAAATCAGCC TATCGCAAAC TCGCACTCAA
GTACCACCCG GATAAAGTTT CGGATGAAGA AAAGGAAGGT GCCGATTCCA AGTTTGCCGA
CATCGGCGAG GCCTACGAAG TCTTGTCGGA TCAAGAATTA CGCACCAAGT ATGATCGGGG
CGAGCAGGTT TTTGAAAATC AAGGAGGCGG TCCGCGGCAT CAAAATCCGT TTCAGTTCTA
TCAACAGCAG TTCCAACAAG GTGGCGGTGG TGGTGGACCA CGAGTGCACT ACCGCTTCAA
CTAGATGGCC ATTTCTCGCA AAAATAGCGA GCGATGACAG TAGCTAATCC TCAAATCGAA
CCACTATAGC TTAATCCAGA GCTCCTCTTG TAGGCGCTTC TTGTTAGACA G
 
Protein sequence
MRTIPFQFWT RMLLATALCQ HPSVSSLVHA AADDTGTVGL SAGKLRSNAE EAMAVGDYTT 
AVQYLQEAIT LEPESAVNHY KLYRIRHRKR HYLEALRDIS QAVELESSAS YRKLKAKLLV
TLGQCDRAVA ELDLLAPNDQ DNAQYETAKM CHETIQLAEY HFLNQEYELA AEYFQQAMSF
VEIASDLVWP KAKSLFETGD YYGVISDTGM LLKQHPHHVE AYCLRGSAYH RLGEHDQAVL
HFREGLKLDP EQADCKKGHK SVKALEKKKA KGDEAYAAGD FESASGHYER AMMLDPTHHA
FNRPVQLQLV QTYSKLGQHK KAMDTAQKYV EELESLEGLW ALANAQQAAD SYEDAVRTFQ
RAVEVAPDGS EQEREANQKL KNAQVALKQS KEKNYYKILG VSRSATAKEI KSAYRKLALK
YHPDKVSDEE KEGADSKFAD IGEAYEVLSD QELRTKYDRG EQVFENQGGG PRHQNPFQFY
QQQFQQGGGG GGPRVHYRFN