Gene PHATRDRAFT_44920 details

Gene Information

Locus tag: PHATRDRAFT_44920
Symbol:
ID: 7199826
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011673
Strand:
Start bp: 639253
End bp: 640783
Gene Length: 1531 bp
Protein Length: 417 aa
Translation table:
GC content: 46%
IMG OID:
Product: predicted protein
Protein accession: XP_002178819
Protein GI: 219116048
COG category:
COG ID:
TIGRFAM ID:
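
The length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, which the record does not state explicitly. The helper names `span_length` and `min_coding_bases` are hypothetical and exist only for this illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 639253
END_BP = 640783
PROTEIN_LENGTH_AA = 417


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def min_coding_bases(protein_length_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon."""
    return 3 * (protein_length_aa + 1)


gene_length = span_length(START_BP, END_BP)
coding = min_coding_bases(PROTEIN_LENGTH_AA)

print(gene_length)           # 1531, matching the Gene Length field
print(coding)                # 1254
print(gene_length - coding)  # 277 bases not needed for coding
```

Under these assumptions the 277 remaining bases would be non-coding sequence, which is consistent with the gene being marked as spliced.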


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGCTTTGTCT TTGTGAATCT ATTTGACAAA GGCCAAAATA TCCTCATTTA CATTGTTATT 
ATTTTAGCAG CAATTACGAA CCTTTATTCT AACAACCTTA CATGCTCGTT AGGATGCGAT
TGCGACCCCA TTTTTGTGTC GTTTTTCTCT CGCTGCTTTG CTGTGGAGTC GATGCAACGA
GCAGTGCTGC GCTTCGCCGG CATTTAGAGT TTGTAAACCG GTCTGGTGAA CGCATTTCGG
TCGACTGGCT GAACCCTTTG ACCGGTCAAC CGGTTCTTTT GGGAGCACCT GTTAACGGCG
AGACGATTCC TTTAGATTCG TTTGTCAACC ATACTTTTGC CATTCGACAA CACCAAGACG
GACAGAAAGT CAAGAGATTT GAGACGCCTT CTCTATCTAC GACTCATACA AGAACTCTAG
AAGCACCGAC AACGTACGTT ACTGTCAGTG AAGAGCCTCA TGATCAAAGG TTTGTAATTC
ATAAAGGATT GACGGTTGAA GAATCAAAAA CAAATGAAGA GATCTTTCCC TTGAAATCCG
ATGAGATATC TTTTGATGTG AGTTTGTGAG AGAGTGCCAT CAAAAAGCAA AAAGTCTACT
CCAAAACAGC AAATCCTCTC TGCGAGCCAT TGAGTCATTG CAAACGTGTT TGGAATACTA
TACAGCTTTG ATATTGGAAG ATAAAAATGA AGAACTGGCA TTTCAGGCAC AAGTCCGTGA
ACAAATTTCG GCTTTGGCTG AGAACCACAC ATGCGCCGAT CCGATGCGCA CAACAACAGA
ACCCATAGAA ATGCGCTCAT GGAGATATCT TGATGAAGAT CCACGAACTG TTCAGGTGCT
GCACAACCGC CCAAGCAGTC AGATTCACGT TCTGGAAGGA TTTATTTCTC CCGAAGAGTG
TCAGGCCATC AAGGATGCAG CAGCGCCAAA GCTACATCGC GGAACCGTCG CCGATGGCAA
AGGAGGCTCC AAACTCAGCG AAAGTCGTAA AGCTTGGCAA GCTGGCGTAG GGGTCGATTA
TTCCCATAAA AATGCTATCT CCGATCTCAA AAAGCGTCTC TTTGCGTACA CCAACGAAGT
CACTGGGTTC AATATGAATC TAGATGGACA GGAGGATATC ATGAGCATTC AATATTTCGG
TGACGGTGTC GGAAATCCGA CACCGGATAG GTACACGCCA CATTGTGACG GTGAATGCAA
CGGTATGCCG CACAAGCGAG GAGGCCGAGT CGCAACTATG GTCATGTACT GTGATATTCC
TGAAATCGGC GGTGGCACCA ACTTCCAGCA TTCAAACGTG TTCGTCGCTC CAACCATTGG
TGCTGCTGCA TTTTTTTCAT ACATGAACAA TGATACTGGT CTTCACGAAA CAGGCTTTAC
TACGCACTCA GGTTGCCCCG TTTTGGAAGG AACGAAACGT ATTGCTGTAC AGTGGATGCG
AGTGGGGGTG GATGAGGATA GTCCATGGGA TTCGTTTGAC ACTAATACTG TTCAGAAGGG
ATCTTTCATA GAGGATAGTA GAGTTGAATA G
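
The gene sequence is displayed in space-separated blocks of ten bases, so it has to be joined before its length or GC content can be measured. The following is a minimal sketch of that step. The function names `clean_sequence` and `gc_percent` are hypothetical, and the way the record's own GC content figure was computed is not stated, so a small rounding difference would not be surprising.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Usage: paste the full block of sequence lines into `raw`.
raw = """
CGCTTTGTCT TTGTGAATCT ATTTGACAAA
"""
seq = clean_sequence(raw)
print(len(seq), round(gc_percent(seq), 1))
```

Run on the complete sequence, `len(seq)` can be compared with the Gene Length field and the percentage with the GC content field.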
 
Protein sequence
MLVRMRLRPH FCVVFLSLLC CGVDATSSAA LRRHLEFVNR SGERISVDWL NPLTGQPVLL 
GAPVNGETIP LDSFVNHTFA IRQHQDGQKV KRFETPSLST THTRTLEAPT TKSSLRAIES
LQTCLEYYTA LILEDKNEEL AFQAQVREQI SALAENHTCA DPMRTTTEPI EMRSWRYLDE
DPRTVQVLHN RPSSQIHVLE GFISPEECQA IKDAAAPKLH RGTVADGKGG SKLSESRKAW
QAGVGVDYSH KNAISDLKKR LFAYTNEVTG FNMNLDGQED IMSIQYFGDG VGNPTPDRYT
PHCDGECNGM PHKRGGRVAT MVMYCDIPEI GGGTNFQHSN VFVAPTIGAA AFFSYMNNDT
GLHETGFTTH SGCPVLEGTK RIAVQWMRVG VDEDSPWDSF DTNTVQKGSF IEDSRVE
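
The start of the protein can be traced back to the gene sequence by translating the codons that begin at the ATG opening the reading frame. The sketch below assumes the standard genetic code (NCBI translation table 1), since the Translation table field in the record is empty. The `translate` helper is hypothetical and handles only unspliced, in-frame input, so it is applied here to the first 30 coding bases rather than to the whole spliced gene.

```python
from itertools import product

# Standard genetic code (assumed), with codons ordered over the bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in-frame DNA codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 coding bases, copied from the gene sequence starting at the ATG.
first_codons = "ATGCTCGTTAGGATGCGATTGCGACCCCAT"
print(translate(first_codons))  # MLVRMRLRPH
```

The result matches the first ten residues of the protein sequence above. Translating the full gene this way would not reproduce the protein, because intron positions are not given in the record.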