Gene PHATR_44101 details


Gene Information

Locus tag: PHATR_44101
Symbol:
ID: 7204036
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011671
Strand:
Start bp: 987553
End bp: 989020
Gene Length: 1468 bp
Protein Length: 415 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002186165
Protein GI: 219113163
COG category:
COG ID:
TIGRFAM ID:
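
The Gene Length field agrees with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. The record does not state its coordinate convention, so that reading is an assumption. A minimal Python sketch of the arithmetic, with illustrative variable and function names that are not part of the record:

```python
# Coordinates copied from the Gene Information fields above.
# Assumption: positions are 1-based and inclusive (not stated in the record).
start_bp = 987553
end_bp = 989020


def span_length(start: int, end: int) -> int:
    """Return the number of bases in a 1-based, inclusive span."""
    return end - start + 1


gene_length = span_length(start_bp, end_bp)
print(gene_length)  # 1468, matching the Gene Length field
```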


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAACATACAA GCTTACACAG GCAGCAAACA TTCCCTTCAC TTGACCATAC GGCTATTGAT 
CGTAAATCCG GGCCATTTTA CTGTAAGCGA ACCCCGCTTA CCGAAGATGG AGTCTATCCC
AAGTGAATCG TTCTTGGATG AGGGGCAGAT GTCAACAGCA AATAATAAAA GCAGTCTTGC
CAAGGGAGAG GCAGCAAAAA ATATGCAGCC GCTCCCAGAT GACTTCCAAC CTGGAGAGAA
CGATGTTATT TGTGGGCGCG GCCGCAATGT TTTCAACCAC ATTGGTAATG ATAGCTTTCG
TACGATTGTT GCGGGGTACT TGGATCATTA TAATCAAGCT TCAGCCAAGT TAGAGAAGAG
TTTTATCCTC TCAGAGATCG TTACGAAGGT CCGTGAACTC AGCCCGAACG GTGGATTTGT
GAAAAAGGAT CCCAAATCAG GCCGCTGGTT TGAAGTAGGT GACTTTCTGG CCCGGGAGAA
AACTTCCCAA GCATTCCGCG ACGCTCTTCA CGACAAGTAC AGATCAAGCA ACACAGCAAA
AAAGCTGAGG CGCCAAGTCG AACAAATAGA CAGGCTGCAT TCCTCCCAAA GCGATGAGGA
ACGAGATTTT ATTTCGCAGA ACTCGGCTTC CCTCCAATTG GGTGATCTTG AATCTGTACA
ATCATCCTTA CTTGGAACGG ACCTACAGAG AGCGAACGCT TTTTTGGGGG AGAACTTACA
GTTAATGCAA GCACGCCGGT CAGCTCGTTC AGTACTGGAC TTTCATGCCA ACAACAAGGC
TACAGGCCCC CTGAAACAAA ACAATTTCAA TTGGTCATGC CCTAATCTCG GGAGCAAAAG
AAACACAACT CCGCTCAATA CCGAAGCTTC GATGCAAAAT TTCGACTGGG GTACGGCCGG
CCAAACGAAT ACCAAGCCCT CAGCGAGCCT TTTTGCAGAT TTGGGTGATT TCCCTCCTCT
GCCCATCGCT AACGGCATGC TGGCCAACCA GACAGGGGGT TTGCCGGGAA GAAATCAAAA
CAGTGCATCA TTCGCCAACC TTCCTCTCGG GAACACATCG TTGCTTGCAA ATTTTTTGTC
GCAATCAATT GCTCCCATCC ATGAGAATGC GCCTGTGGAA GGTCCATCGA TTCAAGCTCT
AATGGGAAGA GCTTCCAAGT TTGATCCGTT TGACAGCATG CTTTCTCCCG AAGATTCGCA
AAAAGAATCA CTTAAGTCTT TGGATGATAT CATGATCACG CCTATAGGAG CAGGTAAGGA
TTCGGACTTG TTTGCTAAAC TAGCACTACT CACTGATGAA TATAAGGGGG ATGGCAATGT
TTTTGAGCCG ACACCCATCG GAGAGACCGC ATAAAGCCAA TTCATCTCCC CTGATGTAGA
TACATGGATG CCTTGCCAGT TTCTTTACTT TTGATTGGCC TTATACACTA CTTCATATAA
TATTTTATGG ACTGAATGTC AGCGTCAT
 
Protein sequence
MESIPSESFL DEGQMSTANN KSSLAKGEAA KNMQPLPDDF QPGENDVICG RGRNVFNHIG 
NDSFRTIVAG YLDHYNQASA KLEKSFILSE IVTKVRELSP NGGFVKKDPK SGRWFEVGDF
LAREKTSQAF RDALHDKYRS SNTAKKLRRQ VEQIDRLHSS QSDEERDFIS QNSASLQLGD
LESVQSSLLG TDLQRANAFL GENLQLMQAR RSARSVLDFH ANNKATGPLK QNNFNWSCPN
LGSKRNTTPL NTEASMQNFD WGTAGQTNTK PSASLFADLG DFPPLPIANG MLANQTGGLP
GRNQNSASFA NLPLGNTSLL ANFLSQSIAP IHENAPVEGP SIQALMGRAS KFDPFDSMLS
PEDSQKESLK SLDDIMITPI GAGKDSDLFA KLALLTDEYK GDGNVFEPTP IGETA
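
Both sequences are shown in blocks of 10 residues separated by spaces, so the whitespace has to be stripped before the text is used elsewhere. The Python sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names normalize and gc_fraction are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def normalize(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "CAACATACAA GCTTACACAG GCAGCAAACA TTCCCTTCAC TTGACCATAC GGCTATTGAT"
seq = normalize(first_line)

print(len(seq))  # 60: six blocks of 10 bases
print(f"{gc_fraction(seq):.0%}")  # GC percentage of this one line only
```

Applying normalize to the full gene and protein blocks and comparing the resulting lengths with the Gene Length and Protein Length fields is a quick check that nothing was lost when copying.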