Gene PHATR_44088 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_44088 
Symbol 
ID7204027 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp930302 
End bp932392 
Gene Length2091 bp 
Protein Length583 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186436 
Protein GI219113705 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGATTC CAACATACGG TGCAAATCAG GGAGCCGAAA TCGATGCTTA CATTTCGCCA 
TACGCCAAGG TACGTTCGCT AGGAGCGAGT AACCGAATTT TGTAGCCGGT GTCTTGCACC
GCATACCCTC GAAAATCTTA ACCTCTTCTC TTGATCAAGG AGGAACCAAT GTCTCTTGAT
CCTCCGAAAA TAGCAGTACA AGGTCAAGAA GTTGAGGATG CGCAATCGAT ATCAATTGGT
TCAACCTTGC ACCGGTCATC CTCGGATTTC TGTAAGAGTC AACCGATTGA CGAGCCGAGT
CTTCGCCGCG TTAGTAGTCG CCGTAAAGCA GAGCATCCGT TGAAAACAAG CAAATCCCCG
TCGAGGCCGC CATCGGCTCA AAAGTTGCGC AAAAACTTCG TCGGCGTTTT GGAGTCTTTC
GTCAGACACG TAGCATCGGT AGAGCCGAAT GTGGTTTCAT GGACTGGCGA TGGTTCCGGT
TTCTTCTTGA ATGAGTTAGA GGATCCCCAA GCTTTGAACG TTGCCATTTC CAAGTTCTTT
TGGTGTACGT AGGTAGCAAC ACATGTTTGC AGTCACACAG ATGCTGATCT ACATCTCACT
AATTCATTTT GGACTTTCAG ATGGACGCTA CCCCTCGCTT CGCCGCCAGC TTAATGTCTA
TGGATTCAAA AAATATAAAA GGAGTCACAG GTACGATTTG TGCGTTTTCG TTCTCGAGCT
GGACATCCCA TCAGATGACA GGCCCTCATA TACCATTGTA CTCTGTTTCA GATATAAGGG
AGCATTTCAT CACCCGTCAA TACACCGCAA CATGACCAAT TGGCAATCCC TAATGCTAAA
ACCCATTGTC CGTCCAAGTA GAGCTTCTAA ATCGAAGAAT GCGCCATTTC TTCGGCACGG
TAAACCGGTA GATGAAAAGC CATCACCAGC ATCACCACCA AATCGCATGG GAGAAAAAGC
GGCTATATCC TCCCTGACGG GAAAGTGTGC ACTCGAGAAG CTCAATATTC CAAGCTCCGA
TGGTTCTTAC GCTGGGCTTC CGGTTGTATC AGGACGGACC GACCGGGCAG CCCTCACGTC
AGCCTCATCA GGAACGGAAG CCGCTTTGGA AGAGTACGGT ATGTACGACA AATATCACAA
CTTGATTTCA GGAGGCAATG AAGGTGCAGA TGTGGGGGAA ACTGAAGAGA CTTCCTTTTC
AATGGAATAT ACCACACCAT CAGAAATAAA GCCACAAATA GGATTGTTTG ATCCTGTTTC
GTATCTCGAA GGGGGCGGGT TTTCCGAGAA GCGAGACCTC CGGACTCTTG CCGATGCTGC
GGAGCTAGCG GCGGCTAAGG GCTGGAATAC CAATTTGACA AAGTATTCGC CTGTGATCCG
AGCTGACTGT AAAGAGAACG ACACTCCTAT TCACGCAATA TATACAAGAG ACCTGGGCTC
ATTGTGGGAC AAGAGTCCCG CTGCCGTGCG AATGCAGTTT GATTCAACGA CGCCCTTGGT
TTCAGCCTCG CCTATTGATC CTGGAGTCAG GAAAAATATT GGCAAGTATC CGCATCCTGG
TATGCCCGAC TACTCGCCTT CGTTCATAAC CTCAGTCTTG GAACGATCTG GAGCTACATT
GTGGACACCG AATTTTGATG ACAAGGAAAT CGACATTCCG TCGCCTCCAA TGGAACAGTG
GTTCAGTCCC CCTAAATTTC CATCTCTTCC CATGCACAAA TCATTCAGTC CGGGGAAAAA
CCTGGATCTC GATACACTCG AAACTTACGT CACAAGGATT GACATTGAGT CGAAAAAACG
CAAAAGAAGC ACTGTGCTTT GTTCTCCCCC GTATCTGTCT CCTGATTCCC CTACAATGAC
TTTTGAAGAC TTTTGTAGTG TGTCTAGCGA AATTTCATTC TCAACGGAAA GATTGAGCGG
CGAGGCTGAA ATCTGTACGG CATCTAGCTA GAGAGCTTCC AACCTTGGTG TTGGGTGTTC
TTCTTTCAGC ATGCTTTATA TTTGTGTTTG CCAGATGGTG CCAACATCCA AAATGTTGCC
TGTCAAGGAC GCGTTCCGCT AAACTTTGAG ATACAATCTT TCAGTCAGCC C
 
Protein sequence
MSIPTYGANQ GAEIDAYISP YAKEEPMSLD PPKIAVQGQE VEDAQSISIG STLHRSSSDF 
CKSQPIDEPS LRRVSSRRKA EHPLKTSKSP SRPPSAQKLR KNFVGVLESF VRHVASVEPN
VVSWTGDGSG FFLNELEDPQ ALNVAISKFF WFTQMLIYIS LIHFGLSDGR YPSLRRQLNV
YGFKKYKRSH RYKGAFHHPS IHRNMTNWQS LMLKPIVRPS RASKSKNAPF LRHGKPVDEK
PSPASPPNRM GEKAAISSLT GKCALEKLNI PSSDGSYAGL PVVSGRTDRA ALTSASSGTE
AALEEYGMYD KYHNLISGGN EGADVGETEE TSFSMEYTTP SEIKPQIGLF DPVSYLEGGG
FSEKRDLRTL ADAAELAAAK GWNTNLTKYS PVIRADCKEN DTPIHAIYTR DLGSLWDKSP
AAVRMQFDST TPLVSASPID PGVRKNIGKY PHPGMPDYSP SFITSVLERS GATLWTPNFD
DKEIDIPSPP MEQWFSPPKF PSLPMHKSFS PGKNLDLDTL ETYVTRIDIE SKKRKRSTVL
CSPPYLSPDS PTMTFEDFCS VSSEISFSTE RLSGEAEICT ASS