Gene PHATRDRAFT_31718 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_31718 
Symbol 
ID7196018 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp765983 
End bp767417 
Gene Length1435 bp 
Protein Length468 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176650 
Protein GI219109793 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.160049 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCCCG GTCACATTGG GTTCGGTCTC AACGATATCG ACATTTCTTC TCAGTTGACG 
CGCAACATTA AGTTGCAAGC CCCGTTTGTT TCCAGTCCAA TGGATACGGT CACGGAACAT
ACGATGGCCA TTCAAATGGC TTTGCAAGGA GGCATAGGAA TCATCCACTC GAATATGAGT
CCAGAAGAGC AAGCGGATCA AGTGCGCACA GTAAAAAAGT TTAAAAATGG ATTCATTACG
GATCCGATCT GTTTGTCCCC GGATAATACT GCAGAAGATG TTTTTAAGAC CAAGGCCAAA
CGCGGCTTTT CTTCTTTCCC AATCACGGAA GGCGGGAAGA TGGGAGGTAA ACTTTTGGGG
ATTATTTCAA ATCGTGACAC TTCCTTCATT GAAGATCCGA CGGCCAAAAT TTCTGTTTTC
ATGACTCCGC GAGATGCCTT GGTGGTCGCG CAGGACGGTA TTTCCTTACA GGAAGCCAAT
GATGTACTGA AGATTTCCAA AAAAGGCAAA CTTCCCGTTG TCAACGAGCA GGATGAATTA
GTCGCCTTGA TTGCCCGCAC GGACTTGCAA AAGCAGCGAG ACAACCCCTT GGCGTCCAAA
GAGTCGGTTA ACAAACAGCT ACTAGTTGGA GCATCGATTG GTACACGCCC AGAGGACCGT
GATCGCGCCA GACTTTTGGT TGAAGCCGGT GTGGACGTGA TTGTGGTCGA TTCCAGTCAA
GGCGACTCTA TCTATCAGCT CGACATTATT CGTCATTTAA AAGAACTAGT ACCCTCGAGT
GGACGTCATT GGGGGCAACT GCGTCACTCC ATCGCAAGCC TATCATTTAA TTCAAGCTGG
CGCCGACGGA CTTCGTGTAG GTATGGGTAT TGGCAGCATT TGCACCACCC AAGAAGTGTG
CGCTGTCGGC AGAGCGCAGG CAAGTGCAGT ATATCATGTT GCCAAATTCG CACGTAAACA
TGGCATTCCG ATTATTGCGG ATGGTGGTGT CAAATCAACG GGGCACATCA CTAAGGCCTT
ATGTTTGGGA GCTGGTTGTG TTATGATGGG AAGTATGTTG GCAGGAACCG ACGAATCGCC
TGGTGAATAT TTCTACCAAG ACGGTGTTCG CTTGAAACGC TATCGGGGAA TGGGAAGCTT
AGAAGCTATG AATAAGGGTA GTGAGAAGCG CTATGTTTGG GATGACACGA CGACTGCCGT
TAAGGTCGCG CAAGGTGTGA GTGGAGCTGT TCAGGATAAG GGCACACTTC GTCGATACGT
CCCCTATCTC ATGCAAGGTG TTCGTCACGG TCTTCAAGAT GCTGGTTGCA AGAGCGTCAA
AGAGGCGCAA GAAAGGCTTT ATTCAGACAA GTTGCGATTT GAAATCCGTT CGCCGGCTGC
TCAAGCAGAG GGTGGCGTCC ATGGCCTCCA CAGCTTCCAG AAGCGACTTT ATTAA
 
Protein sequence
MMPGHIGFGL NDIDISSQLT RNIKLQAPFV SSPMDTVTEH TMAIQMALQG GIGIIHSNMS 
PEEQADQVRT VKKFKNGFIT DPICLSPDNT AEDVFKTKAK RGFSSFPITE GGKMGGKLLG
IISNRDTSFI EDPTAKISVF MTPRDALVVA QDGISLQEAN DVLKISKKGK LPVVNEQDEL
VALIARTDLQ KQRDNPLASK ESVNKQLLVG ASIGTRPEDR DRARLLVEAG VDVIVVDSSQ
GDSIYQLDII LDVIGGNCVT PSQAYHLIQA GADGLRVGMG IGSICTTQEV CAVGRAQASA
VYHVAKFARK HGIPIIADGG VKSTGHITKA LCLGAGCVMM GSMLAGTDES PGEYFYQDGV
RLKRYRGMGS LEAMNKGSEK RYVWDDTTTA VKVAQGVSGA VQDKGTLRRY VPYLMQGVRH
GLQDAGCKSV KEAQERLYSD KLRFEIRSPA AQAEGGVHGL HSFQKRLY