Gene PHATRDRAFT_50848 details


Gene Information

Locus tag: PHATRDRAFT_50848
Symbol:
ID: 7199533
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011673
Strand:
Start bp: 1042944
End bp: 1044719
Gene Length: 1776 bp
Protein Length: 544 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002179112
Protein GI: 219116634
COG category:
COG ID:
TIGRFAM ID:
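
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that Protein Length excludes the stop codon; neither convention is stated on this page, and the function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode a protein plus one stop codon (assumption)."""
    return 3 * (protein_aa + 1)


gene_len = span_length(1042944, 1044719)  # Start bp, End bp from the record
cds_len = coding_length(544)              # Protein Length from the record

print(gene_len)            # 1776
print(cds_len)             # 1635
print(gene_len - cds_len)  # 141
```

Under these assumptions the span reproduces the listed Gene Length of 1776 bp, and the 141 bp not accounted for by the 544-residue protein would lie outside the coding frame.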


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.104715
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAACATCGCT GTGGTTGACA ATGGAGAGAG CAAGGTCCAC TACTGTCAGG CAGCCGCATT 
CATTCCGCTC TGCTCTATTG AAATTACAAA TTAATTATCG GTGTTACTTC ACAAACGTTC
TGAAAACGTC ACGCGCTCAA CATGACGTCG CAATCGTCCC ACAAATCGAA TTACGCGCTG
GCCATTTCCT TTCTGGTTCA AGAGCTGATG GATGCGTACG ACAACGGTGA TACCGTCAAT
TTGACGCAAC TCAAAGGCAA GGCCTCGCGC AAGTTCAAGT TGAAGGGAAT TCCCAAAATG
AGTGACATAC TGCAGGGTCT ACCGATCAAT TACCGCAGCA AGCTATGGCC GTACCTGCAG
ACCAAGCCGG TCCGTACGGC CTCTGGCGTT GCGGTCGTCG CCGTTATGAG CAAGCCACAC
CGGTGTCCGC ATATTGCCTA CACTGGAAAC GTTTGCGTTT ACTGCCCCGG TGGACCGGAT
AGTGACTTTG AATATAGCAC GCAGGCGTAC ACTGGATACG AACCAACCTC CATGCGCGCC
ATTCGAGCCC GTTACGACCC CTACAGTCAA GTCAAAGGAC GTGTCGCGCA ACTGCGAGCC
ATTGGACATA CGGTAGACAA GGTAGAATTC ATTGTTATGG GAGGGACCTT TCTGAGTTTG
GATAAGGAGT ACAAAGATTA TTTCATTCGT AACCTACACG ATGCTTTGTC GGGATATCAT
TCGCAAACAG TGGAAGAATC AGTTCGATAC TCAGAGCAAG CTGTTACCAA GTGCATTGGA
ATTACTATTG AAACCAGGCC TGATTACTGC TTAAAGCCAC ACTTGGAAGA GATGCTTTCG
TACGGCTGTA CGCGAATCGA AATCGGTGTA CAAAGTATCT ACGAATCCGT GGCACGAGAA
ACTAATCGCG GACATACGGT GGCGGCTGTT TCTCACTCGT TTCAGCTAGC AAAGGATTGC
GGCTTCAAAG TTGTCACGCA CATGATGCCG GATCTACCAA ATATGGGCTA CGAACGAGAC
TTGGAGGGCT TCAAGGAGTA CTTTGAAAAT CCAATGTTCC GAAGCGACGG TATGAAGCTA
TACCCGACTC TAGTCATCCG AGGAACGGGC TTGTACGAAT TGTGGAAAAC AGGTCGATAC
CAGAATTATA CTCCAGATCA ATTGGTGGAA CTAACCGCGC AAGTTTTGAG CCTCATTCCA
CCGTGGACAC GTTTGTATCG AGTCCAGCGT GATATTCCGA TGCCACTCGT CTCATCTGGT
GTTGAGCACG GCAACCTCCG TGAACTCGCC TTGCAAAAAA TGCGGGAGCA AGATTTGCCG
TGTCTGGATA TTCGATCGCG AGAAGTTGGG ATGAAGCAGA TTCATCACTC CGTCACGCCT
GATCAGGTCG AACTAGTGCG GCGCGACTAC GTTGCCAATG GCGGATGGGA AACTTTTCTT
AGCTACGAAG ATCCGACACA GGACATTCTG ATCGGGCTGC TGCGATTGCG CAAAACATCG
CCTGCTGCAT GGTTGAAGGA AGTTGCCGAA TATCCTTCAA GTATTGTACG AGAATTGCAC
GTATACGGCA CTGCTGTTGC CGTCTCGGCT CGCGACCCGA CTCGTTTTCA GCATCAAGGC
TTTGGTATTC TGCTAATGGA AGAAGCCGAG CATATTGCCC GGGACGAGCA CGGGTCCAAA
AAGTTACTCG TCATTGCGGG GGTCGGAACG CGGCACTATT ACCGCAAGAT GGGGTACCAC
CTGGACGGAC CGTATATGAG CAAAATGTTG CTGTAA
 
Protein sequence
MTSQSSHKSN YALAISFLVQ ELMDAYDNGD TVNLTQLKGK ASRKFKLKGI PKMSDILQGL 
PINYRSKLWP YLQTKPVRTA SGVAVVAVMS KPHRCPHIAY TGNVCVYCPG GPDSDFEYST
QAYTGYEPTS MRAIRARYDP YSQVKGRVAQ LRAIGHTVDK VEFIVMGGTF LSLDKEYKDY
FIRNLHDALS GYHSQTVEES VRYSEQAVTK CIGITIETRP DYCLKPHLEE MLSYGCTRIE
IGVQSIYESV ARETNRGHTV AAVSHSFQLA KDCGFKVVTH MMPDLPNMGY ERDLEGFKEY
FENPMFRSDG MKLYPTLVIR GTGLYELWKT GRYQNYTPDQ LVELTAQVLS LIPPWTRLYR
VQRDIPMPLV SSGVEHGNLR ELALQKMREQ DLPCLDIRSR EVGMKQIHHS VTPDQVELVR
RDYVANGGWE TFLSYEDPTQ DILIGLLRLR KTSPAAWLKE VAEYPSSIVR ELHVYGTAVA
VSARDPTRFQ HQGFGILLME EAEHIARDEH GSKKLLVIAG VGTRHYYRKM GYHLDGPYMS
KMLL
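
The protein sequence can be related to the gene sequence by translating the reading frame that begins at the first ATG matching the protein (base 142 of the gene sequence as listed). The sketch below is a minimal illustration, not the database's annotation method. It assumes the standard genetic code (NCBI translation table 1), since the Translation table field is empty on this page, and the names CODON_TABLE and translate are hypothetical.

```python
# Standard genetic code (assumed), built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the listing) is ignored;
    codons containing characters other than A, C, G, T are reported as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the reading frame and the final 36 bases of the gene,
# copied from the gene sequence listing.
cds_start = "ATGACGTCG CAATCGTCCC ACAAATCGAA T"
cds_end = "CTGGACGGAC CGTATATGAG CAAAATGTTG CTGTAA"

print(translate(cds_start))  # MTSQSSHKSN
print(translate(cds_end))    # LDGPYMSKMLL
```

Both fragments agree with the start and end of the listed protein sequence. Passing the gene sequence from base 142 onward to translate would give the full-length comparison.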