Gene PHATRDRAFT_44534 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44534 
Symbol 
ID7198067 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp829471 
End bp830881 
Gene Length1411 bp 
Protein Length417 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178313 
Protein GI219115035 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.058235 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ACAATGAGCA GTTGCACACG TGCGGGCTTT CCGATAAAGC GGACAGTCAT TTCATCAATT 
ACAAGCTTGA TTTGACACAA GTGAGCACTC TCCGTTTGAC TCAACGAGTA TGAATGAAAC
ATATTCGCCT TTTGAGTGCT TACCAGTGGC TGACTTGACT TCGGCTGGTC TACTACAGAA
GCAGGATGAT ATTGAAGTCG AGCTTATTGC CCGAGAAAAC GATTGCGAAT TGCGTAGTGG
AGTGGACGAT CCCATGGTAG CAATAGGTGC CAACGGACAA GCGGTATCCA CACATGGGAA
GGCTTGGGTG GATCGAAAGA TGGGCTTGAC TGTCACTATC ACAACCCACC GGATCGTCCT
CATGCAGCAA ACGTCGGACA AACGAGTCAA CGCTCGTTAT ATCCACCTTT CTCACGTTCT
AGCCGCTGTG ACTGAAAACC AACTTTTTAA GAGTCCTAAA ATAATATTGG ACTCCTACAG
CGGGGAGTTT CTTCTCGTGT TCAAAGGCAA AGAGGCCAAT AAAGATCGAG ATGCCGTGCT
CTACCACATA CAAAAGGCGC TTTCACGTCA AGATTGGGAG ACAGCCGACC GGGCAGCGCA
ACACCGAAAG GCTGTAGCAA ATTTGACTTC CCGCAAGGTA GGCGTCGACG CGGTTCTCGC
CAAGCACAAA ACTCGGCACG CTCAAGCGGC TCGTCTCACG GACTCCGCTT TCGATGGAGA
CGCCGAAACG TTGCTACGGG AAGCCCATGA ACTCGTCGCT GTCATTCACA AATACGTGGC
AACGCTCGAT AAGCAAAAAG AAGTTTCCTC ACAAGACGAA CAGGATGCAA CCCGTTTGGC
AGATTTGCTG CAAAACATGG GAATGACGTC GGCCCTGTCC AAAGCGAACT TTCTAGGCTC
GGAAGATGCA TACTATACGC AATTGGCCCG ACAGCTGGCC GACTTTTTAG AACCCCATTT
ACACAAGGCT GGTGGTATAC TAACACTGAC GGATGTGTAC TGCTTGTTTA ATCGTGCGCG
TGGCACAAAC CTGATTTCGC CCGAAGACTT GACCAAGGCA GCGTCTCAGA TGGACGCATT
GTCCATCGGG ATGTCTCGAC GGGTTTTTCC AAGTGGACTA ATTGTTATTC AGGATGACTC
CTTTGACGAT CACGCTATGG CAGAGAAACT GCAAGCTTTG GCTTTGGACG CCCCACAGGG
TTTGACGGAA ACGGAAGCCT CACGACAGTG TCAAATCTCA GCCTTGCTGG CTCACGAAGA
ACTACTGGCG GCTGAACGCA TGGGCATTTT GGTGCGGGAC GAAACATTGG AGTCGACGCG
ATTCTTTCCT AACCGATTTG AAGCTTGGGC AGACATACAA TAGTCTTTCC GAAAAAGTTA
AGCCGTACAG CAATAGAGCC AACTGATTTT G
 
Protein sequence
MNETYSPFEC LPVADLTSAG LLQKQDDIEV ELIARENDCE LRSGVDDPMV AIGANGQAVS 
THGKAWVDRK MGLTVTITTH RIVLMQQTSD KRVNARYIHL SHVLAAVTEN QLFKSPKIIL
DSYSGEFLLV FKGKEANKDR DAVLYHIQKA LSRQDWETAD RAAQHRKAVA NLTSRKVGVD
AVLAKHKTRH AQAARLTDSA FDGDAETLLR EAHELVAVIH KYVATLDKQK EVSSQDEQDA
TRLADLLQNM GMTSALSKAN FLGSEDAYYT QLARQLADFL EPHLHKAGGI LTLTDVYCLF
NRARGTNLIS PEDLTKAASQ MDALSIGMSR RVFPSGLIVI QDDSFDDHAM AEKLQALALD
APQGLTETEA SRQCQISALL AHEELLAAER MGILVRDETL ESTRFFPNRF EAWADIQ