Gene PHATRDRAFT_18036 details


Gene Information

Locus tag: PHATRDRAFT_18036
Symbol:
ID: 7197081
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011670
Strand:
Start bp: 103947
End bp: 105235
Gene Length: 1289 bp
Protein Length: 385 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002177864
Protein GI: 219112225
COG category:
COG ID:
TIGRFAM ID:
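
The length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp coordinates are 1-based and inclusive (the record does not state this, but the listed gene length matches under that assumption). The function names are hypothetical helpers, not part of any database API. `gc_percent` shows one common way a GC content figure is computed; whether the record's 54% was derived exactly this way is not stated.

```python
# Values copied from the record above.
START_BP = 103947
END_BP = 105235
PROTEIN_LENGTH_AA = 385


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length_bp(protein_length_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon."""
    return 3 * (protein_length_aa + 1)


def gc_percent(seq: str) -> float:
    """Percentage of G and C among all bases; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


print(gene_length(START_BP, END_BP))        # 1289
print(coding_length_bp(PROTEIN_LENGTH_AA))  # 1158
```

The 131 bp difference between the two numbers is consistent with the listed gene sequence beginning upstream of the start codon rather than at it.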


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CCCATACAAT CACATCCCGT AGTCTCGGGC AAGGAACGCA AGGATAGCAA ATCCTTTACC 
AGTACCGACA GCTCCCACAT CAACAACACC GTACGTGCAG TAGCAGTCAC AAAGGGTCGT
TACAGTACAG CATGGTAGTA CGAACCCCGC GCCAGGGCGC GTTTGTACGT CTATGGCAAA
CGTGCATCGT CTTTTTGTTG CTAGGCAGCC ACGTGTTGCT CGTACCGGTA GTGCAGGCTG
CATCTCGCGA CTACTACCAA ATTTTGGGAG TATCACGCGA CGCCACCATC AAGGAAATCA
AAAAGGCCTA CCGGCAAAAG TCGCTCGAGT TTCATCCCGA CAAGAACAAG GACGAAGGTG
CGTCGGAGAA GTTTGCCGAA GTGGCTCGGG CCTACGAAGT GCTTTCGGAC GACGAATTGA
AAGCCGTCTA CGATCGACAC GGGGAAGACG GTTTGAAGCA ACGCGAACAG CGTGGTGGGG
GAGGAGGAGG AGGAGGCTTT GAAGATCTTT TCTCGCAGTT TGGCTTTGAC TTTGGCGGTG
GACGACAGCA GCGCGATCAA GAACAGCGCA CGCCGGATGT CGAAATTCCA CTCTACGTGT
CACTCAAACA GTTATATCTC GGTGAAACCA TCGATGTCGA CTACGTCCGT CAGGTACTCT
GTTTGCAGTG GGAAATGTGC GTCAAGAGTG CGCCCGATTG TCAAGGGCCG GGCGTCCGGG
TACGCCGACA ACAACTCGCC CCAGGATTTG TACAACAGGT CCAACAAAGG GACGACCGCT
GTGTGGCCCG GGGTAAGCAA TGGCTGGATA AGTGTCGCGA ATGTCCCCGC CAGACGGAAA
CGGAACGAAT CCAAGTGACT ATTGAAATCC AACCAGGATT CCGTGCGGGA GAAAGGGTTA
GCTTCGAAGG CGTGACGGAC GAAAAACCCG GCTTCAAACC GGGCGATTTG CATTTTGTAC
TCATGGAAGA ACCGCACGAT GTGTATCACC GGGATCGGGA TGACTTGTAC AAGACTATGG
AAGTCCCATT GGTGGATGCG TTGACGGGAT TCTCCGTCAC GCTCAAGCAT TTGGACGATC
ACGAGTACAC GGTGACGGTG GAGGATGTGA CGGATTGTGA TCACGTCTTG CGCGTGCCGG
GAAAGGGAAT GCCGCGACGC AGCGGGCGTG GCTTTGGTGA CCTGTATCTC ACCTTTGAAG
TCGACTTCCC CGATACACTG ACTCGTGAAC AAAAGGACGC CATTCGCAGT ATTCTGGCTC
CGGGAGAAGA AGCGAAGCAA GAATTGTAG
 
Protein sequence
MVVRTPRQGA FVRLWQTCIV FLLLGSHVLL VPVVQAASRD YYQILGVSRD ATIKEIKKAY 
RQKSLEFHPD KNKDEGASEK FAEVARAYEV LSDDELKAVY DRHGEDGLKQ REQRGGGGGG
GGFEDLFSQF GFDFGGGRQQ RDQEQRTPDV EIPLYVSLKQ LYLGETIDVD YVRQVLCLQW
EMCVKSAPDC QGPGVRVRRQ QLAPGFVQQV QQRDDRCVAR GKQWLDKCRE CPRQTETERI
QVTIEIQPGF RAGERVSFEG VTDEKPGFKP GDLHFVLMEE PHDVYHRDRD DLYKTMEVPL
VDALTGFSVT LKHLDDHEYT VTVEDVTDCD HVLRVPGKGM PRRSGRGFGD LYLTFEVDFP
DTLTREQKDA IRSILAPGEE AKQEL
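
The relationship between the two sequences above can be illustrated by translating part of the gene sequence. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the record's Translation table field is empty, so that choice is an assumption. `translate` and `CODON_TABLE` are hypothetical names introduced for illustration. The input fragments are the third 60-base row of the gene sequence, which contains the first ATG, and the final 18 bases of the last row.

```python
from itertools import product

# Standard genetic code (assumed), laid out in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str, start: int = 0) -> str:
    """Translate from `start` until a stop codon or the last full codon.

    Whitespace is ignored. Codons with ambiguous bases raise KeyError.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(start, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# Third row of the gene sequence as listed above.
ROW_3 = "TACAGTACAG CATGGTAGTA CGAACCCCGC GCCAGGGCGC GTTTGTACGT CTATGGCAAA"
atg_index = "".join(ROW_3.split()).find("ATG")
print(translate(ROW_3, atg_index))      # MVVRTPRQGAFVRLWQ

# Last 18 bases of the gene sequence, ending in the TAG stop codon.
print(translate("GCGAAGCAAGAATTGTAG"))  # AKQEL
```

Both outputs match the start and end of the listed protein sequence.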