Gene PHATRDRAFT_12989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_12989 
Symbol 
ID7201726 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp849007 
End bp850098 
Gene Length1092 bp 
Protein Length363 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180908 
Protein GI219120335 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GAGATAGCAG CTGTCATTGA GGCTCTTCGG GATGGATTTC TCGCCCCTGG ACCTAAAACA 
GAGGACTTTG AGCACCAGGT TTCGTCGCTC TTTGGCAAGC AACATGGGTT AATGGTGAAC
TCGGGTTCAT CGGCAAATTT GCTTGCGTTA AATGCTTTTG GGTTCAAGCC AGGAGATGAA
GTTGTCACGG CCGCCTGTAC CTTTGCAACT GTTATTGCAC CACTCCTACA ACTCGGAGTT
AAGCCTGTCT TTGTTGATGT TGATCCTTCT GCCTATGTTC CCACAGTCGA CGCAATCATG
GAAGCCGTCA CATCCAAGAC GGTAATGATT TGGCTGGCAA ACCTAGTTGG TGCAAAGCCT
GACTGGGAAG AGCTACGCTG CCGCACCAAC TTGCCTCTGT GGGAAGATTC CTGTGACACG
ATATCTGTTA CTACGGTAAC TGACGTTTCA ATGACCAGTT TCTATGCTAG CCATATGATT
ACTGCAGGCG GAGGCGGAGG CATGATAATG GGTAACAACC GCGAATTTAT CGAAAAGTGC
CGCATGTTCC GTGATTGGGG ACGAGTTGGC AACAACTCGG AGGCTCTAGA AGATCGCTTC
ACTTCAAGTA TTGATGGAAT CCCATATGAT GGAAAGTTTT TGTACGGAGT AGTTGGATAC
AACATGAAGT CAACCGAGAT GAATGCCGCC TTTGGACTTG CTCAGCTGAA GAAGTTGCCG
TCCTTCCGTG CCATCCGTCG GGCCAACTTC GACCGCTTTA TGTTAAAGTT GAAAGCTTCA
AAAACATTTG TTCTCCCCAA AGAGAAAAAG GCATTTGATT GGCTGGCTTT CCCTCTTTTA
CACTCCAAAC GGGGTGAGGT TTTGCAGTTT CTGGAGGGCA ATGATATTCA GACTCGCGTA
TTGTTTGCCG GAAATATCAC TCGGCACCCA GCGTATCGTC ATCTCTTTGT CTCGGAGAGT
GCATTTCCCA ATTCTGATCG TATCATGGCA GAGGGTTTTT TGCTTGGTTG TCACCATGGA
ACCACCTTTG AGCAGATCGA TCGTGCCTGC GAGCTCCTCT TGCAGTTTGA GAAGAATCTG
GAAGTTATCT AG
 
Protein sequence
EIAAVIEALR DGFLAPGPKT EDFEHQVSSL FGKQHGLMVN SGSSANLLAL NAFGFKPGDE 
VVTAACTFAT VIAPLLQLGV KPVFVDVDPS AYVPTVDAIM EAVTSKTVMI WLANLVGAKP
DWEELRCRTN LPLWEDSCDT ISVTTVTDVS MTSFYASHMI TAGGGGGMIM GNNREFIEKC
RMFRDWGRVG NNSEALEDRF TSSIDGIPYD GKFLYGVVGY NMKSTEMNAA FGLAQLKKLP
SFRAIRRANF DRFMLKLKAS KTFVLPKEKK AFDWLAFPLL HSKRGEVLQF LEGNDIQTRV
LFAGNITRHP AYRHLFVSES AFPNSDRIMA EGFLLGCHHG TTFEQIDRAC ELLLQFEKNL
EVI