Gene PHATRDRAFT_18128 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_18128 
Symbol 
ID7197160 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp469820 
End bp471244 
Gene Length1425 bp 
Protein Length247 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177625 
Protein GI219111747 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0972526 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACATA TAAGTAGGCA GGGTGTCCAC GACAATGTTA TGGGAGTATA TGACGTTTTG 
CAAGATGAGG AATATTTGTT AATGTTCATG CCCTACTGTT CCTCGGGCGA TCTTTTTTCT
TTTGTACAAC AAGCTGGAAG GTTTCCGGAA CCAATGGCAC GATACTGGTT TAAGCAAATT
TTAGAGGTGA GTATTCTTGA GTAATTGGGG TATGCCATAT GTGACGATTG GGCTCTGATC
AATTGTATAC AAATCTTTTG GGTTAAGGGT GTCTCACATT TGCAGAGGAT GGGCGTTTGC
CATCGGGATT TGTCGCTAGA AAACATTTTG GTCGACAAGT ATCGAACATC CATCATCATC
GATCTGGGAA TGTGTTTACG GGTTCCATAC GACGGTGGTA ATCACGTTAC CGATGTTTCG
TCTGGTACAA TCCGGCGTCT GATTAAACCG CTTGTCCCCT GCGGTAAACC AAACTACATT
TCTCCGGAGG TCTTGAAATC GCATGAGGCG TTCGATGGAT TTGCTGTTGA TTTGTGGGCG
TCGGCTGTGA TTTTGTTCAT TATGGTTGTC GGTTTGCCGC CCTGGGAATA TGCCAACAAC
GATGATCCAA GATACCGGAT GGTCATTGGA GGCGGTCTGG AACGTATGCT ACGAAGTTGG
GACCGAGAAA TTAGCCCTCT GGTCGCAGAT CTATTGCAAA AAATGCTGAG AGAAGATCCT
AGACACAGAT TGTCTCTGAT GGAAATCAAG GACCATCCTT GGATTACAGT GGATGAATCT
GTTGATGGGA TGCCATACGC ACCTGAAGAG GAATGGAGAA GGTAAATGAT TCACTCATCA
AACGAATAGC TTGCAATAGC GGGAAGGGAG TAATGTTATT AGATCACGAA AATTAGTGCC
TCGGTGAAGC ACGCACCTCA CTACAAGCCT ATCCAAACAA GCTCCAGTCT CGAATCTGCA
AACTCGAAAT GATCTCGGGA AAGATCGCGC TGGATTGAGA CAAGTCGGCT TCGTTGGGAT
TTGGAGAAAT TGGCTCAGAG AGCGTCACCA ATATATCCGA GCCAACCGAT GGAAGACGTA
TCGCGCCGAC GTCAATACGA ATACACCTAA TTTCTTGTTC CCGACTATCG CCTCTCGGGT
CATGGTCTCT CCCTAAGGCT ACCTTTTGAT ATCCCATTCC AAAACATATT CTTGCAGTAT
CCGGTAAATT CGCAAGCTGG AATGGTTGCC TTTGCGGCTG AAAGTGGGTA TTTTCTGGAC
TAGAACCGTT CGCTTCCGCC AAATCACGAA AAAAATGCCC TGGCGCCTGA TCATCTGCAA
TGTGGCTTTG GTATTCAAGA ATTTCAACGA CTAGCATGCA CCCATCGTTG ACATCGTGCC
AACATTCTTG ATGATCCGGA ACTTGACGAA CCTCGGAGAC ATCCA
 
Protein sequence
MQHISRQGVH DNVMGVYDVL QDEEYLLMFM PYCSSGDLFS FVQQAGRFPE PMARYWFKQI 
LEGVSHLQRM GVCHRDLSLE NILVDKYRTS IIIDLGMCLR VPYDGGNHVT DVSSGTIRRL
IKPLVPCGKP NYISPEVLKS HEAFDGFAVD LWASAVILFI MVVGLPPWEY ANNDDPRYRM
VIGGGLERML RSWDREISPL VADLLQKMLR EDPRHRLSLM EIKDHPWITV DESVDGMPYA
PEEEWRR