Gene PHATRDRAFT_37861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37861 
Symbol 
ID7202659 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp247439 
End bp248560 
Gene Length1122 bp 
Protein Length373 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182035 
Protein GI219123445 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.095088 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAGGAC TGAATGGATA TGACCAAAAG GCTCGTAGCG TGCAAGAAAT TGGCTGCCGT 
GTACTGTGGA CTTTGGCACT GAATGCCGAT ACCACCAAGA CCTCCATCGG AAAACAAGGG
GGAATTAGAG TGATATTGGC TGCGATGCAG CGTCACAACA GAGTCAATCA GAGCGACGCA
GCAGTGCAAG CATCTTGCCT CGGGGCGCTC TGGAGCCTCT CTTTACTCGA GAGTAACGCA
ATGTGGATTG CTTTGCGAGG TGGAATCGAT CTTATCATTT CCGCAATGCT CAGGCACAAT
TCTGGGACTG ATCTAGACGG CGAGCTTCAA AGAGTCGGCT GTCTCCTTTT GACTAGTCTA
TCCAGAGAGG GTCTCACCAA AAGCGCTGGC ATCCGTACCA TGCTAATGAG ACAAGGCGGT
ATAAGCGTAC TGCTTAGGGC TATGCGTCGA CACAATTCGG GTAGCCATCT TTCGGCTATG
TTGCAACAGG CCGGTTGCAC CGCAATCGCT AACCTTGCTA AAGATAGCAA AAGCCGTCAA
CTCTGTTTAG CAGAGGATGG AGGTATCCAG GTAGTCTTAG AAGCGTTGCA GAAACACAGT
GGAGCCGAGT GTTTGCTGGT ACAGCGCGAA GGTTGCAAAG CGCTGGCTCA TATTGCACAG
AACATTTACA ACTCAATCTC GATTGGCAAA CAAGGAGGAG TCATGGCTGT TCTTGCAGTC
ATGCGAAAAT TTGGCTCTGC TTCCGATTCT GATGTCAGCC TCCAAGAATC CGCTTGTCTT
GCGCTTTCAA ATTTAGCGCA AACCTACGAA AACAGAGCTT CCATCATGGA GTCCGACGGT
TTAGATCTTG TTCTCGCAGC GATGGAAGCA GGAAGGAATC GTTCTATCCT CGGTTTAGAC
GCAGACTTGC AGTTGGCAGC TTGTGGTGTT TTGTCGCGAT TAGCAAAAGA CTCGGAAAAT
GCCAATGTCA TCGCTGAACG TGGAGGGATT GAGGTCGTCT TACTCGTATT GAGCAAATAC
AAAGCAAGCA ATTTCGTTAT CCGAGACTGC GGCCGTGCCG TCTTGAAAGA GCTTGCGAAG
AAATGCGACG ATGAAATTGT TATCAGATCA TGCCGGGCAT GA
 
Protein sequence
MRGLNGYDQK ARSVQEIGCR VLWTLALNAD TTKTSIGKQG GIRVILAAMQ RHNRVNQSDA 
AVQASCLGAL WSLSLLESNA MWIALRGGID LIISAMLRHN SGTDLDGELQ RVGCLLLTSL
SREGLTKSAG IRTMLMRQGG ISVLLRAMRR HNSGSHLSAM LQQAGCTAIA NLAKDSKSRQ
LCLAEDGGIQ VVLEALQKHS GAECLLVQRE GCKALAHIAQ NIYNSISIGK QGGVMAVLAV
MRKFGSASDS DVSLQESACL ALSNLAQTYE NRASIMESDG LDLVLAAMEA GRNRSILGLD
ADLQLAACGV LSRLAKDSEN ANVIAERGGI EVVLLVLSKY KASNFVIRDC GRAVLKELAK
KCDDEIVIRS CRA