Gene PHATRDRAFT_39221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_39221 
Symbol 
ID7194924 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011686 
Strand
Start bp638895 
End bp640688 
Gene Length1794 bp 
Protein Length567 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183132 
Protein GI219125741 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGAAAA ACATTGTTCG AGAACATGCT TTTGATTTTG ACGCCCAAGT TGTATTTGCA 
AAAATCGTTA AACATTACAC GGCATCCACA GCCGCGAAAA TCAGCTCCGG CACTACTCTC
TCATACTTGA CCTCTGCAAA ATACGGCAGC TCCTGGACCG GCACTGCGGA AGGTTTATCT
TGCATTGGAA AACCATCTAC GCATCTACAA CAATACCGTG CCAACTACGG AACATTTGCC
ACCGCAACTC TGCCTCAGTT TGCTCGAGTC CTCTGTTCGC GACGTCTCCG AACTACGTCA
AGTCAACACT ACCGCGAATT TAGATTTAGC TAAAGGGGGG TCTCCTTTTA ACTATGAAAA
TTATCTAAGT CTACTTCTCG CTGCAGCAAC TTATACGACA AAGGGAACAA CCTTTCCAAC
TCTCGTAGCC CTAAGACCAA GCGTAGTGCC TTTGTTGCCG AAACCATCTT CCCTGACAAC
GACTACGGCG TTGATTACGA CATTGATTTA TCTCCGTCCA TTCTGTACGA AGCGAATGCT
CACAACCGCA GAGCAGGAGA TCAAAATCGA GACCGCCAGG GCAATGTCAA CCGTGAACAA
CCGTATATCC CCCGTGAGAC ATGGGATAAA CTGTCCGAGG ATGCAAAGGC GATTCTCCGA
GGCATGTCTT CTCCCGCGGA AGGTCAAGCC TCGCCTAACA GCAAGTCAAC ACCCGCATTT
CATGCCAATT CTCATTCTCT AGCCGACATG GGACACCCCT CCCCAACCAA CAACTCGTTG
AATGAAAGCG ACAACGAAAA ATTCCACGAT TGTGGAAACG ATTCGGAGTT ACTTGTCCAC
CTTACTGATT GTTCCAGTCC TATGGCAAAT GGAGACATTC GCAAAGTCCT TGCCTCTGCC
TCTTCCCACA AGAAAAATGA AAACAACTGC CTCCAGTCAA ACATGCTTGA GTACACCATT
TCACGGCACT CCATCATTGG AACCACATCT TCTCTCATTG ACAGAGGTGC CAATGGCGGA
CTCGCTGGAA GCGATGTTAA GGTTATCAAC AAAACCGGCC GTTCGGCAAG CATCACCGGT
ATCAACGACC ACACTCTGCC TGATTTAGAT ATTGTCACCA CTGCTGGTCT TGTTGAATCC
CAGAACGGAC CTATTATTGT CGTACTACAT CAATATGCCC ATCATGGAAA AGGAAAAACT
ATCCATTCTA GTGCACAACT AGAGTACTAC AAGAACACTG TCAAAGACCG ATCTTGTGTA
CTTGGAGGTA AACAACGCAT TGTAACTCTA GATGACTATG TTATTCTTTT ACAAGTTCGT
CAGGGACTTG CATACATGGA CATGCGCCCT CCTTCCGACG CAGAGTTTGA TACACTTCCC
CACGTTGTAC TTACTTCCCA TGTTGATTGG GATCCGTCCA TCATTGACAA TGAGATTGAC
CTTGCCACGG ATTGGTATGA CGCCGTTCAG GATCTCCCGA ACGACCCATA TGTCGAACCT
CGTTTCAATT CAACTGGGGA CTACTGGCAT AGACATGTTG CGAATTTTGA CATATTTTTG
TCATCTGAGA TCATTGCCCA TTCCACCGCT ATTGACAATA TACTCTCGTC CAATAAGCAC
AACATGGTTC GAAATGAACG CAATTACGAA GCCTTGCGCC CTTGTCTTGG CTGGGTCTCT
ACTGACACAG TCAAGAAAAC TATCCTGGCC ACCACGCAAT TTGCTCGAGA AGTATATAAT
GCGCCCATGC ATAAACATTT CAAGTCCCGC TTCCCAGCGC TTAATGTTCA TTGA
 
Protein sequence
MGKNIVREHA FDFDAQVVFA KIVKHYTAST AAKISSGTTL SYLTSAKYGS SWTGTAEGLS 
CIGKPSTHLQ QYRANYGTFA TATLPQFARV LCSRRLRTTN LYDKGNNLSN SRSPKTKRSA
FVAETIFPDN DYGVDYDIDL SPSILYEANA HNRRAGDQNR DRQGNVNREQ PYIPRETWDK
LSEDAKAILR GMSSPAEGQA SPNSKSTPAF HANSHSLADM GHPSPTNNSL NESDNEKFHD
CGNDSELLVH LTDCSSPMAN GDIRKVLASA SSHKKNENNC LQSNMLEYTI SRHSIIGTTS
SLIDRGANGG LAGSDVKVIN KTGRSASITG INDHTLPDLD IVTTAGLVES QNGPIIVVLH
QYAHHGKGKT IHSSAQLEYY KNTVKDRSCV LGGKQRIVTL DDYVILLQVR QGLAYMDMRP
PSDAEFDTLP HVVLTSHVDW DPSIIDNEID LATDWYDAVQ DLPNDPYVEP RFNSTGDYWH
RHVANFDIFL SSEIIAHSTA IDNILSSNKH NMVRNERNYE ALRPCLGWVS TDTVKKTILA
TTQFAREVYN APMHKHFKSR FPALNVH