Gene PHATRDRAFT_38409 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_38409 
Symbol 
ID7203439 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011684 
Strand
Start bp25255 
End bp26621 
Gene Length1367 bp 
Protein Length440 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182598 
Protein GI219124622 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.377002 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCAAA ACGGACCCAT CATCCTCATC ATGAACCAGT ATGCATATCT GGGAAAGGGT 
AGAACTATTC ATTCTAGTGC CCAGCTCGAG CACTACCACA ACCATGTTGA GGATCGCGCT
TGTACCGTGG GCGGAAACCA ACGTATCGTC ACATTAGACG ATTACATTAT CCCGCTGCAT
ATCAGGCAAG GCCTTCCTTA TATGGATATG CGCTATCCCA CCAATGAAGA GTTCGAGTCA
CTATCCCATG TTGTCATCAC CTCCGATGTA GATTGGGATC CATCCGTCTT AGACAACGAA
ATTGACCTCT CTGTGGAATG GTCCAGCAAC TTTCTTGATA TACCTGGTCA CCCCTATGTT
GAACCACGCT TCGATAACAA CGGCCAATAT CTTCACCGCC ACGTCGCCTC GTGTTCATCC
CTTCGTGAAG GTGCACTCGA CCGTTTAATA CAGTGTAAAC AGCATAATAT TGCACGCAAC
GAGCATGACT ATGAAGCGCT TCGTCCATGT TTCGGATGGA TTTCGGCTGA CACCGTCCGA
AAAACTCTCA TGGCCACAAC CCAGTACGCA CGAGAAGTAC ACAACGCCCC GTTGCGGAAG
CATTACAAGT CCCGTTTCCC AGCCTTAAAT GTACACCGGC GAAATGAATC CGTTGCCACT
GACACCATAT GGTCTGATAC CCCTGCCGTT GACAATGGTG CTAAATTCGC TCAACTCTTT
GTTGGACGTC GTTCCCTCGT TACGGACGTC TATCCTATGA AAACGGATAA AGAGTTTGTC
AATGCACTTG AAGACCACAT TCGTTATCGT GGTGCCATGG ACAAACTGAT CAGTGACCGT
GCACAGGTTG AAATCAGTAA GAAGGTCACC GATATTAGAC GCGCCTATAA TATCGATCAG
TGGCAGAGTG AGCCTAACCA TCAGCACCAA AATTTCGCTG AACGCCGTAT TGCAACCATC
GAAGCCAATA CTAATAATAT TCTCAACCAC ACTGGTGCCC CTGATTTCAC GTGGCTACTT
TGCGTCTCCT ACGTTTGCTA TGTGTTCAAC CATTTGGCAC ACGAATCTTT GAACAACCGC
ACACCCCTAG AAGTTCTTAC TGGTTCTACC CCTGATATCA GTGTTCTTTT ACAGTTCCAC
TTTTGGGAAC CGGTTTATTA TCGCCTTGAC GATGCGACAT TCCCTTCAGA TGGTACTGAA
CAACGAGGAC GTTTTGTGGG CATCGCGGAT TCCGTCGGGG ACGCACTTAT TTATAAGATC
CTCAACGACG GCACCAACAA AATTCTATAC CGATCTAGCG TTCGTTCTGC CAACATCCCA
GGAGCAACCA ACCTACGCCT TACACAGGAT GGGGAGAGTG GTCCTAA
 
Protein sequence
MSQNGPIILI MNQYAYLGKG RTIHSSAQLE HYHNHVEDRA CTVGGNQRIV TLDDYIIPLH 
IRQGLPYMDM RYPTNEEFES LSHVVITSDV DWDPSVLDNE IDLSVEWSSN FLDIPGHPYV
EPRFDNNGQY LHRHVASCSS LREGALDRLI QCKQHNIARN EHDYEALRPC FGWISADTVR
KTLMATTQYA REVHNAPLRK HYKSRFPALN VHRRNESVAT DTIWSDTPAV DNGAKFAQLF
VGRRSLVTDV YPMKTDKEFV NALEDHIRYR GAMDKLISDR AQVEISKKVT DIRRAYNIDQ
WQSEPNHQHQ NFAERRIATI EANTNNILNH TGAPDFTWLL CVSYVCYVFN HLAHESLNNR
TPLEVLTGST PDISVLLQFH FWEPVYYRLD DATFPSDGTE QRGRFVGIAD SVGDALIYKI
LNDGTNKILY RSSVRWGEWS