Gene PHATRDRAFT_38038 details


Gene Information

Locus tag: PHATRDRAFT_38038
Symbol:
ID: 7202918
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 722174
End bp: 724435
Gene Length: 2262 bp
Protein Length: 753 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002181963
Protein GI: 219123296
COG category:
COG ID:
TIGRFAM ID:
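
The coordinates and lengths listed in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the Start/End coordinates are 1-based and inclusive and that the unspliced CDS ends with a stop codon. Both assumptions fit the listed values but are not stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming it ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the final codon is the stop, not a residue


length = gene_length(722174, 724435)
print(length, protein_length(length))  # 2262 753
```

Under these assumptions, 2262 bp corresponds to 754 codons, one of which is the stop codon, giving the 753 aa listed for the protein.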


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAGAAG GGGATTCCCG CCGAAAGATT GGTGCTCAGG TGACAGCGAA GGCCTGTCAT 
GTTGTCCATT TGAGTGAGTG TGCTCGGCGA TACGGTGCTT TGAGGACCAC CAAGGTCGTT
GTGGGGACTG TTGTGGAGGT CAACAATACC AGAAAGGCGC CAAACAACCG TGTATCAACC
TTCATTACTG CTGACTTTGA TATTGGTGGA GGATCAGTCA AGCGGAACAC TCTGAACATC
CGTAGCGTCA AACTCTTCAA ACCGGACCAG TCGACAGTAC CATCCAGTCC CGCAGCACCA
ATACCGGCAG TAGACAACGC AGACACAGAT TTGGCCGTTC CAGAGCAAGA GGAAGGAGAA
GCGGTCTTGC AGGAGACTTC TCCTGATGAA GAATTGGAAT TTCCAGCACA ACCGATGATG
GAAATTGGAA TAGCTGCGGG GGAACAGGTA GCAGGACCTA CCGCACAAGT AGCCACGCAG
GTTTGGGGTG TTGAAGACGC TTCCTTTGTA ATGGCTCATG AAACAAAGTG GTATGCTGAC
GAGCAAGCTA CATTGATTGA TATAAATGGC AGTGTCCAAA GTAAGCAGTT TGGCATCAAT
ACACCAATTG GCGACCTTCT TGGTCCAGAC TCTGACATTG ATGGAAAATA TTCGCGGCTG
CAATATTTTC TACTCATGTT TCCACCCGAC CAACTGAGCG CCATGTGTCA GCTAACAAAT
GTGCAGCTTG CCCAACAGAA CAAGCACTGC ATGTCAACAG GAGAGCTGCT TCGATTCTTT
GGCATTCTAA TTCTTGCGAC AAAATTTGAA TTTAGCAGTC GATCGCAATT GTGGTCCACA
ACCGCGCCGT CAAAATACAT TCCTGCCCCT GCATTCGGAA AAACAGGAAT GTCGCGGCAG
CGCTTTGATG ATCTTTGGCG AAATATCCGA TGGAGCAACC AGTGTCCTGA ACGGCCGGAA
GGTATGAGCT CCCATACGTT TCGGTGGCAA CTTGTCGACG ATTTTGTTGA AAGATACAAC
AATCATCGAG CCAATACTTT CAAACCATCT CATCTTATTT GTGTGGATGA ATCAATGTCG
CGATGGTATG GACAAGGGGG GGAATGGATA AATCATGGAC TCCCCAATTA TGTGGCTATT
GACCGAAAGC CCGAAAACGG TTGTGAGATT CAAAATGCCG CATGCGGCTG TTCGGGTATC
ATGCTTCGAT TGAAGGTTGT AAAGGGTAAG ACAGCAGCAG AAGATGATGG GGACTACAAT
GAACAGTTGC TGCATGGAAC AAAGATCCTC AAAGAGCTTG TCCTTCCTTG GTGGTGGACG
GATCGGATTG TTTGCGCTGA CTCGTATTTT TCATCTGTCG GTACAGCTAT GGAGTTGCAG
CGACATGGTT TGAGATTTAT TGGAGTTGTA AAAACAGCAA CAAAACAATA TCCGATGAGA
TACCTTTCGA CTTTAGAGTT GAACCAGAGA GGCGAACGGA GAGGGCTTGT GATGCGAGAT
GTTGATACAA ATTATAGCAC TCTGTTGGCT TTTGTGTGGA TGGACAGGGA CCGCCGATAT
TTTGTGTCGA GTGCTTCCAG TCTGGATGCA GGCAAGCCCT ACGTACGCTA TCGTTGGAGA
CAGATTGACC AATCTCCGGA TGCAGATCCA GAGAGGCTGG AAATTATCAT TCCACAGCCC
AAAGCAGCGG AATTATACTA TTCTGCATGT GGGATGATTG ACAGGCACAA TCAAAGTCGT
CAGGATACAC TGATGCTTGA ACGAAAGTTG GGTACAACAA ATTGGTCGAC AAGGGTTAAC
CTCTCAATAT TTGGAATGAT TGTTGTTGAC ACTTGGTTGG CCTACAGTCT GTGTACAGGA
ATAGGAAGAG CTAACGGGAG AGAAGAAAAG CAGAAAGACT TCTACACTGC CTTGGCTGAG
GAGCTAGTGG ATAACCAATA CGACAATGTT GGAAGTCGCA GAGTTTTCGT GGAGGCAAAT
TTGGACAATG ACAGCCCAGC ACTTTCAAGG ACTACGGGAG AACCAAGAAG TGGCCTGTAC
GCACACCTAA CACCAACCAA AAAAAGAAGA AAGAACAAAG ACGGTAGTTT TAGCAGCAAT
AGACTACAAG GACGATGCTT GGTGTGTTCC AAGAAGACAA CATATGTTTG CTCAGTGTGC
AAAGATGAAG AAACACCTCA CTCCAGAGAA CCGTGGGTTT GTTATACCAC CAAGGGGAAG
CTGTGCTATG CAAACCACAT GGCTACCTGT CACGGCGCCT AA
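
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before any computation. The following is a minimal Python sketch of that cleanup, plus a GC percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the record does not say how its own GC content figure was computed or rounded, so a recomputed value may not match it exactly.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, used as a small example.
first_line = "ATGTCAGAAG GGGATTCCCG CCGAAAGATT GGTGCTCAGG TGACAGCGAA GGCCTGTCAT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence text through `clean_sequence` first gives a string whose length can be compared against the listed gene length.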
 
Protein sequence
MSEGDSRRKI GAQVTAKACH VVHLSECARR YGALRTTKVV VGTVVEVNNT RKAPNNRVST 
FITADFDIGG GSVKRNTLNI RSVKLFKPDQ STVPSSPAAP IPAVDNADTD LAVPEQEEGE
AVLQETSPDE ELEFPAQPMM EIGIAAGEQV AGPTAQVATQ VWGVEDASFV MAHETKWYAD
EQATLIDING SVQSKQFGIN TPIGDLLGPD SDIDGKYSRL QYFLLMFPPD QLSAMCQLTN
VQLAQQNKHC MSTGELLRFF GILILATKFE FSSRSQLWST TAPSKYIPAP AFGKTGMSRQ
RFDDLWRNIR WSNQCPERPE GMSSHTFRWQ LVDDFVERYN NHRANTFKPS HLICVDESMS
RWYGQGGEWI NHGLPNYVAI DRKPENGCEI QNAACGCSGI MLRLKVVKGK TAAEDDGDYN
EQLLHGTKIL KELVLPWWWT DRIVCADSYF SSVGTAMELQ RHGLRFIGVV KTATKQYPMR
YLSTLELNQR GERRGLVMRD VDTNYSTLLA FVWMDRDRRY FVSSASSLDA GKPYVRYRWR
QIDQSPDADP ERLEIIIPQP KAAELYYSAC GMIDRHNQSR QDTLMLERKL GTTNWSTRVN
LSIFGMIVVD TWLAYSLCTG IGRANGREEK QKDFYTALAE ELVDNQYDNV GSRRVFVEAN
LDNDSPALSR TTGEPRSGLY AHLTPTKKRR KNKDGSFSSN RLQGRCLVCS KKTTYVCSVC
KDEETPHSRE PWVCYTTKGK LCYANHMATC HGA
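
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal Python sketch, assuming the standard genetic code. That is an assumption, because the Translation table field of this record is empty. The names `CODON_TABLE` and `translate` are illustrative, and unrecognised codons are reported as `X`.

```python
from itertools import product

# Standard genetic code (assumed), with codons ordered over the bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTCAGAAG GGGATTCCCG"))  # MSEGDS
```

The example input is the first 20 bases of the gene sequence, and the result matches the first six residues of the protein sequence. Translating the whole gene sequence the same way allows a full comparison.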