Gene PHATRDRAFT_48932 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48932 
Symbol 
ID7195221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011688 
Strand
Start bp69123 
End bp71411 
Gene Length2289 bp 
Protein Length721 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183667 
Protein GI219126862 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTCAAATCAC TTTAACTGTG GCGAGCCTAG CTGAACCCTA AGAAAAGACG TGGCGTTTTC 
CTGTCAAAGT CGAAAACCTT GCGGACGTGA TAAACATTTA ACATAGTTGC GCTCGAAAAC
GAAATGCTCT CCTCCTTTAG GTTTCGCGTC CGTCGCATTC GAACTCCGTC TGGCCATTTT
TCCACAAGGA GATGTTTGTC GACGCACCGA AACGCTTCCA GACCCATTGT AATTGCCGGC
GGTGGTCCCT GTGGTCTGTT CATGTCCAGT TTGCTAGCGT CGTACAGAAT CCCTTCGATT
CTGTTGGAAG CCCAAACAGC AGAATCGCGG TTTCGCCATC CGCAAGCGCA TTTTTTGAAT
ACACGAACAA TGGAAATTCT GCGGCATTCA TTACCCGCGG TCTATAAACG TGTCCTCCAA
AGTATGGGTC CTGTATCAGA ATGGCAGCGG TTTCGCTTCA CCACGAGCAT GAGTGACGAC
CAGCCTCTGG CAGAAGTGCT TCACTTCGTG GATCGACCCT TACAAGCCGG ACGGGACTCC
AACGGCATTC TGCTAAAGGA CGAGAATCTT GCAACTGATG GTACAGACAA ACAATTATCA
TTCGATCGTG ACCTTTCGCC CTGTACTGTG GGACACTTGG CGCAACATAC GTTTTGTCGA
ATTCTCTACG ATTACGCAAA GCATGAAGCC CGGAGTGTTC CGGGAGCCGC GCTCCATTAC
GGAACGCGGA TTACAAATAT GGAGACTTCT CAAACTAACA ATGGATGCCT TGTAACAACT
AGCAACGGAG TAAAAATAAA GGCAGCTACA GTTGTGGCCG CCGATGGCTC AAATTCGTTG
ATTCGGCGTT TAGCCAACAT TTCTCAAAGC GGAGAACAAG GGATGCAGTA TCTCATGAAC
GTGCACGTTA AGCTACCACC CGATCAAGCG ACCGTCTTGC ACGCCAACAA CAACCATGCT
ATGCTCTATA CCGTGTTCTC GCCTCTGGTC GTCGCTATGA CGGTATGTCA CTCGGTAGGT
GAGTACATAA TTCAAATCCC CTACTTTCCT CCTTTTCAAA AGCCCGAAGA CGACTTTGGA
CCAGAAAAGT TATCTGCAAT CGTTCAAGCC GTATTTGGTT CCAAGATTTC CCATTTCGAA
ATTGTGTCGG CCAAAAGTTG GACAATGTCG GCGCTGATAG CTGATCGCTA CTACGGGGAT
AGTCTTTTGC TCGTGGGTGA CGCCGCGCAT GTGTTTCCGC CGGCCGGCGG CTTCGGTATG
AATACAGGAT TACAAGACGT ACACAATGCG GCTTGGAAGC TGGCGTGGCT TTACCATAAC
AGCAACCAAT TCAACTCCAA CGCCGGCCAT CTTGTCTTTT CCCAGATGGC CAAATCGTAC
CAAGCCGAGC GTCGGCCAAT TGCGCAGCAG AATGCGGCCC TTTCCGTTCG AAACTACCAG
CGTCTCTTGC AAGTCATGAA AGCGTGCTAT CTGAATGACC AACACCCAAC CCTGCTGCAG
ACAATACTGA AAAATTCCCC GCTCCCCCTC CAGGCTCAAC GTTCTCTCTT TTGCTCGTTG
CTGCAAACAG CCCTGCATCC TCTGTCGTGG CTGGCATCGG ACCCGCAGAG TAGGTACGCT
AGGCACATTC GATCCAACCT GCACCAAATA TTACAGAGAG GAGCCGGCTT GCCGTTGCTA
TTCCCAAAAC ACGAGTTGGG CCTTGACTAC GATCGAGATG AAAAGATTGA AGCGCCTGAG
GTTGACGAAT GGAAGAACGA TACACAACCA CACGACCCCT GCATCCAAGT GGGTCGACTC
GTGCCACATT TGCCAGTGCA AGTGGCGGAG GGATATTCAG CAGCAACTTA TCCGAACATA
CAATGGCTTT TAAACGATGT TCTGAGCACG TGCGACTTGC CGTCACAAGT TGCTCGCACT
CCACAGCCTA CATTTGCTTT GCTACGTATT GATCGGGGCG ATCCCGATGG CGTCACAATA
GAAGACTTGC GGAGTCTTGG TCGGGAAGTC AGCGATCGTA TTGGCTTACC TGTCGAGGTG
TTGACACTAT GTGTTGGTGA GAAAATGTCG CATAGTGATT GCGACGAAGG GCTTGTTTTC
TTCCCTGTCA ATTCAAGTAA GGGAGGATCG TGTGTGCGTG TGTTTGCCGA GCGTGGGTTG
ATTTGGATCC GACCTGATGG GCACGTCGCA TTCACTTCAT CGGGTATTGA GACATCGGGA
CTTTGTAGCT TGCGAGAAGA GTTGTGTGGT ACGGCGTTTT CATCAGCTTT TGGCAGAAAC
GAATCGTAA
 
Protein sequence
MLSSFRFRVR RIRTPSGHFS TRRCLSTHRN ASRPIVIAGG GPCGLFMSSL LASYRIPSIL 
LEAQTAESRF RHPQAHFLNT RTMEILRHSL PAVYKRVLQS MGPVSEWQRF RFTTSMSDDQ
PLAEVLHFVD RPLQAGRDSN GILLKDENLA TDGTDKQLSF DRDLSPCTVG HLAQHTFCRI
LYDYAKHEAR SVPGAALHYG TRITNMETSQ TNNGCLVTTS NGVKIKAATV VAADGSNSLI
RRLANISQSG EQGMQYLMNV HVKLPPDQAT VLHANNNHAM LYTVFSPLVV AMTVCHSVGE
YIIQIPYFPP FQKPEDDFGP EKLSAIVQAV FGSKISHFEI VSAKSWTMSA LIADRYYGDS
LLLVGDAAHV FPPAGGFGMN TGLQDVHNAA WKLAWLYHNS NQFNSNAGHL VFSQMAKSYQ
AERRPIAQQN AALSVRNYQR LLQVMKACYL NDQHPTLLQT ILKNSPLPLQ AQRSLFCSLL
QTALHPLSWL ASDPQSRYAR HIRSNLHQIL QRGAGLPLLF PKHELGLDYD RDEKIEAPEV
DEWKNDTQPH DPCIQVGRLV PHLPVQVAEG YSAATYPNIQ WLLNDVLSTC DLPSQVARTP
QPTFALLRID RGDPDGVTIE DLRSLGREVS DRIGLPVEVL TLCVGEKMSH SDCDEGLVFF
PVNSSKGGSC VRVFAERGLI WIRPDGHVAF TSSGIETSGL CSLREELCGT AFSSAFGRNE
S