Gene PHATRDRAFT_49563 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49563 
Symbol 
ID7198229 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011691 
Strand
Start bp52044 
End bp55519 
Gene Length3476 bp 
Protein Length973 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184293 
Protein GI219128173 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATCTG CTCGCTCTCG AAATACCCGG GATGGCACCC GAAAGCAAGG TACGGCACGA 
AGAAGTGGCG GTACTACTAG TAGCAATGGC AATCAAGGCT TCGACGAGTT TGGATTTGGT
CAACCTGCCT TTCCGGATTC TGCGTTTGAC AACCACGGCT TTGAGATGCC GCAAACTCGA
ATTCAGCCAA CGAAGATTCG TTCCCGCCGT CGAGCATCTT TAGCTGCGGC GCCGAACATT
GACGTTGTGT CGGAAAACCC GTCAATAGGT TTTACCAATC AATTTCAATC TTCACAAGAC
GAACAGGTGT CTCGTGGTGG AGCCCGCTTG GCAAAGGCGG GACGCTCATC GCGCTCAATG
GACGGCATCG AATTCCCAAC TGCACGCAAG GACGTTTCTA GTCAAAATCG TCCTCGTCGT
TCGGGTCGCC GGGCTTCAAT GGCTACTTCT TCCAACCACA GCCTTTCCGC TTCCAATCAC
ACCAACCCGG AACTCGGTTA CGGAGACGCC ATTCCGTCTG TTGCTGCTAA CCATAGAAAA
GGAGACTCTA ATAGCGGAAT TTTGGACTTC GGCTTTGGTG GTGGTAAGAA TGCCGGTACA
GCCAATGCCG ACTACGGCTA CGGTGACACA ATGTCGTCGG GTTTTGGTAA TTTCGAGTCT
ATGCCATCCG CGCCTTCCAC CACACCCGAA TCTGAACGTC CGCGTCGCAG CGGACGACGC
TCCAGTATCA GTGGAGGTCT TGAAAGTCTA CGGTCTGACT TGCGCGGAGG CGACCTGAGT
GGTGCTCCGT CTAGTCGGGT GTTGGGTGGA AATTCTCGCG CCCAGAACAT TGTGTTGCCC
ATGGCCGGGC CGGAAAAAGT GGCCGGTGGC AATGTTCGTC GTGGACGTCG CGGATCCTTA
CTGGGTAGTG TTGGTAATGC AGTCGGAGCT ACCATGGGGG GATTCACTGG TGGAAATAAG
GACAAGGAAA AACTCGACGA CGATACCACC AAAAAGTCTA AGTCTTTTCT AAAGGATCGC
AAGGCTGAAG GTCGGCGAGG CACGACACGT CAACCATCGG CCGATGGCAA TATAATCTCT
TCCTATACCG GCGATCGCGA CCGACGCCGC AAGCCGGCAG CGTCGTCCAA GACCCTGGGC
AAAGAGAGCA ACGTGTCGTA CTCGGATCGT ATTTTAGCAC AGCGGTAAGA GGCAACATAA
AAACACAGCA ATTCAATAAT TTGGCGGTGT ACAGACTACC AATCTAAATG TTTAAAGCCT
AGCGGTATAG TCCGCTCGGC CAAGATATAG AAGGGCAGGG AATTGCAGCA AAGTAAAGGC
ATATTAGATT CAGTGTACGT GATGTACCGG GACCAGTGTG AGAGAATAAA GGTCCGAATG
TGACTCGCCC GAGATCGCGG AATCGCAGAA AAACCCGAGA CACTGTCAAT CCGTTTCTTC
GCGAAATCCT GGCCGCTTTC GCGCATTTAC ATTACATAGT TCGCACCATG TGGGGGAGAC
GAGGACTTTG TCGTATTTCT CTTCTAACGT TGCTATTATT ACTATTCGTT TCTAACAGTG
ACTGTAGTTG TGGAGGTGAA GGGAGCAGCA GTAGCGGTAG CGGTAGCAGT GACGGTAGTA
GTGCTTGTTG TCGCAGCAAA ATTTGGCCGG CGGCTCGATG TGAAACCTAC CGAACACTCG
AAATCGATGC TTCCTCCTCT ATGACATTGC GACGGCACGG CTTGCGAGGA CTGCATATCT
CCCCTACTCA AAGCGTAGGG GACGAAAGTA GCTTCGATGT GGACTGCCAT GGATATTGTC
AAGACGTTCA ATCGATACTG GACGCCGCGT ACGTTCGCTT TCTCAAGGCG CTCCGGCGAA
GCGTGTCTTC CACGCCTTTG GCGCATCACG ACCGTCGCGA AAATGAAAAG GTCGCTCAAC
ATGACAACGT ACAGGCTCTG TTGGGCATTC ACATTTCCAT TACTACGAAT GAGTCTGCAC
TCGTACACGA CGCGGACGAA CGATACCAAC TGGACGTCCC AGGGCCTACC GTCACTGAAA
ACGACGACGA CGACGATGGC AGCTACATTC ATCTCACTGC ACCCACCGTC TACGGCATTC
TGCACGCCTA CCAAAGCTTA CTGCAGCTGG TGACGTTTGT TGGTAGGGAC TCTCAAACAG
GCGCTTTCGT ATTCGCCATG CCGGACACAA CCCTCATTCG AATCCGTGAT GGACCCGTGT
ATCCCTACCG GGGACTCATG ATCGACACGG CCCGACATTT TTTGCCACTA CCGCTTATCT
TGCAAAACTT GGACGCCATG GAGGCCAGTA AACTGAACGT CTTGCACTGG CACGTGACTG
ATTCGCAGTC GTGGCCCTAC GTCAGTACTG CTTTTCCGGA GCTTAGTGCT CGGGGAGCCT
TTGGTCCTGA AGAAACCTAC ACGGCTACAG ATATTGCCCT CGTCGTGCGG GAAGCCGCCG
CACGGGGTAT TCGGGTGATT CCTGAATTCG ATTTGCCTGG ACACTCGTAA GCGATTGGAC
GCTCACATCC GGAATGGTTA ACACCCTGTG GGTCCAAGCC ACGGCCGCAA GAACCTTTGG
ATGCGACCAA TCCGGCCGTC TACGAATTCG TACACCGCCT CTACGACGAA TTGGCAATAC
TCTTTGCGCA CGAATCCTTT TTACACGTCG GAGGAGACGA AGTCAATTTA GATTGTTACC
ACAATAGCAC GACGGTCCAA AGATGGATGC GAAAACACAA TATGACACAG GAACTTGAGG
TTCTGAGCTA TTTTGAGCGT GATTTGCTTT CGTACGTCAC CGCTGTATTA AATCGTCGTC
CCATTGTGTG GCAGGAACTC TTCGATTCGG GATTGGGTCT TCCCAATCAG ACAATTGTCG
ATGTCTGGAA ATCGTGGGAA CCTTCGTCGC GATACAACGC CACTTTGCGG GGCCACGAAG
TTATTTTGTC CTCGTGCTGG TATCTCGATC ATTTGAACGA AGATTGGCAA AGCTTCTACG
CCTGTGATCC ACGGGAGTTC AACGGTACGA AAGAACAGAA GAACTTGATT CTGGGCGGTC
ACGCTTCCAT GTGGGGGGAA CGGGTGGATG CGACCAACTT TCTATCTCGT GTTTGGCCCC
GTGCCAGTGC TACGGCCGAA AAGCTGTGGA CAGGCAACTT AACAGCTGCG GCGGATTCGG
CGGCTTCTCG ATTGGCCGCC TTTCGCTGTC ATTTGGTCCG CAGAGGAATT CCGGCCAGTC
CGGTCGGTCC GGGAGCAAGT TGCGGCAGAC AACCAAATGG TTTTCCGGCT GTGATCGATA
GCTTTCATGA CGAGGAGTTG CAGGAAGGAA AGGTTACTTG AGCAGAGCTT TCTCTGGTTT
TACTGGCCAC CTAGCAATGC GCAGACACTA GATATCGTGC CGATTGAATT TCACGTCGCC
GAGTCGCTTC TTCCCTGGCG TGCTACTGCA ACCAACCATA AATGAGTTCA CTTCAT
 
Protein sequence
MESARSRNTR DGTRKQGTAR RSGGTTSSNG NQGFDEFGFG QPAFPDSAFD NHGFEMPQTR 
IQPTKIRSRR RASLAAAPNI DVVSENPSIG FTNQFQSSQD EQVSRGGARL AKAGRSSRSM
DGIEFPTARK DVSSQNRPRR SGRRASMATS SNHSLSASNH TNPELGYGDA IPSVAANHRK
GDSNSGILDF GFGGGKNAGT ANADYGYGDT MSSGFGNFES MPSAPSTTPE SERPRRSGRR
SSISGGLESL RSDLRGGDLS GAPSSRVLGG NSRAQNIVLP MAGPEKVAGG NVRRGRRGSL
LGSVGNAVGA TMGGFTGGNK DKEKLDDDTT KKSKSFLKDR KAEGRRGTTR QPSADGNIIS
SYTGDRDRRR KPAASSKTLG KESNVSYSDR ILAQRDCSCG GEGSSSSGSG SSDGSSACCR
SKIWPAARCE TYRTLEIDAS SSMTLRRHGL RGLHISPTQS VGDESSFDVD CHGYCQDVQS
ILDAAYVRFL KALRRSVSST PLAHHDRREN EKVAQHDNVQ ALLGIHISIT TNESALVHDA
DERYQLDVPG PTVTENDDDD DGSYIHLTAP TVYGILHAYQ SLLQLVTFVG RDSQTGAFVF
AMPDTTLIRI RDGPVYPYRG LMIDTARHFL PLPLILQNLD AMEASKLNVL HWHVTDSQSW
PYVSTAFPEL SARGAFGPEE TYTATDIALV VREAAARAIG RSHPEWLTPC GSKPRPQEPL
DATNPAVYEF VHRLYDELAI LFAHESFLHV GGDEVNLDCY HNSTTVQRWM RKHNMTQELE
VLSYFERDLL SYVTAVLNRR PIVWQELFDS GLGLPNQTIV DVWKSWEPSS RYNATLRGHE
VILSSCWYLD HLNEDWQSFY ACDPREFNGT KEQKNLILGG HASMWGERVD ATNFLSRVWP
RASATAEKLW TGNLTAAADS AASRLAAFRC HLVRRGIPAS PVGPGASCGR QPNGFPAVID
SFHDEELQEG KVT