Gene PHATRDRAFT_47447 details


Gene Information

Locus tag: PHATRDRAFT_47447
Symbol:
ID: 7202569
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011681
Strand:
Start bp: 643840
End bp: 645736
Gene Length: 1897 bp
Protein Length: 613 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002181604
Protein GI: 219122547
COG category:
COG ID:
TIGRFAM ID:
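
The Gene Length field agrees with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. A minimal sketch of the check, with a hypothetical helper name:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (not stated in the record)."""
    return end_bp - start_bp + 1

# Values copied from the record above.
gene_length = span_length(643840, 645736)
print(gene_length)  # 1897

# 613 codons plus one stop codon account for 1842 of those bases.
coding_bases = 613 * 3 + 3
print(coding_bases)  # 1842
```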


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCTAG CCTTCCGTTG GGTCGCGGTG GGACTTGAAA TCGGTACCAG AATAAATCCC 
ATGCTCCTGC GTCTGCATAT GGTAAACAGC TCCCTGTACC TTCAGGCTAA ATGTGTTTCC
AGCACACCAC CATCGACGGT GAGCTCCATA ATCCGCGATC TGGTTTCCGC GAGAGCGAGG
CATGTGGTCG TCGGGACAGC GATGATCTAC CACAACATAC GAAACACTGC CGCCGTTGTA
CTGTCTCTAG GCTACACCAT GACGAGGCCT TTATATGTAA AGGAAAGCTC AGCGTCGCCC
AACCGGCGCT TAAAAAAATC TCGTGGAAGT TTGCGAAATC AGCCGTCTCT TCCGGAACAG
TTGGTCACTT GCTATGCACA CAATCGACGA CTATGGGCCG CCTTGATAAT GGGTCTCTTT
GTCATTTCGC AGTGGTCCTT CCAATCCTCT CCTTGGAATG TTGACGACGA CGACGAAAGA
ATGTCGGGCG TTCCGGAGCT AGCAGAAGAT TCCCGCCCCA CGAGTGGGCG CCAGACTTGG
CCCTACCACA AAGACTCCAG AGTCGAAAAT ATAGACAATG ATGGATTGGA GCGGCGGGCA
CGACAAGCGG AACAATCCGC AAGAAATTTA GAAACGTACG ATTGTACGGA AAGTCCCGTC
ATTCCCCAGT CAGTCTACCA ACCACAATCG GCTCTGCCCT TGCCTGTGGT TTTGTTAGGG
AGTCGGAATG CAACAGAAGA CGAGTATCAC GGGAGCTTGC TAGACGGCAT TTCTCGTTCT
CGTTATTTGT ACGTGTCTAC AATTGCTTTG TACGATCTTG GAGAACGAAC ATGGCAATCG
GACACTTGGC AAGTACCTTC AGCAAAAGTA GTACTTCCAA CTGTATATAT CGTGGACTGG
GCTGCCTTGG AGCGCGATTG TCACGCGCTG GAGCATATGA TGGCGGAAAC GCAAGTTCCA
TCGAATGCGT TTTTTGTTTA TTTCGACTAC TCCAGTAGTG CTCGGACCAA GGTTTGCCCT
TGCATTGAGG ATCATTTTAC CAAGGAACGG ATACGACTTG CCAAGCGAAG CCTGGTACAA
GAACGATACT GGAATCAGTC GAACGACTGG GTGGAAATCG GCGAGGTTCC TAGCAATATT
GGCAACACAA TTACAGGAGG ACCTGTTTTA CACGCGCCTC ACGCATTACG GGAGCCCTTT
GTTGACCTTG TAAAACGTCA AGTCGGCACA ACGCCATCCG CGAAGAAATT GGTTAATCGG
AATCGCAAGA CAGATGTTTC TTTTTTCTGG AGAAAAGGCG ACAACTCTCA CTACAGTTTC
TTGCGACGCC TCGTAGGTTA TGTGGTAATG GATATGGGTA AGAAATACAG GTGGTTGGTA
AAAATCCTAG GCGATGAAGA CGCGATCGAA TTCAATCTGG CTCAGCCCGA ATACGCTGAT
CAACTGTTGA ATAGCAAAAT TGTGGTCGTC GCACAGCGCG ACGAGTGGGA AGATCACTAT
CGCTTATTTG AATCAATGGC AAGTGGCGCT CTTGTCTTTA CGGACGTTAT GGTGGCGGCA
CCACGGAATT TGCGAAACCG CACGAACGTT GTTATCTACG ACGGTCCTGA GTCACTGCAA
AAATTGCTGC GCTACTACTT GAGTTCCAAG CAAGCCAAAA CAAGGCTTAC AATCGCGCGT
CGTGGATACG AATATGTCAT GGGCAAGCAC CGTTCATGGC ATCGCCTAGA AGCGCTCCTT
TTCGGAACGC AACTAACCCA GCCCAATCAG CCTGACATGG ATGCGCCTCT CCGAAAAAGG
GGAAAATCGA GAAAACAGTA TAGTGATAGA GCCACTACTT GATCGAAGCG AAATTTAGTT
ACTATGAAAC CAGGTAAGAC TACAAATTAT TAGAATT
 
Protein sequence
MNLAFRWVAV GLEIGTRINP MLLRLHMVNS SLYLQAKCVS STPPSTVSSI IRDLVSARAR 
HVVVGTAMIY HNIRNTAAVV LSLGYTMTRP LYVKESSASP NRRLKKSRGS LRNQPSLPEQ
LVTCYAHNRR LWAALIMGLF VISQWSFQSS PWNVDDDDER MSGVPELAED SRPTSGRQTW
PYHKDSRVEN IDNDGLERRA RQAEQSARNL ETYDCTESPV IPQSVYQPQS ALPLPVVLLG
SRNATEDEYH GSLLDGISRS RYLYVSTIAL YDLGERTWQS DTWQVPSAKV VLPTVYIVDW
AALERDCHAL EHMMAETQVP SNAFFVYFDY SSSARTKVCP CIEDHFTKER IRLAKRSLVQ
ERYWNQSNDW VEIGEVPSNI GNTITGGPVL HAPHALREPF VDLVKRQVGT TPSAKKLVNR
NRKTDVSFFW RKGDNSHYSF LRRLVGYVVM DMGKKYRWLV KILGDEDAIE FNLAQPEYAD
QLLNSKIVVV AQRDEWEDHY RLFESMASGA LVFTDVMVAA PRNLRNRTNV VIYDGPESLQ
KLLRYYLSSK QAKTRLTIAR RGYEYVMGKH RSWHRLEALL FGTQLTQPNQ PDMDAPLRKR
GKSRKQYSDR ATT
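
The gene and protein sequences can be checked against each other by translating the nucleotide sequence. The sketch below assumes the standard genetic code (NCBI translation table 1), because the record's Translation table field is empty. The helper name `translate` is hypothetical. It translates the first 60 bases of the gene sequence and the last four codons plus the stop codon, then compares them with the start and end of the listed protein.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1); assumed, not given by the record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(dna: str) -> str:
    """Translate DNA in frame 0; spaces are ignored and a trailing partial codon is dropped."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First line of the gene sequence, as listed in the record.
first_line = "ATGAACCTAG CCTTCCGTTG GGTCGCGGTG GGACTTGAAA TCGGTACCAG AATAAATCCC"
print(translate(first_line))  # MNLAFRWVAVGLEIGTRINP

# Bases 1828 to 1842 of the gene sequence: the last four codons and the stop codon.
tail = "AGA GCC ACT ACT TGA"
print(translate(tail))  # RATT*
```

Both results match the listed protein, which begins MNLAFRWVAV GLEIGTRINP and ends RATT. In the gene sequence as listed, the stop codon TGA occupies bases 1840 to 1842, so the remaining 55 bases lie downstream of the coding region.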