Gene PHATRDRAFT_48672 details


Gene Information

Locus tag: PHATRDRAFT_48672
Symbol:
ID: 7194861
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011686
Strand:
Start bp: 551027
End bp: 554924
Gene Length: 3898 bp
Protein Length: 1123 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002183242
Protein GI: 219125971
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATGCT GCACATCCCA ATCCGCAAAT AGAAAAAGAG TATCGACGTG TTGGCTGTCG 
CCTTTCAAGT CGTCGCTTTT ACTTTTGATA TGGGCTTTGA CTGTATGTCT TTCGCTCGGC
TTTTCCTCCC GTCCGTTACT GTACAGTCCG CGTTTTTTCG CTTCTTCTAT TGTGGTGCAA
AAAGAACAGC GCATTACATC AACAAAACAG GCTTCAGACG ACGGTATCGA CGAAACCGCC
AAATCTCTTT CTAAACCAGA TTCAGCTTCG AAAAGAGTAT CAGCCACAAC AGAAAATAAG
CTTATTAGAA ATGCTCCCAA TCGACCTTCG AATGGAGCAT CGAAAAGACC CCTTCGGCGA
AACACCCCTG CTGGTCGTCG TGGCAGTCCG GGCAGCAGTA TCTTGCAAAA CTCGAAACGA
CTGAATCAAC TTCTAGTAGC CTGCGAGAGC GCTTCCGAGG TTTTGACACT ATTGCAAAAT
ACAAAAGGTT CCTTGACACA AAAGGCCAGC GGTGGTACAA TGAACAGTGT AAATTTTTCC
ACTTCGATCC ATCGTCTTTG TCGACATTCG CTTAACCAAC GCGATACCCG TGCAGCAACG
CTAGCCGACC CCCGGTTCGC CTTGTTGCTA GCGTCGACGG CCGAAGCCAT GGTAACTATG
CCATTCCAAT CACGTGAATT GTCGAACATT GGTTGGGCCT TGGCGAAACT GAAGATTGTA
CCTCCATTGA CGGCCATGCC TTTTGAACAA TCCGACGACG AGGCCCTTAA AGCGGCCGCT
CAAACAGTCC GTGACGGCGT TTTCAAAGCA GCCAAAGAGC GGCAAGAATC GGGAACACCC
TCCAAGGCAT GGATTACTGC CCTTTCACAA CTGGCGGGTC AAATTTTGGA TCGCATATCG
CAAAATGTGG TCTCGACACA AACCGACGGC TTTCGACTCC AAGAATGGGC AAATTTGATG
TGGGCTTGGG CCACAGCAGA ACGAGCTGAT CCGGTAGCCT TTGGAGTGGC TGTGGACAAG
ATGATTGATC AACAGCAAGA GGCGGATCGG ACGGGTGAAC CTAATCTTCG ACCGCAGGAA
TGGACCAACT CTGTTTGGGC GTTTGCCACA GCACAGGTTT ACGGAAAACA CGAGAAGTTG
TTGATATTCG TCGCGGAACT TATGGAACGA GAGTACGCAT TTGTGCAGAT GTTCAAACCT
CAAGAATTAA GCAATACCGT TTGGGGAGTG GCAACCTTGC TCTCGAATAA GGAAGGAGCA
TTAACAGATG CGGAACAAGA AGCGGCACTA AGCATTGTTC GAATAGTATC GAAAGCTTTG
CTGAAGCGAT CAAACGAGTT CAAAACTCAA GAACTTAGCA ACACTTTATG GGCCTTTGCC
ACTCTGGGCT TCGGCTTGAA GTCATCGGGA GAGCAGTCAT TGAACAACTA TGTCGTTTTA
GCAAGCAATC AATTCGAAGA AGACAGAGAG CTCATGCAAC AGGCTGTTGA AGCTGTAGTA
CTGGCAGCCT ACCCTCAACT CGACCGATTT CGCTCTCAGG AGCTGAACAA TCTTGCTTGG
GCTCTCGCTC GTTTAGTGGA TCACAAATCG GCTCTTGTCG AAAATATTTT GAGAGGTATA
GGAATGCAGC TCTGCGATCG AAAGCGATTT GTGACACCGC AAGATATTGG TGCCACTATC
TGGAGCTTGG CTACTTTAGA ATTTTTTGAT GAAGAGATCT ATCGAGGCAT TGCATTTCGT
CTCACTCCTG ACAAAGGGGG CAGTTGTAAA CCTCAAGAGT TGTCAAACAT AGTGTGGGCG
ATTGCGACTG CCGAGGTCCA AGTGAAAGAT CGGGACGCTT TTGACACGAC GTTGGTTCCA
GAATCGAAGC GCCAGCCCGT GCGTGACCCA ATAACCCGCA GTTTCGCCAT TGCGGCAACG
GAGCTCATGC GAAGGCCTTC TCAATTTAAA TCTCAAGAAA TTAAAGACAT TTTGTGGGCA
TTTTCAAAAA TTGGTATTCG CCACCCGAGC TTGTTCAAAA GTGTTTCCGA GCATCTTGTG
GGGATAATCG GACCAGGAAA GCCTCGTGGG TTGACTGAAT TCTCACCTCA GGGGTTAGGA
AATACAGCAT GGGCATTCGC AAGACAAGCG CAACTTAGTG AAGAAGCAGC CAATCGCCTT
GGTGGTGCTT CACTGTTGCC TTCGAGCAAC GGTCGCCTTG CAATTTACAC AGCTTGCTAT
TTTGATATTG GCGAGGAACT GATTCACCGA TTGTTTGCAG CTATCGCTGA GGCAGGCATC
ACTAAGCATG TCAATTTGAC TAGTTTTAAA CCCCAAGATT TGTCGAACAC AGCATGGACA
TTTGCAGTGC TTGGTTTACG ACATACAGCT TTTATGGAAG TCGCAATGCA CGAACTTGAG
CGGAGATTAT CCCTGTTTCT AAAGGGAGAG CGGACGTCCA TTACGACCTT TAAAGGCCAA
GAATTGGCAA ACTTACTGTG GGCGCTAGCA ACGCTGAACA TTCGAGTCGA AAACTCTCTT
GAGATAGTAA CTCCGTATCT TCAAGAGGTT TGCTTTGAAG GCAGGACTGG AATGCCAGTA
CAAGCGATAG CCCAAATTTT CAAACGCCAA GAACTTGCCA ATGTAGCTTG GAGCTGTGCT
GTCTTTGGCA AGTATCCAAC GGCTTTAATG CAACTGTTAT ATGCTGGCTT GATTGGACTT
GATAAAGAGT GTGATGCCGA GAAATTGTCA AACGTGTACG GAGACAAAGG TCTGCAATCG
CAGGCATTGA TGAGTTTGAT CTATGTTCAG GCTTCTATGG ATCGCGCCGG CAAAAGTACG
CTGGGGCTTC CGCCAAACTT TCCTGACGCC TGGCGACAGT CTACTCCCTC CGAGGATGGT
CAACGCATGA CAGAAACGAA CATAGAACTT TCTCTGAGTA CAAGCAAAAT CCAAAGAGAC
GTTTCCGCTG CTTTCAATCG CATCGGATTC AAGCACATAG AAGAGCACAC TATTTCCATG
CAGGAAATGG TAGTCGAATA TGGGGTAAAT TTTGCTCCAC AACAACTTGA CATTTTGTCA
ATTGATATTG CGAATGTACC AGAAAAGATT GCTATTGAAG TTGATGGACC TGCCCATTTC
ATCAACCTTA TCGACAACGT TGACGAAAAC GACTACGGTT CTACGAAGGC GCCCAATGGG
AAACTAGAGT ACCAGTTTCA ATGGACCGGT GACCGCCAGA TGATGAATGG CTCTACAAGC
CTCAAGCATC GCCTTCTCGA ATCGCTCGGC TGGAGAGTAA TACATATTCC GTTTTGGGAA
TGGTACCAAA TGGGGAGTGA CGAGGAGCAA GGCGAGTACT GTCGAGACGC TCTCGATACC
CTTGGAGAAT AGCATGCGCC GACGAGCAAC GCGATTTCGC TGTATTCCTT AGTACCGCTA
ATATATTCCA TAGACAACCA ATAGCGACTG TTCTATAGAG TGTAGAAAGG AGATGTAAAC
CAGTCTTTTA GAGATCAAAC ACAGAAGCGC TGCATCTCTC TCCGCTACTT ATCAGCTTTA
CGTTTCAAAA ACACCAACGC GGACGAACGG AGAATGATAA AAACTGCCAA TAAGGCCAAC
CAGTACCACC AAACGTCATC CAGGTCAACG CTGGCATTGT CTAACACGCT GTCGCAATTC
TGGTGGGCCT GATCCGAACC ACAGTCACGG TCGAATTCGC CAGCCAATTC CAATTTCACA
GCGTACGTCA AAGGCATGAC GTATTGAAGC CAACGCAGCC AAACTTGAAT AAGTGACTGC
GCAATGAAGA AACCCGAAAA CAAAATTTGT GGCACAAAAG TCATTGGAAG AAATTCAACA
GCCAGTTTTG GATCTTCGAC GCTGGAACCA AGAAGCATTG ACATGGCAGT ACCGGACA
 
Protein sequence
MKCCTSQSAN RKRVSTCWLS PFKSSLLLLI WALTVCLSLG FSSRPLLYSP RFFASSIVVQ 
KEQRITSTKQ ASDDGIDETA KSLSKPDSAS KRVSATTENK LIRNAPNRPS NGASKRPLRR
NTPAGRRGSP GSSILQNSKR LNQLLVACES ASEVLTLLQN TKGSLTQKAS GGTMNSVNFS
TSIHRLCRHS LNQRDTRAAT LADPRFALLL ASTAEAMVTM PFQSRELSNI GWALAKLKIV
PPLTAMPFEQ SDDEALKAAA QTVRDGVFKA AKERQESGTP SKAWITALSQ LAGQILDRIS
QNVVSTQTDG FRLQEWANLM WAWATAERAD PVAFGVAVDK MIDQQQEADR TGEPNLRPQE
WTNSVWAFAT AQVYGKHEKL LIFVAELMER EYAFVQMFKP QELSNTVWGV ATLLSNKEGA
LTDAEQEAAL SIVRIVSKAL LKRSNEFKTQ ELSNTLWAFA TLGFGLKSSG EQSLNNYVVL
ASNQFEEDRE LMQQAVEAVV LAAYPQLDRF RSQELNNLAW ALARLVDHKS ALVENILRGI
GMQLCDRKRF VTPQDIGATI WSLATLEFFD EEIYRGIAFR LTPDKGGSCK PQELSNIVWA
IATAEVQVKD RDAFDTTLVP ESKRQPVRDP ITRSFAIAAT ELMRRPSQFK SQEIKDILWA
FSKIGIRHPS LFKSVSEHLV GIIGPGKPRG LTEFSPQGLG NTAWAFARQA QLSEEAANRL
GGASLLPSSN GRLAIYTACY FDIGEELIHR LFAAIAEAGI TKHVNLTSFK PQDLSNTAWT
FAVLGLRHTA FMEVAMHELE RRLSLFLKGE RTSITTFKGQ ELANLLWALA TLNIRVENSL
EIVTPYLQEV CFEGRTGMPV QAIAQIFKRQ ELANVAWSCA VFGKYPTALM QLLYAGLIGL
DKECDAEKLS NVYGDKGLQS QALMSLIYVQ ASMDRAGKST LGLPPNFPDA WRQSTPSEDG
QRMTETNIEL SLSTSKIQRD VSAAFNRIGF KHIEEHTISM QEMVVEYGVN FAPQQLDILS
IDIANVPEKI AIEVDGPAHF INLIDNVDEN DYGSTKAPNG KLEYQFQWTG DRQMMNGSTS
LKHRLLESLG WRVIHIPFWE WYQMGSDEEQ GEYCRDALDT LGE