Gene PHATRDRAFT_48114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48114 
Symbol 
ID7203471 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011684 
Strand
Start bp241834 
End bp244829 
Gene Length2996 bp 
Protein Length213 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182496 
Protein GI219124408 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTGAAA TCGATATTAA AAAGATGAAA GGTAGGTACC GGCATTGCTT TTTCATTTGC 
GAGAATCGGA TAGTTTTGGT CGAACCCACA TGTACGCATT TGCGGATTGC AGTCCTCGAT
TTTTCAACCT GTGACACATA TTGACTCACA AAACAGCACT GTACGTTGCA CAGTTGCGGA
ATTAAGGACA GAGTTGGAAA GACGCGGTCT TCCCGTGTCG GGGAACAAAC CAGATCTTAT
CAATCGTTTA CAGGCGAGGT TGGACGAGGA GGAATTCGGT TTGATTGATG CGCCCGCTAC
CACCATCAAC ACTACCATGA CGAATTTAAA CGAGACGGGA GAAGCAACGG GCGACAACCC
CAGCGATTCA GCGACCGAGC GAGAAAAGAA GGGGCTTGTG CCAACAAAAA CCGAAGAAAT
GGGGGAAGAA TCCAAAACAG ACAAATTCAT CAGCGAGGCT CACGATGCAA ACAACGTGAA
AGTGTCGGCG CAGCCCACTT TTGACGAAAT TAAGCGTCGC CGCGCGAATC GTTTCGATAT
ACCGGTCGTC GAAACGAAAG TTCAGTCGCC GCAGAAGCGT GGCCGTGACA AAAAGAAAAG
TGCAAATCTA CCCATTCTAC AAAAGGGCAG CAAGCGCGCC AGGCAGCAAG CAAGTGAAAA
AGATGATGTA CTGTTGCCGA AAGAAGAAAT CGAAAAGCGA CTGAAACGGG CAGAAAAGTA
TGGGACTGGC AACGAAGAAA ACATTCTTAA ATTGAAGGCT ATGCTTCGCA AGTATAGATT
TTGACTGGAG CTAGGGAACA CCTTCAGATT GTACAACCTT GGCTTCGGTC TCGAAATTAA
GTTGTAGAAT CTACCCTGAT CAGGAAATGG TAAATAAAAT ATCTTCGAAG TTTCACAGTT
TCAAAGTGTC TTTGCTCCGA TACCAGCCAG GACTTGTCAT TCAACATTTG AGTCAAAGCC
ATTTCCTTTT TGGAAGTCTC AAAAATAGAC CGAGGGTACT CTGTATCGAG ATAAAACCAG
AGGATTCGAT CCGTTGAAGC TAAATCTGGA AGTTTTTTAG GTGAAAGCAG GCGAAAAGGA
AAAAAAGCAT AGCAGGATGC TTCTAAGCCT CGGATATCTT TCCAGTGGCT ATAAGAGACT
GAGACTCTCA CTTCGGAGTC TGGGCCCGTC GGTTTACCGA CTTCTTAAAC TTTAAATCAT
GCTCCGACTG GCATTTGTTT CTCATTGTAG GTCTATGATC CAAGCTTAAG GCAGCAACGG
ATAGGGCTTC AAGAAGACCT TTTTCAGATT GGTACTTTTT GCGGAGAAGT GGCCGAGAGG
AACGTCCGTC ACATCTTGAC GGAGTCTTAG ACCGATGCTG ATCATGCGTG CGTACGAACG
CACGCTCCGC TTTACTTTTA CCTGGCTCCC TACACACCAC TCGGGTTTCA CACAAGTTTT
GTTCTCCTGC GTGCTGTTCA AGTGGCGCAT TGGGAAGCCC GGCGCAAATT TTGTGTGGAG
CCTTGCCGTG CGCCTGGATG TCAGGAGGCT TGCAAATAGA TTCATCTGCC GCATAAGAAG
AACACCCTTT CAACACGATC TCGTCGTGCT TTTTCGGCGA ATAGGAAAGA TGTGACTTTG
ATTTCTTCGA CGTAGGATGA TGAGTCTCGT TTCTATCAAA TGTGGCGTTG CTTCTGACTC
CAAGGTTAGT CACCTGAACC ATTCTTGCAT GTCGGCTGTG GCTATCCGGT TGATGACGAG
AAGCAGCGGT TGCGCAGACC ATTGACTGCC TAGTGGAAAG GGACCGCTGT TTCATCGACG
TCACAGCCAA TTTCGAAGGA CGACGAGCGC GAATGGGGTT GTCTGGGACT GCAAGTATTT
CCGGTTTGGA ATATTGCCCA CTCTTGTTGC TCCGTCGAGG GCGCGATGTC AAATCTTTAG
ATTTCGTTGA CGAAGACGTC TCGCCCACGA GAAATGAGCT TGTTTGGGTA CCAGCCGTCG
AAATAATTTC ACTCACAAAG CGACGCTCGG ACTTCTGAGC ACGTACTTTT CTCAGTTTTG
CTTGCTTGAT TTCAACTGAG CTCTCCGATT TGTGTATTTT TTTCAATTGC GGACGCTTTA
TTACCTTGGT AGCGGTCGAA GATCTGCAGG GTAAAGATGG AACTTGATCC CGGATTTCTG
GGGCTAAAAG ATCATTCGAC GAGGATGCTC GAATATATTT TTTGAGCTGA CCGCTGTTTT
GAGCTTTCGC CGCGTGTTGC GATCTTCTTA TTTTCCCTTC TGGATTCAAC GATTTGACTT
CTGTTTTAGT GAGAAAAAGA GAAGTCCATA GCGATCGTGA GGAGATAGAT CCATTCGGAG
CAGACAGAAT AGACAATAGT GGCTGAGAAA GAGAACTTTG AGTAGGGCAG AGCACATGAC
TCGAGTTGAA TGAGTCATCG CGAACGGTGG AGGAAGATTC GAATTTCTTG ACATCGAGAT
TTGGATACAA GTTCGATGTA TGCGATGAGT CAATGGCAAT TAGTTTTCGC AACACGGTAC
GGCACGGTGG ATCCGCATGC TGTATCCCGT TAAAGTCCAT CTCGGGAGGC TCTGAATCAA
TGGAATTCGC TTCATCAAAC GTAATAGAGG AGTCCGATTG TGCAATGATA CTCGGTTTAG
TATTTTGACG TCTTTCCTTG AAAGTGTGTC CAGATTCGGC GATATCGTAC TCATCAAATC
CGAAAGGTTT TTCTTCGTTT CTCTCACTGC TAACAGTAAG GCTATCATTC ATGTACCAAG
AGGGTTCGCC CATACTCACG GCTACCACCA CATCCATCTT ATTTTGCTGT TCACTATGAA
TTCCGCGGAA ATTTTCTCGT TCTTCAAATT ATAGGGAGCC AAAAATTGGA TTCACTTTGG
ATGCAAGATT CTACGAATTC AGACCCAAAG AGAGACAGGA TGATCAATAT TTACACGTAC
AGTCCTAGTA TGAATCGACA TATTTCCTAC GATGAGACGA GCCAAAGATG CGCATT
 
Protein sequence
MGEIDIKKMK VAELRTELER RGLPVSGNKP DLINRLQARL DEEEFGLIDA PATTINTTMT 
NLNETGEATG DNPSDSATER EKKGLVPTKT EEMGEESKTD KFISEAHDAN NVKVSAQPTF
DEIKRRRANR FDIPVVETKV QSPQKRGRDK KKSANLPILQ KGSKRARQQA SEKDDVLLPK
EEIEKRLKRA EKYGTGNEEN ILKLKAMLRK YRF