Gene PHATRDRAFT_48106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48106 
Symbol 
ID7203271 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011684 
Strand
Start bp220225 
End bp223236 
Gene Length3012 bp 
Protein Length419 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182640 
Protein GI219124709 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTTGAGAGTG GCCAACATCA GTGGCAAAGT CTATACCCAG GCGTTCGATT TCTGAAAGAT 
ACATTTGTCG ATTCGCCTGA GCGAGAGAAT GGATTATCTG GGCTCTTTCA TCTGCTGCTG
CATTCCCTCG AAAAGCAGCT GTGAGGAAGC GCCATCCGAA CCGGAGAAAG CTCAGTCTCT
CCTCATGTGT CGCAGTTGGA CCCAAACAGC GAAATACATG AAGGGCAAAG TAGGGCGGAG
CGACGCATAA AGTATCTATA CGTGGTCTCT CCGTTTCTCC ACTCATTGCG TTTCCCTTCT
GTGTCAATAA ACCCTTCAAA TGTTCCAGTT GCATCGTGTA CCACTCGCGG GAAGTGTCCG
AAAGGACTTT CCAGAGAACT GATACCCTCG AAAATATGTG CATTGAAGCT GCGGGTACAA
TGCTGTTGGC GTTTCTGGCC GTTGCTGCGA CCCACTTAGG CTGCACTCGC GAGTCTTGGG
GAATAGACCT ACCCATGGGG TCTCCTTCCA GAACAGTTAT TTCGCAATTG TTAATTGAAC
GTAGGCGTTT GGTTAACGCG TGTGCAACAG ATACACCGGC TATCCCTCCA CCTATGATGC
CGACTTTCAC TTTGGGATCT TTTTTGAAAT TTCCAACACA GGTCGTTTGA GATTCCGACT
GCTCCCCATG CGAAGTAAAA TCCTTTTTGA GACTGGGGCT GTTCAAGTAA ACACAAGAAG
CTGGTAGTTT GGCAGTGCAC ATCGGTTGTC TTATTCTTAG GCCAAACGGA TCGGTAAGTG
GCAAGAAGGC CACCTGCGAC AGTACACTCA AATAATGGAA GCTTTTCATG ACGTTGCAAT
AGTGCTCAGA GTTCATTTTG CTCTGAATGT CGGTTGCATG CTCGGCTTAG GCACCTCGGT
GGCATGTGCT CTATAAGAAA AGCTCCTTCG TCTGCCACTC AAATCCTATC AACCGTGAAG
AATACCTTTC TGGGTAGTTC CTTGTTGCGA TCACAGAGTG GATTCAGAAG AACTCGACCG
ACGATTTTTA CCTGTTGTTG ACGCATCAAG AGCTCAGTCC TACGCATCAT GTGACTGAGT
AGGGCATGGT AGACGATACA AACAAATCAA TCGTGACCAT CAAATGATTA TTACATGTGT
TCGCATTTTA CATTTGAGTT CGCATCCTGG AAAGAAAGAG AAGTCCATCG CGTGAACCTT
GACTGTGAGT TCCCAATTGA TTCTGTTGCC GCCATCGACG TCATCAATAT TTTGCTTCCA
TAATCTTTGG CAATTGACTC AAAAGCTGCA GAACTCTGTC AGTCAACATT TCCTCGAGGA
TCGTCATAGG CGACACCTAT TGTACAAGCG AGATCTCACC AGAGGCTTCT CAACTACAGA
ATAACTGTAA AACTGCGACT TCGAAATAAA AAGAATACAA AACCGGCCGT CTTCTTACAC
CAATGGTCTC TCCTTTGATT GAAAGACTTA CAACCTGCGT TGTGGGTGGA GGGAACTCGG
CCCACGTTCT GGTCCCATTC CTATCGGAAG CTAGACACTC AGTAAACTTA CTGACCCGCC
GCCCTCAGGA CTGGAATCAC GATTCCATAA CTTGTCAGCT AACAGACGGA ACTACGGGTC
AAGTAGCTGC GACTCATGTG GGCATGCTGG CCGCTTGTTC CGCAAATCCG GCCGATGTCG
TTCCCAACGC CGACATTGTC ATTCTCTGTA TGCCGGTGCA TTCCTACCGA GAGGCTTTGG
ACCGCATCGC CCCTTATTTG AGCCGCAGCA AATCTCACGT TTATGTGGGA ACGGTATGTT
CATCAAGGGT AACTAGGCGA TTTCCTGATC ACTTATATTC ACACCTTCCT CGTCCTTCCA
GATGTACGGC CAAGCTGGAT TCAATTGGAT GGTCCATGCC ATGGAACGAG AGTTCGGCTT
GACTAATATT GTCGCATTTG CCTGTGGAAG TATCCCCTGG GTTTGTCGGT ACGTCAATTT
GTTTATGCGG CCAGCGATGG CTCTTACATC CAAAGATCTC CTCGCCTCTC AACATAATCT
CTTCGTTGAC AGTACCGTGA AGTATGGAGA GCTGGTGGCC AACTATGGAG GGAAACACGT
GAATGTGGCA GCCGTTACGC CCCACAGCCA ATTCGACAAG TTGAACCGTG TCCTTTTACA
GAACCTGAGC GTAAGGCCGC TCGGTATGGG TGCATTCCGG CAAGCCGAAA GTTTTCTCTC
CCTCACACTG TCCGTCGACA ATCAAATCAT TCATCCCGCC CGGTGCTACG GCTTATGGAA
AAAGTATGGA GGATACTGGA AAGACGAGGC CCACGTACCG TACTTTTATC GTGATTTCGA
CCACACATCC GAGACCATTC TGCAAGCGCT AGATCGCGAC TATGCTGCCG TTAGGAGTGC
TGTTCGGAGG AGCTTCCCGT CGAAACAATT TCCGTACATG TTAGACTATT TCTCGTTGGA
AAAGCTAAAT CACAATTCGT CCCATGCCGG AATCTTGGCA ACCTTTCGGG ACAGTCCACA
ATTGGTGTGC ATCAAGACAC CAACCGTTCC CAGTGGCTCT GGCTCACAGA ATCAGTCAGC
AGCGAGAGTT TTGGATACGA ATTGTCGCTT CTTTACTGAC GATATACCAT ACGGCCTGCT
CGTGGCGAAA CGTTTAGCGG AGCTGTTGGA GCAACCGGTA CCTATGATTG ATGAAGTATT
GTTGTGGGCC CAAACCTTGC GCGGAGAACA TTTTGTGCAC GAGAAGGATG GGAGCGTGAA
CTTGGAATTT TGCTTGCAGA GACAAGGCAA GCTCGCCGTT TGTGGAATCC CCGAAACCTA
CGGCATTACA AAAATTGAAG ACATGTTGGA TTGATGATAT ATCTGCTTTG AGAACGAAAG
CGTCTGCTGG AGCCCTCTTT GTGTTCTCTA CCAGTAACTA TTCTCGAGTA TTGCACGTTT
CACCGGGTAG TGAGTTTGTT CTTTGAGGCT TGGTCTACGG AAAGTACAAA GTCCAACATG
ATATAGGACC TA
 
Protein sequence
MVSPLIERLT TCVVGGGNSA HVLVPFLSEA RHSVNLLTRR PQDWNHDSIT CQLTDGTTGQ 
VAATHVGMLA ACSANPADVV PNADIVILCM PVHSYREALD RIAPYLSRSK SHVYVGTMYG
QAGFNWMVHA MEREFGLTNI VAFACGSIPW VCRTVKYGEL VANYGGKHVN VAAVTPHSQF
DKLNRVLLQN LSVRPLGMGA FRQAESFLSL TLSVDNQIIH PARCYGLWKK YGGYWKDEAH
VPYFYRDFDH TSETILQALD RDYAAVRSAV RRSFPSKQFP YMLDYFSLEK LNHNSSHAGI
LATFRDSPQL VCIKTPTVPS GSGSQNQSAA RVLDTNCRFF TDDIPYGLLV AKRLAELLEQ
PVPMIDEVLL WAQTLRGEHF VHEKDGSVNL EFCLQRQGKL AVCGIPETYG ITKIEDMLD