Gene PHATRDRAFT_48678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48678 
Symbol 
ID7194911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011686 
Strand
Start bp583732 
End bp586152 
Gene Length2421 bp 
Protein Length806 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183122 
Protein GI219125720 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.26048 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGTGT TGGAAACAGC AAATCATTGG GCCTCCCATC CTTCCGACTC GGAGAAGGAT 
CCGAGCAAAC CAGAAGAGAA GATGAGAGCC GCCTGCGCCG AAGGAAACGG GATTGCTGAG
AAGGAAACAA ATGCGGTAAA TGCCATTCGA TTGGCTACAT TTGCAGTCCT TGTGCTGGTT
GCCCTCACAG TCTCTCTTTT GGTATACTTT CATACTAAAG ATACTGAAGA AGGTGAATTT
GCGGCTCAAT TTATTTCGCA TGGGGGGAAA GTCGTCGATT CATTCCGCGC GAATGCTGAG
AGTCGGCTCG CTGCATTGAG TAGTTTTTCT GCGTCTGTCA CGTCCTATGC GTTGCACTCG
AACCAAAGCT TTCCATTCGT CACCCTACCA GATTTCGAGC GCAGAGCAGC CTTTACGTTG
CAGCTGGCGC AGGTGCTTTC AATAGCCGTA AACGCAATTG TTAGCCAGGA CAATCGTGCC
GAATGGGAAG CGTACTCAGT CCTAAACCAA GGATGGCTCG CTGAAGGTCT TTCACTCCAA
CAAGCGGTCG TGGACGAGGA TGAAAACGAA TCAGTTCAGC AATTTCAAAA TCAAACATCT
TCGGGTGTTC TTGGCGAAGA TGTTACAAGC AGCCTAGACA TTGTCCCGTT CATTTTCAGC
CTAGAAGAAG GCGGAACTAC ATCAGCGTAC GAGACAGGTC CTGGTCCGTT CGTTCCAATT
TGGCAGATAG CCCCAGCGAT TCCTCTTCCG TCGCTTATCA ATTTCAATGG GCTCACTCAT
CCAACTACAC AAGCAGAGTT GCGCACGGTT CTTGCGACAG GACAGAGACT TGTCGGTCCT
GCCGCAGACT ATTCTGACGA CTTGGACCCA AGCATAGCGG GAAGGAAAGC GCTACTCAAT
CTTTTCTTAA ATCGCTGGAA GAGTGGGGGG AATGACTATG AGGAGGGACC GGTCAGCGAA
ATTTATGTTC CAATCCAGGA TAGATTTGGC CCAAATAGCA CGATGGCTGG TGTTTTCTCC
TCAACCATAT ATTGGCAGGT CTACTTTACG GATATTTTAC CAGAAACAGC GCAAGGTGTT
ATCTGTGTGC TGGAAAACAC CTGCTCCCAG AGCTTCACTT ACGTGATCAA TGGAGCTCAA
GCGAGTTACC TTGGTCAGGG CGATCTACAT GACCCGTCTT ACGATGAATA CATGATTGAA
ACCGGATTTG GCGCATTTAT TGGACGCGAC AACGTAGCAG CAAGTCGAGA TGGAAATTGT
TACTACAATG TTCGTGCGTA CCCATCGAAA GAAATGGAGG AATTATATAT CACGAGGGAG
CCTTTATACT TCACTCTTAC TCTGGTTGCC GTTTTTGTGT TTACTTCTCT GGTATTTGTC
GCGTACGATT GTCTGGTACA ACGCCGTCAC ACTGTTGTCA ACAAATCCGC TCTACAATCA
AATGCGGTTG TCTCCTCTCT CTTTCCTGAA GAAGTTCGCA GCCGGTTACC CAGTCTGTAC
GCCTCGAAAA CTGAACGGGA CGCCGCAACC AAATCTATGC AGCATGAGAA GGATGATAAC
GATGACAGTT TTGACGACTA CTACGACGAT TCTCTTCCAA TTGCTGATCT TTATCCGAAC
TGCACTGTGC TCTTTGCAGA TATTGCAGGG TTTACTGCAT GGAGCTCCAA CCGATCCCCG
ACCGAGGTGT TCAAGCTACT AGAGACAATG TACGGCCTTT TCGACAAGAT TGCGCACAAG
TATTCCGTCT TCAAGATCGA GACTATTGGA GACTGTTATG TTGCAGTGAC GGGCCTTCCC
AAGCCTCAAG AAATGCATGC AATCATCATG TGTCGTTTCG CCAACGCCTG TATCGTACGT
ATGAGCCAGA TGATGCACGT TTTAGTGGAA AAATTGGGCC CGGATACTGC AAATCTCTCT
ATGCGTGTTG GATTGCACAG TGGCCCAGTG ACGGCTGGAG TGCTGCGCGG TGAAAAGGCC
CGCTTTCAGC TTTTTGGGGA CACCGTCAAC ACAGCAGCTC GTATGGAAAG TACGGGGCAA
AAGGGACGAA TCCACGTTTC CGAATCCACA GCTACATTGC TGATCAACGC GGGGAAACAG
GCGTGGATTA ACGCACGCGA CGAGCTTGTA CAGGCCAAGG GGAAGGGTGA GATGCAAACG
TACTGGGTCA AGCCTCCGGA TGTTGGTACT AAATCTACCA CAACCACTTC TTCGGGCCCC
AGCGGCCGGG ACCTGTCTCT CTCGCAGGCT CTGCTTGAGG CGCATAGTTT GAAGATGGAC
CAGAAGCTTT CCGAAAGCAA AGCGAGCGCG CAAAAGTACG AAGACTTACT TGATAGCTTT
CGTGAGATTG AGGTGTCCGA AAAGAAGAAT GAGGTGGAGT CTTCTCCTGA GCCAAGAAAA
AAAGAACACC GTTTCGTCTA A
 
Protein sequence
MPVLETANHW ASHPSDSEKD PSKPEEKMRA ACAEGNGIAE KETNAVNAIR LATFAVLVLV 
ALTVSLLVYF HTKDTEEGEF AAQFISHGGK VVDSFRANAE SRLAALSSFS ASVTSYALHS
NQSFPFVTLP DFERRAAFTL QLAQVLSIAV NAIVSQDNRA EWEAYSVLNQ GWLAEGLSLQ
QAVVDEDENE SVQQFQNQTS SGVLGEDVTS SLDIVPFIFS LEEGGTTSAY ETGPGPFVPI
WQIAPAIPLP SLINFNGLTH PTTQAELRTV LATGQRLVGP AADYSDDLDP SIAGRKALLN
LFLNRWKSGG NDYEEGPVSE IYVPIQDRFG PNSTMAGVFS STIYWQVYFT DILPETAQGV
ICVLENTCSQ SFTYVINGAQ ASYLGQGDLH DPSYDEYMIE TGFGAFIGRD NVAASRDGNC
YYNVRAYPSK EMEELYITRE PLYFTLTLVA VFVFTSLVFV AYDCLVQRRH TVVNKSALQS
NAVVSSLFPE EVRSRLPSLY ASKTERDAAT KSMQHEKDDN DDSFDDYYDD SLPIADLYPN
CTVLFADIAG FTAWSSNRSP TEVFKLLETM YGLFDKIAHK YSVFKIETIG DCYVAVTGLP
KPQEMHAIIM CRFANACIVR MSQMMHVLVE KLGPDTANLS MRVGLHSGPV TAGVLRGEKA
RFQLFGDTVN TAARMESTGQ KGRIHVSEST ATLLINAGKQ AWINARDELV QAKGKGEMQT
YWVKPPDVGT KSTTTTSSGP SGRDLSLSQA LLEAHSLKMD QKLSESKASA QKYEDLLDSF
REIEVSEKKN EVESSPEPRK KEHRFV