Gene PHATRDRAFT_44876 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44876 
SymbolcupD 
ID7199802 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011673 
Strand
Start bp499783 
End bp502846 
Gene Length3064 bp 
Protein Length933 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179013 
Protein GI219116436 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCTCT CTCCACAAGC CGCCCGTATG GGTACCGTGT GCACGACACT CCGCAAGATA 
GTTTCGACGG CGCGTCAGCG GTCTCTCTCG CATTCGGCGT TCGCGTCGGT CCGTAGCGAG
ACCGCGTTTG GAGCCCGGGC GTTTTCGACG CGTGCGAAAT CTACTCTAGC GGCACTCGAA
GACTCGGATG ACGAGGATCT CACCTTTCGT AGTCAGGGAC ACGCCGCAGC CGCCGCGGCT
TTGAGCAAGG CAGGATTGAA CAAGTCTCAC GATGAGGCCT GGATGATCAA CGTCAATCGC
AACGACGACA ACGAATGGCT GAACGGGCCG CGCAGTGCCG AATGGTTTAC GGGCATTCAC
CCTAGTCAAT GTCCTGGTAA GTTTACAAGA GCAATTGCCA AAACTTGTGG CGAACTTGCA
CGCGCACGCG AACGGAAGTT GTGCTCACTC AGAGGGATCT AAATTATATT TGTATATCCA
ACCAGGAGCC GACCAAGCGG GCACCATCCG CTCGCTTTCA CTTCCGAATC TGTCGGCCGT
TACCCGTGAA GCTGCAAAAG AATACTTTGA CAACTCATGG ACGCTCTACG AGACATTGTT
TGCCGGTCTC AAAGGAGAAG AAGGATTTTA TCGGTGAGTT TGAGTTGTGT GGTGATGTTT
CGTTTTGTAT GACCGTTGTC TAATTTCACC GTCCCACCGG ACTGCAGCCC GCCAGTCCAC
GGTCTCCGCC ACCCCCAGAT ATTCTACTAC GGACACACTG CTTGTCTCTA CATCAACAAG
CTCCGCGTCA GTAAAGTCTT ACCCAAACCT GTGAACGCCT ATTTCGAGTC CATCTTCGAA
GTCGGTGTAG ACGAAATGCT CTGGGATGAC ATGAACAAGA ATGATATGCT TTGGCCCACA
GTTTCGGAGG TGCACGAGTA TCGACAGCAA GTATACAAAA CGGTTGTGGA CGCTATTTTG
AATCATCCTA GTCTTGATCA AAGGAACGGT CCAGTGAAAG TCGATCAGGA TCATCCAATG
TGGGCATTGT TCATGGGCTT CGAACACGAG CGGATCCATA TGGAAACGAG TTCAGTCTTG
TTCCGTGAGA CGCCGTACCA TTTGGTCCAA ACACCCCAGC ATTGGCCTCC GATTCATCCG
TCGGCTTTCA ACGATGCCTC GCCCACAAGC AATCCGATAG AAACTTTGGA TTACCCCGCG
AACCGCATGA TTGCCGTGGA CAATGGAACC GTCGATCTCG GAAAGCCTGC CGACTTTCCT
TCCTTTGGAT GGGACAATGA GTACGGTGAA CGCAATATGG ATGTGCCTCC ATTCTTCGCT
AGTGAACACA TGATCACAAA TGGAGAATAC TGGCAGTTTG TCGACAATGG TGGCTATCGA
AATCGAGAGT ATTGGTGCGA CGACGGCTGG GCGTGGCGCA GTCATCGCAA TCTTAAGTGG
CCTTTTTTTT GGGAGCCCGC AGGACCCGCT GGGTCCAATA AGTTTTCGTT GCGAACCATT
TTCAAGATCG TTCCCATGCC GTGGAGTTGG CCTGTTGATG TAAATTACTA CGAAGCGCAA
GCCTTCTGTC GATGGAAGAC CGAGAAAGAA GGATCTCCGA CTTCAAAACC GTATAGAATT
CTCACCGAAG CGGAGCATCA CATCATTCGA AACCACGATC ACAACTTGGA GGCTGCTCGT
AGAGACGTTT CGGCGGATAA GGTGATGGTG ACTTCAGGGC AAGCGTTTCC CAAAGGATCG
GCTGGATCAA ATTTGAACCT GGCATTCTCT AGCCAAAACC CCGTCGATTT CTTTGAGCCG
TCCCAAACTG GCCACCGTGA TACCACCGGA AGTGCCTGGG AATGGACGGA AGACCACTTC
AACCCTTTAA AGGGATTTGA AGTCCACCAC GTGTACGATG ATTTTTCCAC TCCATGTTTT
GATGGCAAGC ACTCTATCAT TGTGGGGGGA TCTTTTATAA GTACTGGCGA CGAGGCATCA
GTTTTTGCAC GATTCCATTT CCGACCCCAT TTCCTACAAC ATTCTGGTTT CCGTCTGGTT
GCATCAGATC ACGATGCTCC TGCTACGCAC CTTTTTGCCG GAAATTTCGA TGGTCAAGTT
GCCGCACGCG ATGCCGCGGT CGCGCAGGAA GAATCCAAGC CGAGACAGTC TTCATTAGGA
AGCGGCAGTG GCAGCGGCAA TGTTTACGAG ACGGATGACA GCTTGCATAT GTATCTTGGC
CTTCATTACC CTAATTCTGG CGAGAAGGAA GGCGTTGCCC CGATCCTTCC TCACGACAAC
TCTCCAAACC ATGGAACTGG CTTCCCGCAA CGAGTGGCAG GTCTTCTGTC CTCACTGAAA
CCCGAGTTCA ATAACAATCG CGCATTGGAT ATTGGCTGTG CTGTTGGAGG GGCGTCTTTT
GAACTTGCTA AGACTTTCGA TCACGTGGAC GCCTTTGATT TCAGTGGATC TTTCGTGAGC
GCTGCCAAGC GAATGCAATC GGCAGAAAAT ATCAAGTTTC GGGTTCCCGT GGAGGCTGAA
CTATATGAAG ACCTTCAGGC TATCCACGAA CATGGTGTGA CGGATTCGGT GCGCTCCAAA
GTGCAGTTCT TTACAGGGGA CGCCTGCCGA CTCATCGATA TGAAAGAAGA CGGAATTCTT
GGCTCATATG ATGGCGTGGT CATGTCGAAT TTACTCTGCC GCCTCCCAGA TCCAATGGCA
TGCCTCGCCG GACTTCCAGA GATTATAAAT CCCGGTGGAG TGGTCGTGAT GGTGACGCCA
TTTTCGTGGT TGACTGAGTT CACCCCCCGG GGCAAATGGC TAGGAGGATT TTACGACCCC
GTAACAAACG AAGCTATCTA TTCGAAGGAC ATCCTGCGCC AAATTATGGC CTCGAATGGG
TTCGAGAAGA TTCATGAGGT TCAGATGCCG CTCGTCATTC GGGAGCACCA ACGTAAATAC
CAGTATATTG TCAGCGAAGC TACAGGTTGG CGTAAGACAG GATGATGCAT TTCTGTTTGC
ATTGTCTATA CATCCTTTAC ATTAATGATT TTTTCCATTT AAACTAGCAT ATTCTATTGA
CTAG
 
Protein sequence
MMLSPQAARM GTVCTTLRKI VSTARQRSLS HSAFASVRSE TAFGARAFST RAKSTLAALE 
DSDDEDLTFR SQGHAAAAAA LSKAGLNKSH DEAWMINVNR NDDNEWLNGP RSAEWFTGIH
PSQCPGADQA GTIRSLSLPN LSAVTREAAK EYFDNSWTLY ETLFAGLKGE EGFYRPPVHG
LRHPQIFYYG HTACLYINKL RVSKVLPKPV NAYFESIFEV GVDEMLWDDM NKNDMLWPTV
SEVHEYRQQV YKTVVDAILN HPSLDQRNGP VKVDQDHPMW ALFMGFEHER IHMETSSVLF
RETPYHLVQT PQHWPPIHPS AFNDASPTSN PIETLDYPAN RMIAVDNGTV DLGKPADFPS
FGWDNEYGER NMDVPPFFAS EHMITNGEYW QFVDNGGYRN REYWCDDGWA WRSHRNLKWP
FFWEPAGPAG SNKFSLRTIF KIVPMPWSWP VDVNYYEAQA FCRWKTEKEG SPTSKPYRIL
TEAEHHIIRN HDHNLEAARR DVSADKVMVT SGQAFPKGSA GSNLNLAFSS QNPVDFFEPS
QTGHRDTTGS AWEWTEDHFN PLKGFEVHHV YDDFSTPCFD GKHSIIVGGS FISTGDEASV
FARFHFRPHF LQHSGFRLVA SDHDAPATHL FAGNFDGQVA ARDAAVAQEE SKPRQSSLGS
GSGSGNVYET DDSLHMYLGL HYPNSGEKEG VAPILPHDNS PNHGTGFPQR VAGLLSSLKP
EFNNNRALDI GCAVGGASFE LAKTFDHVDA FDFSGSFVSA AKRMQSAENI KFRVPVEAEL
YEDLQAIHEH GVTDSVRSKV QFFTGDACRL IDMKEDGILG SYDGVVMSNL LCRLPDPMAC
LAGLPEIINP GGVVVMVTPF SWLTEFTPRG KWLGGFYDPV TNEAIYSKDI LRQIMASNGF
EKIHEVQMPL VIREHQRKYQ YIVSEATGWR KTG