Gene PHATRDRAFT_44485 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44485 
Symbol 
ID7197714 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp705372 
End bp708493 
Gene Length3122 bp 
Protein Length859 aa 
Translation table 
GC content56% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178559 
Protein GI219115527 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCGCAA ATTACGTCGC GCGTCGCGGG GAAAGCCTCC ACACCCTTGA CTCTGTCCTT 
CGTCCGCTGC GAGAGCAAAA TCCTCACTTC GCACCAAATG TTGGTCTGGC CGTATACCGT
ATTCCGCTGG GCCCGCTATC GACGAACACA CCAACAGCAG GTGTCATCCA CGAGACGGCG
ACAACGGTAC AACTCTGCGC GTTCCGAAAC ATACAAACAT ACTCACATAC TTTATTCGCG
TACAGACCAA ACAACACGAA CGAACGAACA GACAAACACA CATACACACG CTAGCACTAG
TAGAGAGATA TCGCCACCAC CAACAATAAC GATAGCAAAT AGTATCAAGT GTTTCAGGGT
CATCACCGTG GTTAATCTTC GGACAGCTTT CACACCTATT TCCTTCGTTT CAACAGACAT
TCTTTCTACA TACCTTCGCA ATGCCGTCGA ACAATACCGG CGGTAGCCGA GCCAAGAACA
AGGACGGCAA GCTCATTGAT AGCGTAAGTC TTCGAGGATT GATATTGAGT GATAAAGCCC
TTTTGGGGAC AGCGGGTTTC GACATAGTCT CCATGGATCC ATCGTCCTAC TACCGCGTTC
AATACGTACG GTTCCAAGAA GAACGCCTCT GGAATGAGTT TCACCGGGGC TTGCTGCTCT
CTTCGTGACC CTCCTTGGTC TATCCGGATC TTGTTTACTC TGCTTTCGAC GACACCTTCG
CTTCACATCA ACCTTATAAA TGTATTCTCT CACCCTTTTC TTTTTGCGGC ATGCAATACA
ACACAACATT CAAATACCAA CCCGTATGTC GTTAGCTTCG GGAAGCCTGT CCTCCACACA
AACACAAAGA CCTCGCGGCG CTGGTCAAAT CGCTCAAGGG TGACGAGGAA AAGATCCGTC
AAAAGATTAT GGAATGGTGG GAGGAACAGC CCGTTTCGAC CGAAGAAGAA TGGGAGGACG
TCAACAAGCG CATCGCCAAG AAGAAACCCG AAGTTCGTGG GGGTCGTGGA CGCGGAAGGT
CCGAAGGCCG TGGACGTGAG GCGGGACGCG GAGGCCGCGG CGATGGCGGC CGAGGCCGTA
CCGGTGAAGG GCGCGGGGCG GGTCGGGGGC GCTCGACAGC CCCGCGAGCC AACGACAGGC
CCAAGAACAA TACCACGCTC ACAGAGACAA GTGCCACTGC AGATAAACCC GTTGCAGATC
CCGAGATTGG AATTCCCAAC TTGAACTCGG TCCCGGCTCC ACTCGGAGCG TGGGCGAAAA
AGACCGGCGA CTCCATTCCG GCTGAAGTTT CTGCTCCCGA TCCGGTCCCG ACTCCCGCGC
CAGTGGCTGC CGTAACACCT CCTTCTCCCG TAGCCCCAAT GGTATCGACT CCTGCGGCTC
CCGGTATCCG TGCGACAAGT GGAGGGAACG TCTGGGCCAC CAAGGGATCG GCGCACCTTA
TCCGTGCGGA AAAGCCCAAA CCGCCGGCCC CGGCTGCCCC ATATGTACCA AAAGCAGAGG
CACCCCGGGT GCGACGGACC GGGGTCTCCC GCGAGGCACC CACGACCACT GCACAACCAC
CCGCTTCTGT GCATATGAAT GTGCCAGTCG CTCCTGCTCC TCCTGCACCG GCTACCACTG
CACCAACCAC AACGGCCAAC GCATGGAGCA AGAGCAGTGC GGTTTCAGAA ACATCCAAAG
TAGACCTTCC ACCCTCAGCT GTGGGTAGCA AACATATTGG ATCCATGTCA CCGGCTGCTC
CTCCAGCTCC GGCCGCCCCT CTAGAAATGC AGCAACAGAC CAAACCGCCG AAGGCACCTG
GACCTGTCTT GAATATGGGT CGCTGGGAAA CAACTGATGC CGACGACGCT AATTTGGACT
TTGGATTCGG CTCCTTTGAT GATGCGGGTG GCCCAGGTCA CCAGGTGAAC GCCAGCGTAA
CGGAGAACGA AATCGCTCCT CCCGCACCGG CGGCGTCTCC GGCTAGACCT CCGCCTGGTC
TCTCCCTTAC AGGCATTCCG CCGATGCCAA GCAACGCTGT CATGGTGCAC GAGCTAGAGA
ACAAGCTCGA AGGAGCCACA CTCAACGCCA GCGCTGGTAC CGGTGACAAC AATTCTCATC
CGCAAACCAG TGGACCTTCC ATGAACAGCA CTGCACCGGG CATGTACCAG GGTGGATACG
GCCAACCCTA CGGTATCCCT GGTAGCAATA ACATTGCGTC CTCCATGGGT ATGTACAACT
ACAACGCGCC CGGCGCACAG GGCAATGCAT TTGCTGGCAT GCCCGGCGGT GTTCCAGGGC
TGGGTGGACC CTCTCAGCCA AAACTTGGCG GTGGCATCCC TCCAGTCCAG GCCGGCGGAT
TGTACGCCGC TGCACAGCCT GAGCCAAGCT CCGGCAACGA ATCTGGATCG ATAGCGGCAT
CGAATCCGAC TGATCCTAAT GCAACCCCCG GTATGCCACC AGGCATGCCT AACATGCCTT
ACGGAAACCC AGCGCTTTAT TATGGAGGTC AAGCTCCTTT CCACATGGGA CAGCATCAAG
GCGGTATGGG TTACAACTAT GGGTATGGTG CCCAATTTGG AGGCGCTGTG CAGGGTGGAT
TTGGATACCC TCAAGGTATG GGGCAGAGTG CTGGGTATGC TCCCCATTAT GGTGATCAAC
ACGAGCAGCA AGGATCACAC GGCAACAGCG GTGGCTACCA GAAAAACAAT GGCAATTACC
GTGCACGCAA TCAGCATCAC AATAATAATC AGTATCACAA CCAATATCAA CAGCATGGTG
GTTATGGCGG CCAACCATAC AATATGGGTT ACCAAGGTGA TCATTTTCAA CAACGTGGAG
GATACGGCCA GCACGGAGGC ATGCCCGATC CGTACAACAT GCAACAGCAA CCTCAACAGC
ACCAGGGAGG GGGAAACTAC GGAGGAGGTT TCCAAGACGA CGAACAGTAC AAGGGTAAAA
AGGGTGGCAA CCGCCCCTTT CAACAACAGG GTCCGCCTCA GGCTCTGGGC ACTGGACAAC
AGACATTTGG CTTGCAAGGA CAAGTTGCCG ATTCAAGCCA ACCGTCTAGC GGCTGGTCCA
ATCAACAGGG AGCTACTGGT GGATGGGGCG GTGGTACGCC AAGTTGGCAA CAAAACAAGT
AA
 
Protein sequence
MRANYVARRG ESLHTLDSVL RPLREQNPHF APNVGLAVYR IPLGPLSTNT PTAGVIHETA 
TTTFFLHTFA MPSNNTGGSR AKNKDGKLID SLREACPPHK HKDLAALVKS LKGDEEKIRQ
KIMEWWEEQP VSTEEEWEDV NKRIAKKKPE VRGGRGRGRS EGRGREAGRG GRGDGGRGRT
GEGRGAGRGR STAPRANDRP KNNTTLTETS ATADKPVADP EIGIPNLNSV PAPLGAWAKK
TGDSIPAEVS APDPVPTPAP VAAVTPPSPV APMVSTPAAP GIRATSGGNV WATKGSAHLI
RAEKPKPPAP AAPYVPKAEA PRVRRTGVSR EAPTTTAQPP ASVHMNVPVA PAPPAPATTA
PTTTANAWSK SSAVSETSKV DLPPSAVGSK HIGSMSPAAP PAPAAPLEMQ QQTKPPKAPG
PVLNMGRWET TDADDANLDF GFGSFDDAGG PGHQVNASVT ENEIAPPAPA ASPARPPPGL
SLTGIPPMPS NAVMVHELEN KLEGATLNAS AGTGDNNSHP QTSGPSMNST APGMYQGGYG
QPYGIPGSNN IASSMGMYNY NAPGAQGNAF AGMPGGVPGL GGPSQPKLGG GIPPVQAGGL
YAAAQPEPSS GNESGSIAAS NPTDPNATPG MPPGMPNMPY GNPALYYGGQ APFHMGQHQG
GMGYNYGYGA QFGGAVQGGF GYPQGMGQSA GYAPHYGDQH EQQGSHGNSG GYQKNNGNYR
ARNQHHNNNQ YHNQYQQHGG YGGQPYNMGY QGDHFQQRGG YGQHGGMPDP YNMQQQPQQH
QGGGNYGGGF QDDEQYKGKK GGNRPFQQQG PPQALGTGQQ TFGLQGQVAD SSQPSSGWSN
QQGATGGWGG GTPSWQQNK