Gene PHATRDRAFT_49576 details

Gene Information

Locus tag: PHATRDRAFT_49576
Symbol:
ID: 7198193
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011691
Strand:
Start bp: 91309
End bp: 92895
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002184392
Protein GI: 219128379
COG category:
COG ID:
TIGRFAM ID:
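
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon (the record lists the gene as unspliced). The function names are hypothetical and not part of the database.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and encodes no residue.
    return gene_length_bp // 3 - 1


gene_length = cds_length(91309, 92895)                # values from the record
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)
```

Under these assumptions the results agree with the listed Gene Length (1587 bp) and Protein Length (528 aa).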


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00024084
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCCAGC ATTCAACTGC GATGAGTTTT ACCGGCCTCT TGGTCTTTGG AGCCTTGGGA 
GCGATATTGA TGAATCTGAT TTCAATACAT CAACATTTGG AGAAAGATGA AGGATCGCGT
AGAGAGCAAG CGTCTACTTC TCTCCGAAAG GCTTTTGCAC GGAATTACTT CACGCCGGTT
AAATCAATAG AGGTGCCCCC TAGCGCCGAC AAAATACGTG GATCTTTATC TTCGGCGGAG
GAGCTGGGGG AAGCAGATCG CTTCATCGCG CAGGAGCCAT CCAAACGAAC TCTCCTAGAA
GAAGTTCCAC CGCTTCTTTC CAGAGAAATC GACTTTGCGA AGCAATCCAA AACTCTTTCG
ACAGCCAGCA ATCGCGAGAC TCAAAGAAAC GATACTACAT CGCAAAGTTT CGCGCTGACA
AAACTACTCT CGTATGGCAA CCAGACGACG CCCAAGCAAC GTAGTGACGT TCCGGTTGAG
TTCAAGGTAC AGACCTCTTC GCGGTTTGCC TATAGTTTTC TAGTTGGCGG TTGCGACCCC
GATAACCCGA CCTACTTGGG GTACCTTTAC GACATTCTCG TATCGACGTA CATTCAAAGA
CAAGACGGAA GTCGTTCGGA TGTGATGGTG TTCTTCCAAA TGGCCTACGA CTCGCCATAC
GAGCACCTCC CCCCTGAGCA TACTCGTTTC CTCTACGATA TGAACATTCA ATACCAGTAT
ATTCCGAAGC AGAAGGACGA GGGCTTCTAC CGTGTCACGC TAGAAAAGTT TCGCATTCTG
ACCCTGACAC AATACGAACG GGTAATGTTT TTAGACGGCG ACGTAATGGC CCGAGGAAAT
CTGGACTCTC TTTTCGAGCT GTCTACACGC GGTGTTTTGA AGGAGAATGT GGTCATGGCG
GGCCGGGAAG AACCGGCCAA CGCAGGCCTA TTCATACTTG CTCCGCACGA AGGTGGCTAT
GAACGTATTC AAGAGTTGAT TCGCGAAAAA GAAGAGCGCG GTCGGGCGTT GCCGTACCCT
CACTGGGACG AAGACATTGG TTGGGGACAT AAGATTGAAG ATCCGGACTG GCACGAATTG
ATTACAGGTG CAAAAGGTAC GAAATGGGAT TTTTACTGTT CGTACTCCGA TCAAGGGTTG
CTGTACCACT GGATCAAATA CGAGCGGAAA TCGGCGTCTA TCTTCATGTC CAAACGCGTA
CACAACTGGG GTGTTGATAG CGAGGAAGGG ACCGATGTCG TGTTACAGGA GAATCTTATC
CTAAGCCGCG TCATGAGAAA AGTAGAGAAC GATCGCAAGT GCTACAAGGG GTCCATGCAG
GGCGCTCAGT GCCGACCACC GTTCAACGAT TTTATCCATT TCACCGGTAC GAGCAAGCCA
TGGATGCGAA AGCCTCCGGT GGACTTATCC GATGCCATGT CGGAAGAATC CCCGATGCAC
TACTGGTATT ACATCTTGTC CAAAGTCAAC CAGGATCTGA AAATGGGTCT TGCTTTCGAG
AACTGGGTGC CGTTGCAACG ACCAAAATTG GGACTGTTCC CCAGTATTGC CAAGGTTGCA
AATGTTGTTA AAAGTAGAAA GCAATAA
 
Protein sequence
MRQHSTAMSF TGLLVFGALG AILMNLISIH QHLEKDEGSR REQASTSLRK AFARNYFTPV 
KSIEVPPSAD KIRGSLSSAE ELGEADRFIA QEPSKRTLLE EVPPLLSREI DFAKQSKTLS
TASNRETQRN DTTSQSFALT KLLSYGNQTT PKQRSDVPVE FKVQTSSRFA YSFLVGGCDP
DNPTYLGYLY DILVSTYIQR QDGSRSDVMV FFQMAYDSPY EHLPPEHTRF LYDMNIQYQY
IPKQKDEGFY RVTLEKFRIL TLTQYERVMF LDGDVMARGN LDSLFELSTR GVLKENVVMA
GREEPANAGL FILAPHEGGY ERIQELIREK EERGRALPYP HWDEDIGWGH KIEDPDWHEL
ITGAKGTKWD FYCSYSDQGL LYHWIKYERK SASIFMSKRV HNWGVDSEEG TDVVLQENLI
LSRVMRKVEN DRKCYKGSMQ GAQCRPPFND FIHFTGTSKP WMRKPPVDLS DAMSEESPMH
YWYYILSKVN QDLKMGLAFE NWVPLQRPKL GLFPSIAKVA NVVKSRKQ