Gene PHATRDRAFT_44453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44453 
Symbol 
ID7197745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp610416 
End bp612342 
Gene Length1927 bp 
Protein Length589 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178263 
Protein GI219114935 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCCCCAAAAT CTGCATGTGA TTCGGTTAGT AGTTCATCTC ATACCAAAAT TTTCTCTTCG 
ATTCGTGTTG CCTCGCTTTA GCTGTGGAAA GGCTTGTTCC GGGAGCACGA AGAGAGAAAC
ATTCAATATC GGATAATTTT CCGGTCTCGA GGGTGCAATG AAACAATCCC GCTCGTCCAG
ATCTCGCGTG GAAAAAACTG TTCCAAAACC ACCGCCTACT GGCCCAAAAA CTCCATCTAC
GCTGGCCGCC ACCGTGGCAG CAGGGAAATC TTTCGAGACC ACCGAGCGCT GCGGCACATG
TTCTGCATGT TTACGTGAAG ACTGTGGCCA GTGCGAAGGA TGCGTATGCA AACAAAAATA
CGGAGGTGAT GGCTCCAGTA AGAAGCAGTG CGTCTATCGG GGTTGCCAAG CTATTTCGGA
GATAAATCTA AAGGTCGGCG GTGGCTGCTT CGCGAGCGCC ACTCACAACA GCGACATCAG
TAGCGTTTTG TCTATATCGC AATACCCACC ATACCCCTTC ACACCGCCAC CGTCTTTAAT
CGACGTCAAC AAAAAGAGGA AACCTGATTT GACAATAGCT GACAAAAAGT CGATGTATGG
GAAGCTCATC CCAAGTGAAT CTCCACGCGA TCACTGTTCC GGCTGCAACT TACGGCAAGA
AACCATTAAC GATTCTGTGC TTATCTGTGA CGGTCCTGCT TGCGGTCGTG AATATCATCT
GCGTTGTTGC GTGCCTGCTC TTGACATCAT TCCGGAAGGC GATTGGCTCT GCCAAGACTG
CAGTCCGTCT GGCAGCGCTG AAACTTTGAT GCAATATCTT GAATCAAACG ATGAACGACG
TTGTGACTTT CAATCTTCTG AGGAATTTGT TGCCTCTCTC ATCTCACATG ACATGGTAAA
AGAAAAAGTG CACCGTCGCC CTCTGTCGGA ACTCGAACGA GCCACTGAAA TCCATCGTAG
TGCAATTGGC GAAAATTGGA ACCTTTTGAT TTCACCAGAC TTTTATGTCG GAAAGCCGTT
GCGTATTTAC GATGGACTAG CAAACCAGTA CCATTCGGGC CGTCTGGTAG ATTGTCGACA
GTCTCTCTCT TGTGGGACAG TAGAATACCT TGCGCGTTTC CCTTCCGGAA AGGATGGTCG
GAAATCTCCA CTCCACCACT GGATTATTCT CGAAGAACAT TGCCTTGCTA TTGGCACCGC
ACTGATCTGG TCACAAACTC TTGGTCGGCG TTGGAAACCT GCACAGCTCC TGCTTCGTAC
TGGAAGAGAA CTTGTCTCTG TCGCCAGTAT GTATTCGGAA GAACAAGGTG AGATTCGGTT
TACGGATTCC AAGCATACTT TGAATGCGTT GCCTAGCACA CCGGAAACAG ACACAACTAA
ACCTGCGGCC ACTCCATGCT CGGGAAGTGC CTCTCGATCA GCACCGTCTG AGCCAAGGTT
TCCTATCAAG AAGAGGCGAA GGAACGAAGT GTGGGGCCTT GTACGTTTCT TTGGGGAAGG
AACCTTTGAA TTCGTCCCTT TGACAGCTCG TGCTCGCAGT TATAAGGATC CAATCTTTCA
AGCAAAATAC GGAAAGTCGG AAGCAATATG GCTCCCGCTT GCGATCGCAG AGGCTGAGCA
GGCAGAGCAA ACATCTGTTC TTCAATGGCG TAACATGGAG CAAAACAATA GACTTAGCCA
ACATGTCTTA TCGAGCAGAG ATGACTACGG TCTTCAACCA CTACAACCAA CCAACTCGTT
CGATTCTGTT TCTTTTCCCT CGCAGCTTAC ACCATCAATT CCGCAAGGCT TAGATCGACT
TCATATATTG AATCTTTTAC AAGAGCAAGG GTTAGAAGTC GACAAAGACA TAGCGTCAAT
TCTTCAATGT ACGAGTGTGC CAGTCAACGT GGCTAGGTGC CTCAAACAAA ATGGTCATGT
AGTATAG
 
Protein sequence
MKQSRSSRSR VEKTVPKPPP TGPKTPSTLA ATVAAGKSFE TTERCGTCSA CLREDCGQCE 
GCVCKQKYGG DGSSKKQCVY RGCQAISEIN LKVGGGCFAS ATHNSDISSV LSISQYPPYP
FTPPPSLIDV NKKRKPDLTI ADKKSMYGKL IPSESPRDHC SGCNLRQETI NDSVLICDGP
ACGREYHLRC CVPALDIIPE GDWLCQDCSP SGSAETLMQY LESNDERRCD FQSSEEFVAS
LISHDMVKEK VHRRPLSELE RATEIHRSAI GENWNLLISP DFYVGKPLRI YDGLANQYHS
GRLVDCRQSL SCGTVEYLAR FPSGKDGRKS PLHHWIILEE HCLAIGTALI WSQTLGRRWK
PAQLLLRTGR ELVSVASMYS EEQGEIRFTD SKHTLNALPS TPETDTTKPA ATPCSGSASR
SAPSEPRFPI KKRRRNEVWG LVRFFGEGTF EFVPLTARAR SYKDPIFQAK YGKSEAIWLP
LAIAEAEQAE QTSVLQWRNM EQNNRLSQHV LSSRDDYGLQ PLQPTNSFDS VSFPSQLTPS
IPQGLDRLHI LNLLQEQGLE VDKDIASILQ CTSVPVNVAR CLKQNGHVV