Gene PHATRDRAFT_36013 details


Gene Information

Locus tag: PHATRDRAFT_36013
Symbol:
ID: 7201351
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011677
Strand:
Start bp: 343560
End bp: 345281
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002180416
Protein GI: 219119306
COG category:
COG ID:
TIGRFAM ID:
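
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only a consistency check, not part of the record: it assumes the start and end coordinates are inclusive and that the unspliced CDS ends in a stop codon, so the protein has one residue fewer than the number of codons. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 343560
end_bp = 345281
gene_length_bp = 1722
protein_length_aa = 573

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and its last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 1722 0 573
```

Under these assumptions the computed span and protein length agree with the listed Gene Length and Protein Length.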


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGACAC CCGCCACCAA GACGGAATTC GGAGACTACC AAGTCAATGC CGCCATGGGC 
TTGGCCAAAG CACTCAACCT CAGTCCCCGC GAATGCGCCG CACAAATCGT CAAAGCCCTG
CAACCCAAAA TTCAATCGTT CATGGAAGAG CCGGAAATTG CCGGCCCGGG ATTCGTCAAT
TTGCGCTTTC GAACATCGTA CTTGACGCAG GCCGTCTCGA GCATGGCTGG GGACGCTCAA
GGGCGTTTGG CGGTTCCCCG CACGGCGCAA ACACAAAAAA TTGTCGTCGA CTTTTCGTCC
CCCAACATTG CCAAAGAAAT GCACGTGGGC CATTTGCGAT CCACCATTAT TGGTGATACC
CTGTGCAACG TCCTTGAGTT CGCTGGACAC GAAGTGACCA GACTTAATCA CGTCGGCGAT
TGGGGAACAC AGTTTGGTAT GCTGGTTGAG CATTTGCGGG ACGAGTATCC GGCAGCGCTG
CGATCAGACA CCGCCGACGA TGTGGATCTG GGAGACTTGG TGCAACTTTA CAAGGCAGCC
AAGAAACGTT TCGACGAAGA CGATGAATTC AAGACGCGAG CACGCGAAGG GGTCGTGAAG
CTGCAAGCGG GAAACGAAGA GGAGCTGGCC GCCTGGGAGT CCTTGTGCGC AGCGAGTCGC
AAGGAATATC AAAAGATTTA CGATCGCCTA CAAATTGAAG GCCTGGTTGA GCGAGGCGAG
TCGTTTTACA ATCCGTTTCT GAAGGACGTC GTCGACGAAC TTGTCGAAAA GGGTTTGGCC
GTCGAGAGTG ACGGAGCCTT AGTGGTATAT CTGGAAGGAT ACACCAATCG TGACGGTTCT
CCGTTGCCCA TGATTGTGCG CAAATCCGAT GGTGGCTTTA ATTACGCAAC CACCGATCTG
GCGGCCATGC GACACCGTAC GTTGATGCCC CGAGCAGAGT CGGGAGAAAG AGCAGACCGG
GTCCTCTACG TCACGGATGC GGGACAGGCA CAACATTTCG AAATGGTGTT TGAAGCCGGA
AAAGTTGCCG GATTCTGTAG GGAGGGTGCT TCCCTGGAAC ATGTACCCTT TGGCCTAGTC
CAAGGGGAAG ATGGCAAGAA ATTTGCCACC AGGTCCGGTG AGACTGTCAA GCTAAAGGAT
CTTCTGGACG AAGCTGTCCG GATTTCTGGT GCGGATTTGA AGAAGCGCAA CGAAAACGTC
GATCAGGAAT TCCTGGACCG TGTGGACAAT GTCGCGCGTA TTGTAGGTAT CGGTGCTGTG
AAATATGCCG ATCTTTCCAT GAATCGCGAG TCGAATTACC GCTTCAGCTA CGATCGCATG
CTGAGTCTGA ACGGGAACAC TGCCCCATAC ATGCTCTACG CCTACGCCCG TGTATGTGGT
ATCATCCGCA AGGCCAGTGG GCAAGAAGGA ACCGGGGCCA TTGATTGGCC AAAGGCTTCC
GAAATAATGA TCACGCACGA GTCTGAGTTG GAGTTAATAC GGAATCTAGT CAAGTTACCC
GACGTGTTGA ACGAAGTTGA ACGAGAACTG TATCCAAACA GAATGTGTGA CTATCTTTTC
GAGACGTCAC AAAAGTTTAA TCAATTTTAC GAGAGTTGCT CGGTCAACAA AGCGGAAAGC
GAAGAGATCA AAGCAAGTCG TCTTTCCCTG TGTACAGCAA CTGCGGGCAC TATTCGCTTA
CTTTTGACTT TGCTCGGCAT CGAAACATTG GAAAAAATGT AG
 
Protein sequence
MVTPATKTEF GDYQVNAAMG LAKALNLSPR ECAAQIVKAL QPKIQSFMEE PEIAGPGFVN 
LRFRTSYLTQ AVSSMAGDAQ GRLAVPRTAQ TQKIVVDFSS PNIAKEMHVG HLRSTIIGDT
LCNVLEFAGH EVTRLNHVGD WGTQFGMLVE HLRDEYPAAL RSDTADDVDL GDLVQLYKAA
KKRFDEDDEF KTRAREGVVK LQAGNEEELA AWESLCAASR KEYQKIYDRL QIEGLVERGE
SFYNPFLKDV VDELVEKGLA VESDGALVVY LEGYTNRDGS PLPMIVRKSD GGFNYATTDL
AAMRHRTLMP RAESGERADR VLYVTDAGQA QHFEMVFEAG KVAGFCREGA SLEHVPFGLV
QGEDGKKFAT RSGETVKLKD LLDEAVRISG ADLKKRNENV DQEFLDRVDN VARIVGIGAV
KYADLSMNRE SNYRFSYDRM LSLNGNTAPY MLYAYARVCG IIRKASGQEG TGAIDWPKAS
EIMITHESEL ELIRNLVKLP DVLNEVEREL YPNRMCDYLF ETSQKFNQFY ESCSVNKAES
EEIKASRLSL CTATAGTIRL LLTLLGIETL EKM
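
Both sequences are printed in blocks of ten characters separated by spaces, so they need cleaning before use. The following is a minimal sketch of how that might be done, along with a GC percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical and not part of any database tooling. The example uses only the first line of the gene sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence shown above, used as a small example.
first_line = "ATGGTGACAC CCGCCACCAA GACGGAATTC GGAGACTACC AAGTCAATGC CGCCATGGGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Running the same two helpers over the full gene sequence should give a length and GC percentage that can be compared with the Gene Length and GC content fields.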