Gene PHATRDRAFT_14067 details


Gene Information

Locus tag: PHATRDRAFT_14067
Symbol:
ID: 7202448
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011681
Strand:
Start bp: 561002
End bp: 562102
Gene Length: 1101 bp
Protein Length: 271 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002181583
Protein GI: 219122503
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.952518
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGCCTA GTAGCCTCGT GCGACTAGTG AAGCCGTATC TATATACCTT TCGGAGTAAC 
GCCAAATTTC GCTGGTTTGG ACGAACAATT CTGGACGTTT ACGTCAGCGA GTTTGGGAGT
TACCCAGAGT CGTACTACCG TACGGCAATT CAACAAGGTC GTATACGGGT CGGAAACGAA
AAGGTGGATG TTGCATATAC TATACGATCC AACGACGTGC TAACCCATAC TGTACACCGT
CACGAACCGG CCGTAGCGGT GTCCCAACCG CAGGCGCCAT TTGTGAAAGT TGTTGCCAAT
TCGGAGACTT GGCTAGTGGT GGATAAACCT GGGACAATGC CGGTGCATCC TAGTGGCGCC
TACCACTTAA ATTCGCTGCT ACCAATTTTG GAAAATACCT ACGGAAAGTT GTATCCCATC
CATCGGCTCG ATCGTCTAAC AAGTGGCTTG GTTATCTTGG GGAAAACCCC TGAAGCTGCG
AGGCAATTGG GGAAGGCAAT CAAGGAAAGA GACTCTTGCA CAAAACTGTA CATAGCTCGA
GTTCGTGGTC GTTTTCCTTT CAACTGTGCA TCACACGTTC CGAACTTGTC CAGCCATAAG
TCATACCCTC CACGGTATGG AGAATGGTCT GTGCTCCAAG ACATGGATGG CAAAAAAGAT
AGTACCGGTA AGATTCGCAG TCGAAACTGT CATGGCTATA TGTTTGAGGA TATAAAGGGA
ACGGTTCGAA ATGATTTGAC GTTGCAAACA TTTGGTAGCA AAACTGGAGG TAGATTGGAG
GACTGGTTGC AAGCTCTTGA ATGCGAGGAC ATTTGTCAAA CCAATTTCTC GAGCAATAGC
TTTGTGTGGA TGCGTCTCTG TTGCCCTGTA CGAGTAGAGG AACCGAAAAA CGGGATTTGC
AAAGCTGGAA TATTTGACGA ACTCGACGAT AAAACTTACC ATGAAACAGT GAAAGCTGCC
GAGACGTCTT TTGCCTTGCT TAAGTTTGAT GCCAAGTCTG ACTCGAGTGT GGTATTGTGC
CGACCGGCAA CTGGTCGCAC CCATCAAATT CGGTTACACC TACAATACTT GGGCCATCCA
ATCGCAAACG ATCCGAATTA T
 
Protein sequence
MAPSSLVRLV KPYLYTFRSN AKFRWFGRTI LDVYVSEFGS YPESYYRTAI QQGRIRVGNE 
KVDVAYTIRS NDVLTHTVHR HEPAVAVSQP QAPFVKVVAN SETWLVVDKP GTMPVHPSGA
YHLNSLLPIL ENTYGKLYPI HRLDRLTSGL VILGKTPEAA RQLGKAIKER DSCTKLYIAR
VRGRFPFNCA SHVPNLSSHK SYPPRYGEWS VLQDMDGKKD MKAAETSFAL LKFDAKSDSS
VVLCRPATGR THQIRLHLQY LGHPIANDPN Y
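
Consistency check (sketch)

The sequences above are printed in space-separated blocks of 10 residues, 60 per line, so they need to be joined before any length or composition check. The following is a minimal Python sketch of such a check, not part of the original record. The helper names clean_sequence and gc_fraction are hypothetical, and it assumes the Start bp and End bp coordinates are inclusive, which agrees with the listed Gene Length (562102 - 561002 + 1 = 1101).

```python
def clean_sequence(blocks: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocks.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string (0.0 if empty)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence as printed in the record.
first_line = "ATGGCGCCTA GTAGCCTCGT GCGACTAGTG AAGCCGTATC TATATACCTT TCGGAGTAAC"
last_line = "ATCGCAAACG ATCCGAATTA T"

# The record prints 18 full lines followed by one partial line.
total_bases = 17 * 60 + len(clean_sequence(first_line)) + len(clean_sequence(last_line))

# Assumption: coordinates are 1-based and inclusive.
span_from_coordinates = 562102 - 561002 + 1

print(total_bases, span_from_coordinates)
```

Passing the full joined gene sequence to gc_fraction gives a value that can be compared with the GC content field, and the same clean_sequence helper applies to the protein sequence for checking it against Protein Length.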