Gene PHATRDRAFT_37105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37105 
Symbol 
ID7202106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp167793 
End bp169088 
Gene Length1296 bp 
Protein Length431 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181318 
Protein GI219121948 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGACTGT CTAGAGAGGA CGCCACAAGA ATCTTTCCTT CCGCCAACTT ACCACCAACA 
GATTTATCGA TCGAAGCATT AATGGTGGAC ACTGACGGTC AAAATAGTGC CGAGAGAGAC
TCCCACGCTA AGCAACAGGA GACATCTTGG ACCGACGAAT CGTATCCGAC ACCCAGATAT
TCCTCGCAGG CGGAACAGCA TCCCCGAAAC GAGAATGCTT CCATTTTTCC TGATCTGACT
ACGATGGACA TGCGTCAAAC AGTTGGAACC AGTACGGCCC ACCTTTCTGA TTGGATCAAC
GATAACATTA TCGCCGTCCG GTACGGGGCG ATGGCAACGA TCGGGCTCTT GACCGCTTAC
GGCCTTTCCC AAACACCGCT CTTTTTCCGT TATCGGTCGG TCGCCGAGCT TCCTAATTGG
CTCTTTGCGA AACGGAGATC TATTACTTGT CGTCTTATGC CACAGTCGCA TCGGCAAATC
TTACATCCCA ATCAACCCAT CCGGCTCTAT CTACGACACT TATCACCAGC GGAGCGAATC
CTACTAGCGA TTTCCAAGTC AACCTATGAA AAGGTTCTTT CTTGGCATCC ATGGAATACT
CTACCGCTAG AAACTTTGAG TGTTTCCTCC ACCGATCAAT CTGCACCAAC CGCCCGCCAA
CTAGAAGCGT TAGAGGTTGA AATTGCCGGC ATTCAGGCTG CACCCGAGCA CCTGGCTCTC
CGAGAAGCAC CGGGTGAATG GTTGGCACGG TTGTGCCGGG ACCGTACGAC AGTCTCACTC
CAGCTGATTG CGCGACGTGT GATGGATCAA GGTGATGACA CCAATGAGCA ACGCTTAGGT
ACAGCGAGTC GTATTGGCAT GCGCAAAAGC AAGCGTGATA TCCCCGAGCT GGATTTCTCA
AGCTCAAAAA CTACCCAATC CCAAATTGCT ATTGGTCGCT TGTACTATCG TCCCAAACTA
GCACAATTCC AGTCCACCGA CGTGGGCCTC TCTCTAGTCC GGTACGGTCG GGCAACACCC
GTATCGGATG GTTTGTGGAA GACGCTTCTG GATTCGCACG TCGTCGTAGA AAGCAACAAA
GATGAATCGG CACAGGAGGC GGCCAAACAA TCTTCGCGAG ATGCAACGTA CGCATCTCAG
TTGAGCCAAG CCGAGTATGA AGCAGCCGCT GGTTCCTACG GTATGTGGAC TGACGCGGAG
ATTCGTAAGA ATCGGGCAGA CATTGTCGAT GAAGCCGAGT TTCAAACAAC AGCTCCCGTT
TGGAAAAAAG CCTGGCGGTG GATACGAAGA AGGTAG
 
Protein sequence
MRLSREDATR IFPSANLPPT DLSIEALMVD TDGQNSAERD SHAKQQETSW TDESYPTPRY 
SSQAEQHPRN ENASIFPDLT TMDMRQTVGT STAHLSDWIN DNIIAVRYGA MATIGLLTAY
GLSQTPLFFR YRSVAELPNW LFAKRRSITC RLMPQSHRQI LHPNQPIRLY LRHLSPAERI
LLAISKSTYE KVLSWHPWNT LPLETLSVSS TDQSAPTARQ LEALEVEIAG IQAAPEHLAL
REAPGEWLAR LCRDRTTVSL QLIARRVMDQ GDDTNEQRLG TASRIGMRKS KRDIPELDFS
SSKTTQSQIA IGRLYYRPKL AQFQSTDVGL SLVRYGRATP VSDGLWKTLL DSHVVVESNK
DESAQEAAKQ SSRDATYASQ LSQAEYEAAA GSYGMWTDAE IRKNRADIVD EAEFQTTAPV
WKKAWRWIRR R