Gene PHATRDRAFT_44708 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44708 
Symbol 
ID7197932 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp1343927 
End bp1346871 
Gene Length2945 bp 
Protein Length790 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178409 
Protein GI219115227 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTACCGAAGA GTCAGGATGG GGGCAATGGG AAAGAAGGCT AGCCTATGTA TCTCAGTCTG 
GTGCAACATC AGTAGTAAAC GCGAGATTAG CCCTGGGGAG ACTGGCTGGA GACGCCGGTC
TTGAAGGTTC TCTTTTTGTC AGCGTCGGCA AAAGAGCACG GAAAGAAGGC ATTTTCAACA
TTGCGAATAA TTCATTTGCC CAGGCCGAGG CTGCAGTCGC CAATATCTCT GATTTCGGCT
CTTTGCGCAG TTCTCTGATA CTGGAAGTCG CTAAGCTCAA GCGTGACATG GGAGAGACCA
ACCTGGCCTT ACGAATACTC GAGCCCGCGG ATGTGGGAGA CCTGTTCGAT ACCAAGAGCG
ACCAGCTACA AACGGAAGTC ATCCGCAGAG CGTCTCAGAT GCTGCAACCC TCGGGGAAGC
TGCTAAACAA CGGAAGGTTT GATGCAAATG GCCTACAAGA GAAACGACTC GTTGACTTTT
TTGTTGAATG TGCGCTGCAA TCGACGAGAT GGATGATGGA TGGTGGATTA AAAGCAGGCG
CAGAAATAAT GTACCGATTT CGAACGATAC ATCGGATTGC ACCTTCCTTT GAAAAGGGTA
CGTTCCACTA TGTCCATTCA GCTCGCTTTT GAAGGCTTTT TTTCTCATAC GCAATTATTT
AGCCCACTTT CTATACGCCA AGTACCTCGA TTCTGTTCTA AAATCTCGAA TTTCAGCGTT
GCAAAGAAGA CAAACAGGTC AACTGCCGGG GTTAGATGAG GATGACTTAC GGAGTCGAAC
GATTGGAGCC GACGAAGCTT GCCAGCGAAA TGTTCTACGA ATAATGGAGC ACTATACACA
GACATTGTGT CTCAATTCGA TACATGTCTA CAATGCATTA CCACGATTGC TCTCCCTTTG
GTTCGATTTT ACGTCTATAC AAAGTAGTAT CGAGGACCGG GAAGTGCAAG GTAAGTTAAT
GCGATAGATG TGCGACTCGT TCTCTTGTCT AACTTTACGC TCCGTTCCGT ACAGGCAACT
TGAAGCAGAA TCAGGAGGAA GCCAATGTGT TTATGGGTAA TCATTTTAAA AAGATTCCTG
CGCAAGCCTT CTATACTGCT CTTCCTCAGC TGGTATCGCG TATAGTTCAT GTAGATTCTG
ACACAGCATC CGTCGTTCGA GGTATTTTGA AACGGGTTCT TACAAAGTTC CCGAAACAAG
CCATGTGGCC GCTCGCTTGG TTGCGACACT CAAAAGCTTT AGAAAGGAGA AAAGTAGGAG
ATAGCATATT TCAGGATGCC GAGAAGACGC TAGTGAAAGC TTCAAATCAA ACCCACTACA
GAGTCTTGAT GGCGTCAAAG GGTCTCTTCA AATTTTTCCA AGAACTTGCA AAGTACAAGA
ACTCGGATCT ATCCACTCAA TCAATAAATG TCAAGCCCTG GAAAGGCGAA GTAGATCTAA
CTGAATTTAT TCCTCCCGTT CAAGCAGCGT TGTCAGCTTC ACTGGACTCA TCTGAGAGCG
CATTTATGAG CGATCCGTTT CCTAGACAAG TTCCACGCAT GAGATTGTTC TCACAGCGCG
TTTCTGTGAT GAGCTCGAAG GCCCGTCCCA AGAAATTGAA AGCCTATGTT GTTGCAGCTG
ACTCTAGGCT GTCCTCAGCA TGTGCAAGTA ATGGAACGGA TCAAAATCTA CCCGACATTG
GTGAGATTCA TTTTCTTGTC AAACAGGAGG CAAAAGGTGA CCTTCGCAAG GACGCACGTG
TTCAGGAGCT AAATAATGTT ATCAACCGGC TAATGGCGAA TTCGAGGGAC TCAAAAGGAC
ATACTACGCA CAATAGACGG CACGGACTAA GAACCTTTGC TGTCACTTGC TTATCAGAAG
ACACAGGTCT GCTGGAATGG GTGCCCAATA CGTCTTCACT TCGAAGTCTC GTATCGGTAG
CGTACAATCC ACAGGCAAAC GCGTTCTCTT CTCGGCGACG TGGATCCCGC CTAGTTGCAA
TGAACGATCC AGTTTTGCGA GGAAATTTTG AGAAGAAATG CCAAGCAATG TATTTCTCAG
ACGGAAACCT ACGGAAAGCC GCTACTTTGT TTGAAGAACT CTGTCTCAGA CAATACCCTC
CTTTACTCTA TTGGTGGTTT GTACAAACGT ACTTGGATCC GCATTCCTGG TACGAAGCTC
GAATCAGATT TACATTGAGT GCTGCTGCTT GGTCGGCTGT CGGGCACGTG ATTGGTCTCG
GAGATCGACA TTCAGAGAAC ATTCTGGTCG ATGCATTGAA CGGGGAATGT GTCCATGTGG
ACTTCGATTG GTACGTTGAT CAAGGCAGTT TCGTACTTTC TCATTAAGTT GCCGACAGCC
TCATCATCTT CTTCAACTGA ATCAGCATTT TTGACAAAGG CCTACTGTTG CCTCGCCCAG
AAGTTGTTCC GTTCCGTTTA ACTGCAAATA TGGTGGACGC CTTCGGGCCG ACCGGGGTGG
ATGGCGTCTT TCGGAGTGGA TTAAAATCAG CCATGACTAC CCTTCGCGAC AATCGCGATA
CACTACTGTC CGTCTTGGAG CCGTTTGTCA AGGATCCCGT GATTGATTGG AAAAGATACC
GATCACATCA ACGCAACGAC GCGACACCGA CACAGGAACG TCCCGTAATG GAAATGAAGC
GATCAATTAA CGTGATTGAT GAGCGTTTGC AGGGCATCTA CAATTTAGGA AATCCAAACG
CGAAAAAAAT CCGAAGGACA GATGGGTTCA TCGATCAGGA AGACGACAAA ATAACCCAAA
TGTTACCTTT GTCTGTCGAA GGGCAAGTGC ACAAAATGAT TGCCGAAGCA ACGAGTAGCG
AAAACTTGGT TCAACTGTAT GTAGGATGGA TGCCGTGGGT ATAGCTCTTT TGGCAGCAAT
CAAAGACGAG AGAAATCAGT TTTGCTTCAC TTAATCTACC TAGCCTAGAA GCACAGCAAA
TTGTG
 
Protein sequence
MGETNLALRI LEPADVGDLF DTKSDQLQTE VIRRASQMLQ PSGKLLNNGR FDANGLQEKR 
LVDFFVECAL QSTRWMMDGG LKAGAEIMYR FRTIHRIAPS FEKAHFLYAK YLDSVLKSRI
SALQRRQTGQ LPGLDEDDLR SRTIGADEAC QRNVLRIMEH YTQTLCLNSI HVYNALPRLL
SLWFDFTSIQ SSIEDREVQG NLKQNQEEAN VFMGNHFKKI PAQAFYTALP QLVSRIVHVD
SDTASVVRGI LKRVLTKFPK QAMWPLAWLR HSKALERRKV GDSIFQDAEK TLVKASNQTH
YRVLMASKGL FKFFQELAKY KNSDLSTQSI NVKPWKGEVD LTEFIPPVQA ALSASLDSSE
SAFMSDPFPR QVPRMRLFSQ RVSVMSSKAR PKKLKAYVVA ADSRLSSACA SNGTDQNLPD
IGEIHFLVKQ EAKGDLRKDA RVQELNNVIN RLMANSRDSK GHTTHNRRHG LRTFAVTCLS
EDTGLLEWVP NTSSLRSLVS VAYNPQANAF SSRRRGSRLV AMNDPVLRGN FEKKCQAMYF
SDGNLRKAAT LFEELCLRQY PPLLYWWFVQ TYLDPHSWYE ARIRFTLSAA AWSAVGHVIG
LGDRHSENIL VDALNGECVH VDFDCIFDKG LLLPRPEVVP FRLTANMVDA FGPTGVDGVF
RSGLKSAMTT LRDNRDTLLS VLEPFVKDPV IDWKRYRSHQ RNDATPTQER PVMEMKRSIN
VIDERLQGIY NLGNPNAKKI RRTDGFIDQE DDKITQMLPL SVEGQVHKMI AEATSSENLV
QLYVGWMPWV