Gene PHATRDRAFT_49644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49644 
Symbol 
ID7198217 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011691 
Strand
Start bp292017 
End bp293641 
Gene Length1625 bp 
Protein Length516 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184437 
Protein GI219128473 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TGTCAGTATC CCGAAACCTC AAGCCAAGGA TTGACGACAA AACGTTCGAT TGTTTTGCTG 
CCAGGGAAAC CGGGATGATG GTTTCAGGAT CGACCTTTCT ATCAATATTT GTTGCATTTG
CTGCGAAAGA TTGTTTCGGC AGCAACCACT GTCGTTCTCC GGCTTTATGG AGTGTCGATG
CGTTCATCAC ATCGAATCGC ATGGTCCCAC TAAATTCCCA ACCCGCAGCC AAGTCCATGC
TCAAGGAACT TTACTTAGCT TCAAGAAAGC TAGAGTCTTC CGACACACCT CTAGATGATT
TAAACATAGC AATTATTGGG GCAGGACCTT CCGGCTTACT GCTGGCGCAC AAGGTTGCTA
GATTGGGAGC CAAGGTAAAA ATTTTCGAAG CCCGCTCTCG TCCTCAACCC GACAGCCTAG
AGCAAGGTCG AGCATACGCT CTGGGGGTGG GCATACGGGG GCGTACGGCC ATCCAAGCCG
TTGACGACGG TCTCTGGAAC GTTGTGGAAC AAGCCGGTTT TGGAAGTCAA CGATTTCAAC
TTCATGCCGG GCCACTAAAA ATGACTTTGC GTGACGAACA AGACAATGTG CAGAAGTCTG
TTCTTCTGTA CCAGACCGAT TTGTGTCGAG TTTTGGCCGA AGAACTGGAA TCTTGCTACA
ATGAGACGCA TGTTACGTTG GCTTACAGCA GCAACGTAGT TGGCGTAGAC TTAGACACCA
AACAAGTAAA GATTCGTGGA CTTGATTCCA TAGAAAAGGA GACACATTTT GACCTGATTG
TTGGATGTGA CGGAGTAAAC TCCATTGTAC GTCAAGAGCT TGTCGACTTC TGTCCTGCTT
TCAAGTCAGC CCGCACCGCG CTCCCTGGTG TTTTCAAAGT GGTACAACTC TCTGCCATGC
CGCCTGCTTT GGATCCCACA GCAGTTGCTT TGATATTACC AAAAGCTGGT GGAGTAACAG
CCTTTGTAGA GCCGACAATC AACAACACGT GCTGCATTCT TTTTGCTGGA CGAAATGAAA
CAGACCCCCT CCTTATTTCC AAAGATGAAA CAGAGCTATT CGAAACCATG CAATCCCGCT
TTCCGATTTT AAAAGGCGCC AACTTCAAGT CTGTAGCAAT TCAAATGGCT GCAATCGAAA
AACCATCCCA GGCGTCGAGT ATTGTTTGCA ATATGTATCA CTTCAATGGT ACAGTAGCCT
TGCTTGGGGA TGCTGCACAT GCTACAGGAG GTGTTTCTGG ACAAGGAGTG AACTCGGCTC
TGTGCGATTC TGTTGCCTTG GGTGAAAGTC TGCGAGACAA TTTTGAGCAT TGGAACAAGG
AGCAGTCCTT GCAAAATGCG CTTTTGGACT ATTCTCGAAA GCAAGTTCCA GAAGGCAGGG
CTTTGTATGA TCTGTCCTTT GGGCCAACCC CACAAGGAGT TCTCCAGCGC GTAAAATTAA
TGTTGAAGAA TGCACTTGAT TTTATTTTTC AAGGACGTTT CGGTATTGGC GACGTTCCGT
TGCAAACACT CCTAACAACA TCAACCAGAT CATTTGCAGA TATCCGCCGC GACAGAGAGG
CGATCTACTG CGAGCCGTTT CCGACTCAAG AAGAGTGGAA CAACAAGCTC ATTGGTAAAC
AATGA
 
Protein sequence
MMVSGSTFLS IFVAFAAKDC FGSNHCRSPA LWSVDAFITS NRMVPLNSQP AAKSMLKELY 
LASRKLESSD TPLDDLNIAI IGAGPSGLLL AHKVARLGAK VKIFEARSRP QPDSLEQGRA
YALGVGIRGR TAIQAVDDGL WNVVEQAGFG SQRFQLHAGP LKMTLRDEQD NVQKSVLLYQ
TDLCRVLAEE LESCYNETHV TLAYSSNVVG VDLDTKQVKI RGLDSIEKET HFDLIVGCDG
VNSIVRQELV DFCPAFKSAR TALPGVFKVV QLSAMPPALD PTAVALILPK AGGVTAFVEP
TINNTCCILF AGRNETDPLL ISKDETELFE TMQSRFPILK GANFKSVAIQ MAAIEKPSQA
SSIVCNMYHF NGTVALLGDA AHATGGVSGQ GVNSALCDSV ALGESLRDNF EHWNKEQSLQ
NALLDYSRKQ VPEGRALYDL SFGPTPQGVL QRVKLMLKNA LDFIFQGRFG IGDVPLQTLL
TTSTRSFADI RRDREAIYCE PFPTQEEWNN KLIGKQ