Gene PHATRDRAFT_49980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49980 
Symbol 
ID7198658 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011693 
Strand
Start bp495023 
End bp496769 
Gene Length1747 bp 
Protein Length473 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184812 
Protein GI219129261 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTAGATCACC TAGTCTCGCG GGTCGCTTCA ATTGACGTGA ATGGAAAGAC CTATCCAAAC 
GCCGGGAGGC GAGCCCGCAC CGCCGTGCTC CTCCTGGTTT GGATGCTCTG CTCTATCAAC
AGCCGGTGAC AATGCAGAAA ATCGAAGTTT TTAAGGCCTA CCTTCATTTC CTTGTATTCG
CTACTACACA GTAAATTTCA GCGTCAATCA GCGTAAAAGG ATAAAAAATA CATCAAGTGG
TCTAGCCTGA TAAGCCTAGT TTCTTCGGCG GTGGTCCAAA TTATGGCGCT ATTGCGGATC
GACCCAAACG TTTTAGGTCA AAGAAAGCGG AAAAAGATGT TGGCAATATT GGCTGTAGGG
ATGCTAATAT CAGGACTCCA GTTCGGATCA GTGCTCCATC TTTGGCGATT TCAAAATGTC
ATTTTCCCTG CCTCTCCAAA TTCTGATACG ATGCTTTACA CTGGAAGTAT TGTCACCAGT
CTTCCTCAGC CCAGTGAGTC GCCGCGACCA ATTGCGAACA ACGCCAATGA CTTAAATGGA
TCCTCTCTTG TATCGCAGCC ACACAATGCA ACAAACGTAG TCCCAAATGG GACTTCACTT
TGCGAGGAAT GTTGGAAGAT TGTACAAAAG GCCCCTCGCG TCAACCTATC TTTTGCCCCG
ATTCCGACGG ATAGTCATCC CTACATGGGA GCTCGCGATG CCAGAGGACA ATGGGGCTAT
GTCCATAACG TCTACAACTT GCAACAGAAT CCGCCACATC CGCAAGGGTG GTCGTTTACA
AAAGTCCTTG AAAATCGTAG ATACTGCGAC GTGCGCGACG ACCATTGGAC TGCCTTGCAG
CGCATCCAAT TACCCAAGTT GCCAAGGACT GTTTCCGAGC TAACGCCGGC ATCCGCAACG
AAGATCCTGT GTGCTGTGTA TAGTTCCGAG CCCTTTCACC ATAAACTCCA CGCAATTCGA
GAAACTTGGG CTCCCAAGTG TGACGGCTTT TTCGTGGCAT CGAACTTGAC AGATGCATCG
CTAGATGCCG TCGACATTCC GCACGAGGGG ATAGAATCGT ACAGAAATAT GTGGCAAAAG
GTACGGTCGC TGCTATCCTA CGCGTACGCA AACTACTACA ACGAGTTTGA TTGGTTTCAT
ATTGGTGGGG ACGATTTGTG GGTCATTGTC GACAATCTCA GGGAGTATTT GCACAGTGAC
GAGATTCGAA TCGCCGCCAA CGGTGGGATT GAATTCGGAA GCCATTCTGC CTTTCTAGAC
AATGAGACAC AAGTTCCGCT TCTTTTAGGA TGCCACTTTG CTCAAGGGGG CAAACTTTCA
CAGCTGTATA TAACCGGAGG ACCAGGCTAT ACACTGAACA AAGCAGCACT GAAACTACTT
GTCACAGAGG GAATGGACTA CTTTCAGCAC AAAATTACTT CGACGGAAGA CGTACTTGTA
TCTCGAATCT TCCGCACACT AAGTGTGGAT CCCTATCCGA CCCTAGATCC CGTCGGAGCA
GAACGTTATC ATCACTTTAC TCCTGGTCAG CACTTCAACG CAACCAGGAG GATGTATCAT
TGGTACCACG TCTGGAAACG GCCATTCCCG AAGAACCCCA TTGGTCCCAA CCATTCCTCA
ACACGGAGCG TGGCGTTTCA CTCGGTTAAT AGTGAAGACA TGCGGCATTT TCATGTTCTC
ACGGAAGGTC TCTGCAATTC ATAATAGACC ATGTAAAGTC AAATAGTGTA CACAGATTTT
CTAGAAA
 
Protein sequence
MALLRIDPNV LGQRKRKKML AILAVGMLIS GLQFGSVLHL WRFQNVIFPA SPNSDTMLYT 
GSIVTSLPQP SESPRPIANN ANDLNGSSLV SQPHNATNVV PNGTSLCEEC WKIVQKAPRV
NLSFAPIPTD SHPYMGARDA RGQWGYVHNV YNLQQNPPHP QGWSFTKVLE NRRYCDVRDD
HWTALQRIQL PKLPRTVSEL TPASATKILC AVYSSEPFHH KLHAIRETWA PKCDGFFVAS
NLTDASLDAV DIPHEGIESY RNMWQKVRSL LSYAYANYYN EFDWFHIGGD DLWVIVDNLR
EYLHSDEIRI AANGGIEFGS HSAFLDNETQ VPLLLGCHFA QGGKLSQLYI TGGPGYTLNK
AALKLLVTEG MDYFQHKITS TEDVLVSRIF RTLSVDPYPT LDPVGAERYH HFTPGQHFNA
TRRMYHWYHV WKRPFPKNPI GPNHSSTRSV AFHSVNSEDM RHFHVLTEGL CNS