Gene PHATRDRAFT_44287 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44287 
Symbol 
ID7198004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp129526 
End bp130620 
Gene Length1095 bp 
Protein Length364 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178433 
Protein GI219115275 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAGTA GGGCCGCAAA AGGAACCACT GGAGGGTTTT CGTTCGACTT TTTGTCGTCA 
GATGATAGAG CACCCTTTCC TGAGTCAAAC CACACGTCTC ATTCCACGGA GCTGTGCCAA
GATACTGACA GTGAACGTCG TCCTTTGGTG TGGATCAAAA ATGTCACAGA GCTTCTTTTG
GATCGATCGC AGGAAGAAAT TGTGTTTGAC GAGATTCCTT GGCCATCGAA TGATAAGAGC
GATGATGATG ACACGGTCGC AAACGAAGTG CTGCATACCT TGAAATGTTT GGCACCTGTA
CGTCGAGTAG ATCATCATTC GTCGTCATTC GTTGATCAGA GGGAAACTAC TGGCATCAAC
TTAGGCTTTG AGAGTCAGAT AGACACCTGG CAGAACACAG ACATAGAGCC GGGTGTTTAC
GAAGGCGGCA TGAAAGTGTG GGAATGTAGT ATCGACCTAG TTCGCTACCT TGCAACTCAG
GAGATTCGAC TGGATCCGAA CCAATTCGCA ATCGAGCTCG GATGTGGCCA TGGTTTGCCG
GCGTGCTATT TACTACGGGA AAGCTTACGG GCATCCCGCA GAGCAGATTT CAATGACGAT
GAGGCTTTTA AAATCATATT TACTGACTAC AACGACTATG TGCTCAAAGA CGTGACTATT
TCAAACATGT TCATCAACAT TGTTCAGCAA GTATCGAATG AAACCATCAA AGCGTCCGAT
GCCGACCTTA AGCGCGTGGG CGAAAGTGTT CTCCTCGGTG CCGGGGATTG GATGAACTTG
TCGCGGCAGT TGACAAACGC AGATGCAGGG GATCTGCCAC TACCCAAGGA TGGCCATTTC
GATTTAATTT TGGCAGCTGA GACGCTTTAT TCAGAGATAA CTGCACGTGA GACTGCACAA
TGGTTTAGTC GACACCTGAA ACCTAACTCC GGCGTTGGTC TGGTGGCGAG TAAGCGATAT
TACTTTGGCG TCGGTGGTGG CGTCGATACT TTTCGGATGA CGGCGCAGTC GCTCGATTTG
CTGGTGGAAA CGGTAAAAAT ATATGACAAC GGCTCTAGCA ACATTCGGGA ACTGCTGCGT
GTGCAAAAGG TATAA
 
Protein sequence
MASRAAKGTT GGFSFDFLSS DDRAPFPESN HTSHSTELCQ DTDSERRPLV WIKNVTELLL 
DRSQEEIVFD EIPWPSNDKS DDDDTVANEV LHTLKCLAPV RRVDHHSSSF VDQRETTGIN
LGFESQIDTW QNTDIEPGVY EGGMKVWECS IDLVRYLATQ EIRLDPNQFA IELGCGHGLP
ACYLLRESLR ASRRADFNDD EAFKIIFTDY NDYVLKDVTI SNMFINIVQQ VSNETIKASD
ADLKRVGESV LLGAGDWMNL SRQLTNADAG DLPLPKDGHF DLILAAETLY SEITARETAQ
WFSRHLKPNS GVGLVASKRY YFGVGGGVDT FRMTAQSLDL LVETVKIYDN GSSNIRELLR
VQKV