Gene PHATRDRAFT_23871 details

Gene Information

Locus tag: PHATRDRAFT_23871
Symbol: AAT_2
ID: 7199093
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011696
Strand:
Start bp: 64821
End bp: 66221
Gene Length: 1401 bp
Protein Length: 426 aa
Translation table:
GC content: 49%
IMG OID:
Product: aspartate aminotransferase
Protein accession: XP_002185116
Protein GI: 219129902
COG category:
COG ID:
TIGRFAM ID:
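
The reported gene length is consistent with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive, which is an assumption here rather than something the record states. The sketch below checks that arithmetic and, further assuming that the protein length excludes the stop codon, estimates how much of the gene span is not coding sequence. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Coding bases for a protein, assuming one extra codon for the stop."""
    return (protein_aa + 1) * 3


span = gene_length(64821, 66221)   # values from the record above
coding = coding_length(426)        # protein length from the record above
print(span)           # 1401
print(coding)         # 1281
print(span - coding)  # 120 bases of the span that are not coding sequence
```

Under these assumptions, a non-zero difference is what one would expect for a record marked as spliced, although the record itself does not say how those bases are distributed.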


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTCGAAAAAT CAGAGCTTCG ATTTTCACAC CGCAAGCAAC CATGCTGAAG CAAATCTCTC 
AATCGTTCCT TCGTCCCACA GCTCGATGTG TGTCAGCGTC CTCAAGGGTT CGATTCATGA
GCGCCAGTCC CTGGGCCGAC TATGAAATGG CACCCTTTGA CCCCATTATC GGCCTGAACG
AGGAGTACAG CAAAGATGAC TTTCCACAAA AAGTAATCGT TGGAGTTGGT GCCTACCGCG
ACGGAAACGG AAAGCCTTAT GTGCTTCCCT GTGTCCGGGA AGCAGAGAAG AAGATGATGG
AGCAGAACCT CGACATGGAG TATTCAGGTA TCGTACGTAT AAAGTGAACA CGTTTCCCAG
CATTCAAGCT ACGACTAAGA TGCGCTAACG ACACGTGTTT ATCTTGCAAA GGCTGGAGAT
GCCAAGTTCG TGGAATTAGC ACTCAAGTTT GGATACGGCA AAGACTCGAA ACCTCTAGGA
GAGAACCGCA TTCAAGGTGT GCAGGCACTT TCTGGTACAG GCGGCCTCCG CGTTATGGGG
GAGCTCCTGC GTAAACACGG CCACACCCAT ATTTATGTGC CGAATCCAAC GTGGGGCAAT
CACATTCCCA TTTTCGTCAA CTCTGGATTG GAAGTTCGTA AATACAGGTA CTATGATGCC
AAGAACTCGG ACCTCGACTT TGACGGTATG ATTACCGACA TCAAGGAAAT GCCGACGGGG
AGCACTGTAC TACTGCATGC CTGCGCACAC AACCCAACCG GTATGGACCC TACTTTGGAA
CAATGGAAAG AGCTCAGTGA TATAATCAAA ACGAAAAAGC TTCTGCCATT CTTTGATTGT
GCATACCAAG GGTTTGCTTC TGGAGATGCC AACATCGACG CTGCATCAGT CCGGATGTTT
GTCGAAGACG GACACCTTTT AGCAATGGTC CAGTCGTTTT CCAAGAACTT TGGTTTGTAT
GGTCATCGAG TCGGTACGCT ATCAGTTGTT GGCGAGTCCG AAGCGGAAGC CAAGCGTGTG
CAATCTCAGC TCAAAACCGT AATCCGGCCC ATGTACTCCA ACCCTCCGCG TCATGGTGCA
CGTATCGTTT CAACTATCTT GTCGGACCCC AAGCTTACCC AAGACTTCCT GATTCAATGC
AAGGAAATGG CAGACCGAAT TCATACCATG CGTGGATTGC TTCGCAGTAA CTTGGAGCAG
GCTGGCTCGA CACACAATTG GGAGCACATT ACTCGACAGA TCGGTATGTT TGCCTACAGT
GGCCTTTCGA AAGACCAGGT ATTGGAGATG CGTCACAAGC ACCATGTCTA CTGTACTGCG
GACGGTCGAA TTTCCATGGC GGGTGTAACT TCTGGAAATG TGGACTACAT TGCGCAAGCC
ATTCATGCCG TTTCCAAGTA A
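
The gene sequence above is laid out in space-separated groups of ten bases, so it needs to be joined before any length or composition check. The following is a minimal sketch, not the database's own method, of how the grouped text could be cleaned and its GC fraction computed for comparison with the GC content field. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a grouped sequence block by removing all whitespace."""
    return "".join(block.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "TTCGAAAAAT CAGAGCTTCG ATTTTCACAC CGCAAGCAAC CATGCTGAAG CAAATCTCTC"
seq = clean_sequence(first_line)
print(len(seq))                      # 60
print(f"{gc_content(seq):.1%}")      # GC fraction of this one line only
```

Passing the whole sequence block instead of a single line gives a length that can be compared with the Gene Length field and a GC fraction that can be compared with the reported GC content.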
 
Protein sequence
MLKQISQSFL RPTARCVSAS SRVRFMSASP WADYEMAPFD PIIGLNEEYS KDDFPQKVIV 
GVGAYRDGNG KPYVLPCVRE AEKKMMEQNL DMEYSGIAGD AKFVELALKF GYGKDSKPLG
ENRIQGVQAL SGTGGLRVMG ELLRKHGHTH IYVPNPTWGN HIPIFVNSGL EVRKYRYYDA
KNSDLDFDGM ITDIKEMPTG STVLLHACAH NPTGMDPTLE QWKELSDIIK TKKLLPFFDC
AYQGFASGDA NIDAASVRMF VEDGHLLAMV QSFSKNFGLY GHRVGTLSVV GESEAEAKRV
QSQLKTVIRP MYSNPPRHGA RIVSTILSDP KLTQDFLIQC KEMADRIHTM RGLLRSNLEQ
AGSTHNWEHI TRQIGMFAYS GLSKDQVLEM RHKHHVYCTA DGRISMAGVT SGNVDYIAQA
IHAVSK