Gene PHATRDRAFT_23059 details


Gene Information

Locus tag: PHATRDRAFT_23059
Symbol: AAT_1
ID: 7195539
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011689
Strand:
Start bp: 341906
End bp: 343370
Gene Length: 1465 bp
Protein Length: 435 aa
Translation table:
GC content: 52%
IMG OID:
Product: aspartate transaminase
Protein accession: XP_002183857
Protein GI: 219127260
COG category:
COG ID:
TIGRFAM ID:
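
The reported gene length follows from the start and end coordinates if they are read as 1-based and inclusive. That reading is an assumption, not something the record states, but it is consistent with the values above. A minimal sketch of the arithmetic, using a hypothetical helper named gene_length:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Start bp and End bp as listed in the record above.
print(gene_length(341906, 343370))  # 1465
```

The result agrees with the "Gene Length" field (1465 bp).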


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTCTCCATCT CGCTAACAAT CATATAATGA AGTTTGCCCT GCTTGCCGTG TGTGTCCTTT 
CCATTCACGT AGCTTGTGCG CTCTCGCTCA AGACTCCGGC GGCCACCGCG GCAACGGCAG
CCTCCTTATG GAAAGATCTC GAAGCTGGTC CACCGGACGC TATTCTCGGT ATCGCCCAGG
CCTTTCGGGC CTCCACCGAT CCCCGCAAGG TTAACGTGTG TGTGGGTGCG TACCGGGACG
CCGAAGGTAA TCCTTGGGTA CTTCCCTCCG TTCGAGCAGC CGAACAGGTG TTAATGGCGG
ACAACGACAA CAAGGAGTAC CTGCCTATCG AAGGTGACGC GGATTTTGTC AACAAGGCCC
TGGCGTTTGC CTACGGCGAC GAAATGGATG TTCATCGCAT AGCTGGAGTG CAAACGTTGA
GTGGGACTGG AGCTTGTCGA ATTGGGGGAC AGTTTTTGAG CACCTTTTTG CCTGGACGGA
CGATTTACAT TCCCACACCC ACGTGGTAGG TTGGCCTCGT TTTTCTCTTT CTCGTGGCGG
TGCTGTTTAT CAGCGCAAGG ATTGCCAACC CCTCACTTGT ACTCTTTTTT CTAGGGGAAA
CCATTGGAAA ATATTCGCTG AATGCGGTTT GCAAGCCGCA CCATACCGTT ACTACAATCG
TGCCACCAAC GCGCTGGATT TGGACGGATT GCTGGAGGAT TTACAGGAAG CCGAGGACGG
CTCTATTATT CTGCTGCATG CCTGTGCGCA CAATCCCACC GGCTGCGACC CTACATTGAA
AGATTGGCAA CGCATTGCCG ATGTTTTGGA AGAAAAATCG CATGTGGTCT TTTTCGATTC
GGCGTATCAG GGCTTTGCCT CGGGCGACGG CGAAAAGGAC GCCGCCGCCT TGCGCTACGT
GGTGAAGCGT GGGTTGCCCG TCTTACTAGC GCAGTCGTTT GCCAAAAATT TCGGACTCTA
TGGAGAACGC TGCGGGACCC TGTCGGTGGT TTGTGGAGAT GCGGATCAAA AGGACCGTAT
CCTGTCGCAA CTAAAGTGCA TCATTCGACC AATGTACAGT TCCCCACCGA AACACGGGAG
TAGCATCGTG AGGACGGTAT TGTCAGACGA GAAGCTGACA TCTCAGTACT ACAAAGAATG
CGCCACCATG GCGGATCGTA TTTTGGACAT GCGCACCAAG CTTGTAACCA AATTGTCGGA
AGTAGGCTCC AAGCATGATT GGTCGCACGT GACGGGCCAA ATTGGTATGT TTGCCTTCAC
CGGCATGTCC AAAGAAATGT GTGACCAGCT GACGAACGAA TACGAAATCT ATTTGACGAA
AGATGGGCGT ATTAGCATTG CGGGTTTGAA TGATCAGAAT CTCGAGTACG TGGCGAAGGC
TATCCACGCT GTCACAGATG GCCAGAGCAT TACTACCGCA TGAGCAATTG AACCCGTGTA
ATAAATAAAT TTTGGTAAAT AAGTG
 
Protein sequence
MKFALLAVCV LSIHVACALS LKTPAATAAT AASLWKDLEA GPPDAILGIA QAFRASTDPR 
KVNVCVGAYR DAEGNPWVLP SVRAAEQVLM ADNDNKEYLP IEGDADFVNK ALAFAYGDEM
DVHRIAGVQT LSGTGACRIG GQFLSTFLPG RTIYIPTPTW GNHWKIFAEC GLQAAPYRYY
NRATNALDLD GLLEDLQEAE DGSIILLHAC AHNPTGCDPT LKDWQRIADV LEEKSHVVFF
DSAYQGFASG DGEKDAAALR YVVKRGLPVL LAQSFAKNFG LYGERCGTLS VVCGDADQKD
RILSQLKCII RPMYSSPPKH GSSIVRTVLS DEKLTSQYYK ECATMADRIL DMRTKLVTKL
SEVGSKHDWS HVTGQIGMFA FTGMSKEMCD QLTNEYEIYL TKDGRISIAG LNDQNLEYVA
KAIHAVTDGQ SITTA
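
Both sequences above are printed in space-separated blocks of ten residues, so the grouping whitespace has to be removed before lengths or base composition can be compared with the reported values (1465 bp, 435 aa, 52% GC). The following is a minimal sketch of that cleanup. The function names clean_sequence and gc_fraction are hypothetical, and the example input is only the final line of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used to group residues in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (whitespace is ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Final line of the gene sequence as printed in the record.
tail = "ATAAATAAAT TTTGGTAAAT AAGTG"
print(len(clean_sequence(tail)))  # 25
print(gc_fraction(tail))          # 0.16
```

Passing the complete gene sequence to the same helpers gives a length and GC fraction that can be checked against the "Gene Length" and "GC content" fields.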