Gene OSTLU_3820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_3820 
Symbol 
ID4999417 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009355 
Strand
Start bp776288 
End bp777415 
Gene Length1128 bp 
Protein Length376 aa 
Translation table 
GC content55% 
IMG OID640414838 
Productpredicted protein 
Protein accessionXP_001415929 
Protein GI145341672 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID[TIGR01264] tyrosine aminotransferase, eukaryotic
[TIGR01265] tyrosine/nicotianamine aminotransferases 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTGATATCGC TCGCGCAAGG AGATCCGACG GTGTTTGGAC ACCTTTTGCC GCCGAAGACG 
GCGATGGATG AGGTGGCTGG GGCGTTCTCG ACGAGCGCGC ACAACGGGTA CACGGCGAGC
GCGGGTTCGG CGACCGCGCG GGCGGCGGTG GCGATGCGGT ATTCGTTACC CGATCGTCCA
CCGTTGAGAA CAGAGGACGT TTTCATGACA GTTGGTTGCT CCGAGGCGCT CTCACACTCG
TTCGCGGCGA TGGCGGTGGA GGGAGCAAAC ATTTTACTGC CGAGGCCCGG TTTTCCGTTG
TATGAAACTT TGTGTCATAG GCACGGTTTG GGATACAAGT TTTATGATTT AGACGACGAA
AATGGATGGG AAGTCAAGAT TGACGATGTT CGCAGGCTTC GGGACGAAAA CACGGTGGCG
ATCGTCGTGA ATAACCCGAG CAATCCTTGC GGCGCGGTGT TTAGTGAAGG TCACCTGCGA
GAAATTTGCG AGACTTGCCA CGAGTTGCGC TTGCCAATCA TCGCCGATGA AGTGTACGAA
GACGTCGCTT TCGATGAAGA CAGGCCGTTT CTGTCGATCG CAGCTTTTAG TGGTAGAGTT
CCCGTCATGG TGGTGAGTGC GTTGAGCAAG CGCTGGCTCG CGCCCGGATG GCGCATTGGT
TGGCTTGTCC TTCACGACTA CGATCATATT CTACAGACTG CAGGCGTGCA GCTTGCGATT
AACAACTTGT GTCAGGTGTC GTTAGGTCCG CCGACGCCGA TCCAAGCCGC GATTCCGGGA
ATTTTCAAAG CCAACGAGAC GGAGTGGCTA AAGGCTACGC TCGGCGTCTT GCGTCGCGCA
AGCCAGCGCT GCGTCGAACG CTGTGCGCGA GTTCGTGGTT TGACTGTTCC TTGTGAACCT
CAAGGAGCGA TGTATGTGCT GTTGAAAATG AATGGTGATG CGTTCAAGGA CGCAAATGGG
TTTTTCACTG ATGTCACCTT CGCCAAGCGC CTGCTTGCGG AGGAATCAGT ACTCGTGTTG
CCGGGCACGT GCTTTCACGC GCCCGGATAC TTACGTCTAG TGATTACAGT TCCAGATGAC
GAATTGCAGA ACGCGTGGGA TCGCATTGAG ACGTTTTGTG AACGTTAC
 
Protein sequence
LISLAQGDPT VFGHLLPPKT AMDEVAGAFS TSAHNGYTAS AGSATARAAV AMRYSLPDRP 
PLRTEDVFMT VGCSEALSHS FAAMAVEGAN ILLPRPGFPL YETLCHRHGL GYKFYDLDDE
NGWEVKIDDV RRLRDENTVA IVVNNPSNPC GAVFSEGHLR EICETCHELR LPIIADEVYE
DVAFDEDRPF LSIAAFSGRV PVMVVSALSK RWLAPGWRIG WLVLHDYDHI LQTAGVQLAI
NNLCQVSLGP PTPIQAAIPG IFKANETEWL KATLGVLRRA SQRCVERCAR VRGLTVPCEP
QGAMYVLLKM NGDAFKDANG FFTDVTFAKR LLAEESVLVL PGTCFHAPGY LRLVITVPDD
ELQNAWDRIE TFCERY