Gene OSTLU_27563 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_27563 
Symbol 
ID5005278 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009368 
Strand
Start bp517162 
End bp518702 
Gene Length1541 bp 
Protein Length512 aa 
Translation table 
GC content65% 
IMG OID640420699 
Productpredicted protein 
Protein accessionXP_001421486 
Protein GI145354427 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1362] Aspartyl aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0657659 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGATGCCGAC CGAGACGGCG CGAACGCACG CGCGCGGGCT CGTGGATTAT CTCAACGACG 
CGTGGACGGC GTACCACGCG ACGCGCGCGA CGTGCGAGGC GCTCGCGCGG CGAGGATTCG
TCGAGCTCGA CGAGCGCGCG ACGTGGTCGT TGGCGCGCGG GGGGAGGTAC TTTTACACGC
GGAACGCGTC GGCGGTGGTG GCGTTCGCGG TGGGGGGTGG ATACGAACCG GGAGATGGGT
TCGTGATCGT CGGCGCGCAC ACGGACTCGC CGTGTCCGAA GCTGAAGCCG AACACGCGCG
TGGAGGGGGG CGACGAGGTG CGCGTGCGCG TGCAGCCGTA CGGCGGCGGG CTGTGGCACA
CGTGGTTCGA TCGCGATTTA GGGATCGCGG GACGGGTGGT GGTGAAATCG TCGCACACGG
GGGAGATTTT GCATCGATTG GTGCGGATAG ATCGGGCGGT GTGTCGGATT CCGACGCTGG
CGATTCACTT GGATCGAAAC GTCAACAGCG AGGGGATGAA GGTGAACTTT CAGCAGCACA
TGGCGCCGAT TTTGGCGACG CGCGCGAAGG CCGAAGCGAA AGACGACGAC GAGGGTGGGG
AGAAAACGAC GGCGAGCGAC GGTAAGGGGT CGAGCGAACG GCATCACCCG CTGCTGCTGA
CGTTGCTCGC CAAGGAACTC GGGTGCGCGC CGGGCGACAT CGTCGATTTC GATCTACAGC
TGTGCGACAC GCAACCGAGC GCGATCGGTG GGGCGCAGAA TGAGTTCATT TACAGCGGCC
GTTTGGATAA CCTGGCGAGT TGTTACACAT CGTTGCACGC GCTGATGAAC GCCTCGACGG
ATGAGGCGTT GGCGGACGCG CGAGGCGTGC GCATGATTAT GCACTTTGAC CACGAAGAAG
TCGGAAGCGA GTCTTCGAGC GGCGCCGCGG GCGCGATGAC CACGGACGCG ATCAAACGCA
TCGCAGCTGC GCTGAGCCAA GGAAGCGTGG AAGGCTTGGA CGAGCGCACG CGCCGCGCGT
CGTTTTGCGT CAGCTCCGAC ATGGCGCACG CCTTGCACCC AAACTACGCC GATCGACACG
AACCGGCGCA CGCGCCGAAA ATGCACGGCG GCTTAGTCAT CAAGCACAAC GCCAACCAGC
GTTACGCCAC CGATGCCGTG ACGGCATTCA TGTTCCGCGA GATTGGCGAG CGCGCGGGCG
TTCCCGTGCA AGAGTTCGTC GTGCGAAGCG ACACCGGTTG CGGTTCCACC ATTGGGCCGA
TTTTCTCCAC CCGAACCGGC ATTCGCACCG TGGACGTCGG CGCCGCGCAG CTTTCCATGC
ACTCCATCCG CGAAGTCTGC GGCGCTGACG ACATAGACCA CGCCGTGAAG CACCTCACCG
CGGTTTACCT CCACTTTATC GATCTCGATC GCACCCTCAT AGTCGACGGA GCCATCGGCA
CGCTGTGTCG CCCGTGCGAC GTCTTCGAAT CGGCGACGTC CAAGCTCTCC CTCGACGTCC
GAGACGACGC CGCCGACGAC GCTCGCACGT ACGCCGAGTG A
 
Protein sequence
MPTETARTHA RGLVDYLNDA WTAYHATRAT CEALARRGFV ELDERATWSL ARGGRYFYTR 
NASAVVAFAV GGGYEPGDGF VIVGAHTDSP CPKLKPNTRV EGGDEVRVRV QPYGGGLWHT
WFDRDLGIAG RVVVKSSHTG EILHRLVRID RAVCRIPTLA IHLDRNVNSE GMKVNFQQHM
APILATRAKA EAKDDDEGGE KTTASDGKGS SERHHPLLLT LLAKELGCAP GDIVDFDLQL
CDTQPSAIGG AQNEFIYSGR LDNLASCYTS LHALMNASTD EALADARGVR MIMHFDHEEV
GSESSSGAAG AMTTDAIKRI AAALSQGSVE GLDERTRRAS FCVSSDMAHA LHPNYADRHE
PAHAPKMHGG LVIKHNANQR YATDAVTAFM FREIGERAGV PVQEFVVRSD TGCGSTIGPI
FSTRTGIRTV DVGAAQLSMH SIREVCGADD IDHAVKHLTA VYLHFIDLDR TLIVDGAIGT
LCRPCDVFES ATSKLSLDVR DDAADDARTY AE