Gene OSTLU_33570 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33570 
Symbol 
ID5003809 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp433607 
End bp434960 
Gene Length1354 bp 
Protein Length424 aa 
Translation table 
GC content61% 
IMG OID640419230 
Productpredicted protein 
Protein accessionXP_001419807 
Protein GI145350846 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.104047 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.023306 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCTCG CGTCGGTCGT CGAGGCGACG CTCGGACGCG TCGCCGCGCT CGGACGCACG 
GCCTCGAGCG CGTCGACGTC CGCGAGCGGC GTCGCGGACG CCGGCGCGGC GACGCTGCGG
CGACTCGGCG CGGCGAAGAC GCGCGGGACG TCGTCGCTCG ATGACTTGAA CGAACACGCG
AAAATCATCA CCGACGGCGT CGATTTCCGG CCGGTGACGG ACACGTCGTG CTTCGTCGAC
GCGTATCGCT CGAGCGACGC GGCGACGGCG GTGATGTTCG TCTTCATCAG CGGGTGCGTG
CTCGGGGCGG TGCCGCAGTA CCTGAAGGTG GTGGTGTTGG GGACTTCGGA GGGGCTGTCG
CTGAGCTCGT TGGCGCTGAT GAACGTGTCG AACGTGTGCG CGACGATGAA TGTGTTCATT
CTGCATTACG AGCAGATCCG ACGGTGCGTC GCGGGAGCGG CGGGGTACGA GTACGAGCGG
TGTCAGGCGT CGCTGTTGAC GCTGTATTAC ACGTTGATTT ACACGTTGCT GTGGATACCG
CTCTACCCGC TCGCGGCGCA CTTCACGAGC GATCGCAAGA CGGAATACTT TGGGTACGTC
ATGTCTAAGC GTAAGGCGGC GTGGTACGGG TTAGCGTTGT GGGCGGTGCC GTGCGCGTTG
CTCGCGGCGC CCGTCGCGAG GATGTTGTTC GGATCGACGT GTTTTGAGTT TGAACGTTAT
GCGATTTTCT TAGGGTTGAC GAACGCAGTT TTAGAAACCA CGCGATACGT GCCGCAGTTG
TGGGAGTCCG TACACTCCAA AGGTTCGGGG GCGATGAGTT ACATGCGATT AGCGCTGTCC
GTCGCGGGCG GGCTCGGGGC GACGATTCAA AAGGCGGTGA TGCACGAGTC TTGGTCCACG
TGGGGGCCTC CGCTCATCGG GCACGGATTG GAGATGGCTA TCTTCTGCGT GAACCTATTC
AACGACATGA CGCGCCGTCG CGAGCGCACG GACATGAGGA AAGAGGCTTT AGGATTAATG
CGCGATTCGG ACGACGACTA CGAAGACAGC CGCGACGACA TGGAGACGGA CGCCCACGCG
AAGCGCAAGG CTGCGATGTT AGAGTCAGCC GAATCCGCCG CGCGCACGGG ATCGCCCACG
AAATCCGAAT CAGACGTGGA GGATTGGGTG CAAAATATCC CCACCGAGGG CGGTTTCAAG
GCGAAGACGT CCTTCGTGTG GCATCGCGCG TGCACGGACA AGCACTTCTT CACGTCGTTG
GTGCGATACC TCTAGATTTC CCTCGTTCGG CGCGCGCGCG TCGAGTGCTC AACCATTTAG
TCATCGTCAA TATCAGTAGA TTCTCTAGCG ACTC
 
Protein sequence
MVLASVVEAT LGRVAALGRT ASSASTSASG VADAGAATLR RLGAAKTRGT SSLDDLNEHA 
KIITDGVDFR PVTDTSCFVD AYRSSDAATA VMFVFISGCV LGAVPQYLKV VVLGTSEGLS
LSSLALMNVS NVCATMNVFI LHYEQIRRCV AGAAGYEYER CQASLLTLYY TLIYTLLWIP
LYPLAAHFTS DRKTEYFGYV MSKRKAAWYG LALWAVPCAL LAAPVARMLF GSTCFEFERY
AIFLGLTNAV LETTRYVPQL WESVHSKGSG AMSYMRLALS VAGGLGATIQ KAVMHESWST
WGPPLIGHGL EMAIFCVNLF NDMTRRRERT DMRKEALGLM RDSDDDYEDS RDDMETDAHA
KRKAAMLESA ESAARTGSPT KSESDVEDWV QNIPTEGGFK AKTSFVWHRA CTDKHFFTSL
VRYL