Gene OSTLU_33194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33194 
Symbol 
ID5003171 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp536772 
End bp538881 
Gene Length2110 bp 
Protein Length672 aa 
Translation table 
GC content64% 
IMG OID640418592 
Productpredicted protein 
Protein accessionXP_001419414 
Protein GI145350003 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.274614 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCGCGCCCCG CGCGCGCGCC CCGACACCGA CCCGAACGAA AGCGCCGACG CATCGCGCGC 
GTCTCGATGC GTCGCGAGTC CCGCTGAGCG CATGCGCGAC GCCGATTCGC CGCGCGCGCG
CGACAGCGCG AACGTCACGA CGTCGGAAAC GACGCCCGCG CGCGCGCCCA AGCGCGCGCA
TCCCACCGCG GGATGGCTCG AGCTCGAACG CGCGCTGCGC GTCGTCTACG GCGGCGTCGT
GCGCGAGCTC GCGCCGCTGG GCGTGGTCGG GATGGCGACG TTTGACCCGA TGGCTGGGTT
CGTGCTGTGG AGCGAGCAGC GCGAGGCGGA ACGCGATCGA ACGAGGACGA TGCGAGAGCT
GTACCCGAAT GAGACGATGA AACCCGAAAG CCCGGGTGAA ATTGAACTCG CGTCGCTGCG
AGTGGCGGTG AAATACGCGA TCGGGGCGTA TGGGACGGCG TCGAGCGTGC TGAGCGACGC
GTCGTTTCGG GACAAATTGA AGAGTTTAAA GGCTGCGGGC GGCGGCGGTG AAACGCGCGG
AATGGGGGAT TACGAAGCGA TGCACGCGAA AGCGACGGAG GCATGCGCGA GGAGCTGCGA
GATCGAGACG AAAGATGTGG TGGATGCGGA GTGGAGTGGG ACGGAATTTT CGCCGAGCTC
GTTCGTCGCC GTCGATCGCG CCGCGGGCAA GGTGGTGCTT TCTGTGCGCG GGACGTGGGA
GTTTCACGAC GCGTTAACGG ACGTGAGCAG CGAAAGTGTG AAGTTCTTGA ACGGCTGGGC
GCACTCTGGG ATGGTGGCAT CGGCGTGGCA AGTGTTAAAG CGCATGCTTC CCGCCGTGGC
GCGTTCGATG CGCAAGCTTT CGGGATACGA GTTTCTCGTC ACCGGTCACA GTATGGGCGG
CGGTGTAGCC GCGTGCGTCG CGATGCTCAT GCATAGTACG GATAAAGATA TCGAGTCGCT
CGCGCTAGAG GGATTGAGCG ATGTCGTCGA GGAGGAGAGA AGAGAAATAT TGCGACGACT
GGCGTCGTGC ACGTGCGTGT GTATCGCGGC GCCGAGCGTG AGCAGCATGG ACTTGAGCGA
GGCGGCGAGC GATTACATCA CGTGCGTCGT CGCCGGCGCT GATGTTATAC CGCGATTATG
TCACGCTTCC GTTCGCCGAC TATTACGTCG GTTGAACCAC GCCGCACCGT CGCACGCGAT
GTTGCGCGCC GTATCCTCGG TGCTCGGCGG CAGGGACAGG CCGGCGTTGG AGCGAGAAGT
GAGCGCGACC GAGGAAGAAA TCACCGAGGT AGCGCAAGAT TTAGACAACG TTAGCAAAGA
TAGTGAGTTA GCAAGTGGCG CTGGAGCGCG CGACGTCTCG AAAGCTGCGT CGGTGTCGCC
TCCTAAGAAG CGCGGGGACA GTCGGCGCAA ATGCCAAGGC GCTTGGGGCG AGGTAGAGGG
CGTCGTCGGC CTCGAACTTC GTGATCACGC GGCGAGCGAT TTCATGGTTC AACCGGGACG
AGTTATCCAC CTCAAGCACG TGCGTTCAGA CGCACCGACT GCGGAGTATA AACACCCCAC
GGCGTTCACA GACGTCGTCC TCGACCCGTA CATGATGCTC GACCACATTC CGGGAACGTA
CCAAGCCGCG GTGAAGGCGA TTCACGATCG TGTCAAAGCC GGTGGACAAT CCTGGGTCGA
GCACGATGCT TCCAATCGCG ATTCCGACGA CGAAGACGAG GAAGACGCCG TGAGCGCGTT
TCTTCACGCC CAAGACGTCC GCTCCGGCGC GCGCCGAGGA TGGCGCTCCG TTCGCAACTT
CCTCGGCATC GACGGCGACG AAGACGACGA CGCGTTCAAG CGCGACGCCC ACGACGCCGG
CGCCACCGCC CCCGGTCCAC GACCATCCCA TCACGTTCCC GACATCGGCA CCGATACCGA
CGACGACGAC TCCATCCCCG ACTTTTACGG CGAAGACGCT CGCGCTCGCC GCACGCACCA
TCCTCGCCGC GCGTCCGCGC CCGCCGACGA CGCCGCGCCC ATCGCTCCCA TCGCGGACGC
CGTCGACGCC AATCCTTTCA CGCGCGCTTG GGACTGGCTC AAGCGTCAGG CCGCCGACGA
CGACGCGTAG
 
Protein sequence
MRDADSPRAR DSANVTTSET TPARAPKRAH PTAGWLELER ALRVVYGGVV RELAPLGVVG 
MATFDPMAGF VLWSEQREAE RDRTRTMREL YPNETMKPES PGEIELASLR VAVKYAIGAY
GTASSVLSDA SFRDKLKSLK AAGGGGETRG MGDYEAMHAK ATEACARSCE IETKDVVDAE
WSGTEFSPSS FVAVDRAAGK VVLSVRGTWE FHDALTDVSS ESVKFLNGWA HSGMVASAWQ
VLKRMLPAVA RSMRKLSGYE FLVTGHSMGG GVAACVAMLM HSTDKDIESL ALEGLSDVVE
EERREILRRL ASCTCVCIAA PSVSSMDLSE AASDYITCVV AGADVIPRLC HASVRRLLRR
LNHAAPSHAM LRAVSSVLGG RDRPALEREV SATEEEITEV AQDLDNVSKD SELASGAGAR
DVSKAASVSP PKKRGDSRRK CQGAWGEVEG VVGLELRDHA ASDFMVQPGR VIHLKHVRSD
APTAEYKHPT AFTDVVLDPY MMLDHIPGTY QAAVKAIHDR VKAGGQSWVE HDASNRDSDD
EDEEDAVSAF LHAQDVRSGA RRGWRSVRNF LGIDGDEDDD AFKRDAHDAG ATAPGPRPSH
HVPDIGTDTD DDDSIPDFYG EDARARRTHH PRRASAPADD AAPIAPIADA VDANPFTRAW
DWLKRQAADD DA