Gene OSTLU_30851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_30851 
Symbol 
ID5000807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp865978 
End bp867123 
Gene Length1146 bp 
Protein Length381 aa 
Translation table 
GC content62% 
IMG OID640416228 
Productpredicted protein 
Protein accessionXP_001417078 
Protein GI145345135 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00449336 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGCGGAGG CGTGCAAAAG AACGCGCGAC GACGAAGATG GACTCGGGAG TGGCGCGTGC 
GCGTTCGCGT ACGCGCGCGC GAAGGCGACG ACGGGGTGCG TGTTCGTGGA CGACGGCGGC
GCGGGATCGA AGACGCTGTA CGAGGCGCTG CGACCGAGCG CCACGGCGAG ATTAGCGGAG
ACGCTGTCGG CGTCGAGCGC GGGCGTGGAC GCGGACGCGA AGGCGCGAGA TGAACTGCAG
GCGGGATTGC ACTGGGAGTT TTTGCGCGCG CATCCCGAGA CGCGGCCGTT GTTGGGGATC
ACGGGCGCGG ACGCGAGGAA AGGATTACCG GGAAACGGCG ACGACGACGA GGCGCCGGAG
ACGGCGTCTA TGTGTGGCGT CGTCGCCGCG GTCGTGGCGA GTAAGATATT GAGTTATTTG
GACGCGAGTG AAGCCGTCGC AGTGGCGCGC GAGTGCGATC AGTGGACGCA CGTGGAGGCG
TTGGCGTTGT ACAAGCCCGA TCGCCCGGAC TCGGCGCCCG TGCCAAACGC GTCCGAATAC
GAGGTTCTTC CGAGCGGGGA CGATATGGAT GCCGACGACG CGCAAGACGG TGATGAAGAT
CGCGAGGGTT TCGATACGCA CGCGAGACTT TTCGGCTACG ACGCGTTGGC TAAGCTGAGA
GAGATGTACG TCCTTGTGTG CGGCACGGAT TCGCTCGCCA ACGACGCGTG CGTCTGCGCG
CTTGCCGCCG TCGGCGTCGG TAACGTCGAC GTATACGGGG CGAGTGGATC AAAAGTATTT
GTTCGGCACG ATTGCGAAGT GGACGATTTG TGCGATTTAG ACGATTTAGA AACGTACAAA
TACCACGTCG TGGTTCGCAC GAGCGCGTGC GCCACCGCCG ACGAAGTCGT CGCCATCGCG
AGAGAGGCAA AATCGCCGGT GATTGAAATT ACGAGTACGA GCGTGGGGTC GTGTGTTGTG
GATGTTAGTT TAGGCACGGA CTCGTCCTTC ACGCGTGCGT CGTTCTCGGG ATGGATGGAT
GCGCCGACGG CATGCGTCGC CGCACACATC GCCGCGATGG AAGTAGTGCG GATAGCGCAA
GACCGAAGAC GCGCGACGAC GATAGTGTTC GACGGGAAAG GGATATTTAC CAAGGCAAGG
ATGTAA
 
Protein sequence
MAEACKRTRD DEDGLGSGAC AFAYARAKAT TGCVFVDDGG AGSKTLYEAL RPSATARLAE 
TLSASSAGVD ADAKARDELQ AGLHWEFLRA HPETRPLLGI TGADARKGLP GNGDDDEAPE
TASMCGVVAA VVASKILSYL DASEAVAVAR ECDQWTHVEA LALYKPDRPD SAPVPNASEY
EVLPSGDDMD ADDAQDGDED REGFDTHARL FGYDALAKLR EMYVLVCGTD SLANDACVCA
LAAVGVGNVD VYGASGSKVF VRHDCEVDDL CDLDDLETYK YHVVVRTSAC ATADEVVAIA
REAKSPVIEI TSTSVGSCVV DVSLGTDSSF TRASFSGWMD APTACVAAHI AAMEVVRIAQ
DRRRATTIVF DGKGIFTKAR M