Gene OSTLU_38141 details

Gene Information

Locus tag: OSTLU_38141
Symbol:
ID: 5004228
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009364
Strand:
Start bp: 28190
End bp: 29311
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table:
GC content: 49%
IMG OID: 640419649
Product: predicted protein
Protein accession: XP_001419879
Protein GI: 145351005
COG category:
COG ID:
TIGRFAM ID:
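
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence on this page ends in TAG). The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 28190
end_bp = 29311
gene_length_bp = 1122
protein_length_aa = 373


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


print(span_length(start_bp, end_bp) == gene_length_bp)              # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Here 29311 - 28190 + 1 = 1122 bp, and 1122 / 3 - 1 = 373 aa, matching the listed values.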


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAATA TACTCAAAGC GTTGGACGCG AATCCGAAAT TAAAGGAGGA TTACGTGAGC 
GAATCGACGT CTGGGGTGAT CACTACTTTG GTGTGCGCGG CGTTGTGTTT GATTTTGTTC
TTTGGCGAAT TCTTCTCGTA CAAGACGACG AAAATCGTGA GCGAATTGAG AGTAAATCCG
CTCGGTGTTC ATCAAACGGT GCCCAACGCG GAAAGACTGA AGATTGACGT CGATATTACC
TTTCACAGTC TGGCTTGCAA TCTCATCACG CTCGACACCT CGGATAAAGC CGGAGAAGAG
CACTACGACG TGCACGATGG TCACATCGAA AAGAGGAGGA TAGACAAGCA TGGGAAAGTG
ATTGATGCTG CGTTTACTTC AGAAAAGCCA AACAAACACA AGGAGATTGA GCAAGCGCTG
CAAAAGATGA ACGAGACCGA CTCCGCACAC GCCGCCGACT CTCATGCCAT GGAGCACGTG
CAGCCGTTCG GTGGTATGTT TGGTCTACAA AGTTTATTGC AAGAAGTGTT TCCAGAGGGC
GTGGAGCATG CGTTTAGAAA CGAGAATCAA GAAGGGTGCG AGGTGAAGGG TTACCTTGAA
GTGAATCGGG TACCGGGACG GTTTTCCATT TCGCCGGGAC GTTCGCTCAT GATGGGGATG
CAAATGGTCA AGCTAAACGT GCAGACGGCA TTAAATTTAA CGCATACGAT TCACAGGCTG
TCATTTGGGG AAAGCTTTCC CGGTTTGGTG AGTCCACTCG ACGGAACGCA CCGCTCACTT
CCGCCGAACG CGGTGCAGCA ATATTTTCTT AACGTTGTGT CGACGACATT CGAGCCTTTG
GGAGAGAACA AAATCATCAG CACTCATCAG TATAGCGTTA CTGAAACTTT CACAAGCTCA
CAGCGATCAA TTATGGGGAC GTCCAACGGC CGTGATCCGG GCGTCATCTT TACTTACGAA
ATATCGCCGA TTCGCGTCGA CTTCAAAGAG ACTCGCACGT CGTTTGGTGC ATTCGTCCTG
GGTATCTGTT CCGTCATCGG AGGCGTCGTC ACTATGGCGG GTATCACGCA AAATGCCGTT
GAGTATATTA TTTCTAATCG CAAGACCCTC TTCGCGTCAT AG
 
Protein sequence
MTNILKALDA NPKLKEDYVS ESTSGVITTL VCAALCLILF FGEFFSYKTT KIVSELRVNP 
LGVHQTVPNA ERLKIDVDIT FHSLACNLIT LDTSDKAGEE HYDVHDGHIE KRRIDKHGKV
IDAAFTSEKP NKHKEIEQAL QKMNETDSAH AADSHAMEHV QPFGGMFGLQ SLLQEVFPEG
VEHAFRNENQ EGCEVKGYLE VNRVPGRFSI SPGRSLMMGM QMVKLNVQTA LNLTHTIHRL
SFGESFPGLV SPLDGTHRSL PPNAVQQYFL NVVSTTFEPL GENKIISTHQ YSVTETFTSS
QRSIMGTSNG RDPGVIFTYE ISPIRVDFKE TRTSFGAFVL GICSVIGGVV TMAGITQNAV
EYIISNRKTL FAS
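
As a quick consistency check, rows of the gene sequence can be translated and compared with the corresponding rows of the protein sequence. The Translation table field is empty on this page, so the sketch below assumes the standard genetic code. The helper names `clean` and `translate` are hypothetical.

```python
# Standard genetic code (an assumption; the page lists no translation table).
# Codons are ordered TTT, TTC, TTA, TTG, TCT, ... with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq_text):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq_text.split()).upper()


def translate(seq_text):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = clean(seq_text)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        index = (BASES.index(codon[0]) * 16
                 + BASES.index(codon[1]) * 4
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence as printed above (60 bases, 20 codons).
first_row = "ATGACGAATA TACTCAAAGC GTTGGACGCG AATCCGAAAT TAAAGGAGGA TTACGTGAGC"
print(translate(first_row))  # MTNILKALDANPKLKEDYVS
```

The output matches the first 20 residues of the protein sequence (MTNILKALDA NPKLKEDYVS). Passing the full gene sequence to `translate` would allow the same comparison over all 373 residues.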