Gene OSTLU_44173 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_44173 
Symbol 
ID5004528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009365 
Strand
Start bp457349 
End bp458800 
Gene Length1452 bp 
Protein Length483 aa 
Translation table 
GC content52% 
IMG OID640419949 
Productpredicted protein 
Protein accessionXP_001420513 
Protein GI145352351 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.844643 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGAGC GGGCAAAAGC GCTCCTGGAC GTGCGCACAG AGGAGGCGGA GGCCTTGTTG 
AGTCACTTCT CGTACGATTT CGAAGCCGCC GCGACGGCGT GGTTCGAGGA CACGAGAAAG
GTGCGCGAGA CGTCGGGGTT GATCGATGCA AAGACGAGAC GCGAAAATAG CGAAGCAGCG
ATGTCGTCGG GTGGAACGCG AGGGTGCGGG ATTTGCTTCG AGGACTTTCC AGGGGATGCT
TTGACGACGG TCGGGTGCGC GCATGAATTT TGCGACGAGT GTTGGTCGGG ATGGGTGACG
AGCAAGGTGA ATGATGGGCT TTCCGTGGTC AACACGCGGT GTCCTATGTG TCCTGCCAAA
GTCCCCGAGT CCATGATTCG AAAGTTTCTT AGTGATGAAG ATGAAACGAA GTTTGATACA
TTCTTGCGGC GGTCGTTTTT GGAAAACAAC GCCAAGTTGC GCCCTTGCAT TGGCGTCGAT
TGCGAATGTG CCATCGCCGT CGAGCAACTG CCGACCAATC CCGTGAGTGT GAAATGCAAC
TGTGGTGCCG AATTCTGCTT TTCGTGCCAG AGTGAGCCCC ACGTGCCTGT GAATGATTGT
GAAGTCGCGA AGAAGTGGAT GGACAAAATC AACTCCGACG GTGTGAACTC GGAGTGGATG
CTAGCTAACA CGAAGGGATG TCCGAAGTGT CATCGACCGA TCTTGAAGAA TGGAGGATGT
ATGCACATGC ACTGCTCACA GTGCCATTGC TCGTTTTGCT GGCTTTGTCT CGGACCTTGG
GATTCCGGGC CGTACGCCTG CGCCAGACGC TGCAACAAAT ACAGTGGAGA CAAAACCGGC
GACGAAAACA GGCGGAAACG AGCCAGAGAT TCTCTCGAGC GCTACGTGTT CTACTATGAA
CGCTATAGAG CGCACGAGGA TGCGAGTAAA AAAGCCGAAC AAGACGTCGA GAGATTCAAA
GACAGCGTGC TTGACATATT GATCGATTTA CAGCGTACGT CCAAGCAACA AGTTGTTTTC
ATCATGGATG CGCTCAGGCA AGTGACCGAG TGCAGGAAAA TTTTGAAATG GACTTACGCG
TACGCGTATT ACGAATTTGC CGACGATCAG AGCAAGAAAG AGTTCTTTGA GTACATTCAA
GGTGACATGG AGCGTTGTCT CGAGCTCCTG TCTCGCATGA TTGAATCAGA CATCAAACCA
TTCCTTCCGC CAGAGCCGGA AGATGATGAA CAGAAACAAA ACGTGTCGCC GCCGTCGACG
CTAACTGATG AACTTCAAGA TGGGAAATAC CAGTACGCAC CCGAAAAGCA AGAGAGTCTG
GAAAACGACT TTGCCCTATA CAAAGCTCGA CTCATCGACA CCACTGCCGT GTTGCGCAAG
TTCACGGATA CGTTGGTTTC GGAAATGGCT AAAGGTTTGC TCGGCGCTAG AAACATAGAC
AAAACTGATT GA
 
Protein sequence
MIERAKALLD VRTEEAEALL SHFSYDFEAA ATAWFEDTRK VRETSGLIDA KTRRENSEAA 
MSSGGTRGCG ICFEDFPGDA LTTVGCAHEF CDECWSGWVT SKVNDGLSVV NTRCPMCPAK
VPESMIRKFL SDEDETKFDT FLRRSFLENN AKLRPCIGVD CECAIAVEQL PTNPVSVKCN
CGAEFCFSCQ SEPHVPVNDC EVAKKWMDKI NSDGVNSEWM LANTKGCPKC HRPILKNGGC
MHMHCSQCHC SFCWLCLGPW DSGPYACARR CNKYSGDKTG DENRRKRARD SLERYVFYYE
RYRAHEDASK KAEQDVERFK DSVLDILIDL QRTSKQQVVF IMDALRQVTE CRKILKWTYA
YAYYEFADDQ SKKEFFEYIQ GDMERCLELL SRMIESDIKP FLPPEPEDDE QKQNVSPPST
LTDELQDGKY QYAPEKQESL ENDFALYKAR LIDTTAVLRK FTDTLVSEMA KGLLGARNID
KTD