Gene OSTLU_3430 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_3430 
Symbol 
ID5005031 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009367 
Strand
Start bp170652 
End bp171851 
Gene Length1200 bp 
Protein Length400 aa 
Translation table 
GC content57% 
IMG OID640420452 
Productpredicted protein 
Protein accessionXP_001421073 
Protein GI145353550 
COG category[I] Lipid transport and metabolism 
COG ID[COG3239] Fatty acid desaturase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.00816557 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000371871 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GAACGGCGGT ACGTGACGAT CGAAGGCGTG GAATACGATG TGACGGATTT TAAGCATCCC 
GGAGGATCGG TTATTTATTA CATGCTGTCG AACACGGGAG CGGACGCGAC GGAGGCTTTT
AAAGAGTTTC ATTATCGGTC GAAAAAGGCG CGCAAGGCGT TGGCGGCGTT GCCGCATAAG
CCAGTGGACG CGGCGACGCG GGAACCGATC GAAGATGAGG CGATGCTGAA GGATTTCGCG
CAGTGGCGCA AGGAATTGGA GCGTGAGGGA TTTTTTAAGC CCTCGCCGGC GCACGTGGCG
TATCGATTCG CCGAGCTCGC GGCGATGTTC GCGCTCGGCA CGGCGTTGAT GCACGCGCGT
TGGCACGTCG CTTCCGTGAT CGTGTACTCG TGTTTCTTCG GCGCGCGATG CGGTTGGGTG
CAGCACGAGG GTGGGCACAA TTCGTTGACT GGAAACATTT GGTGGGACAA GCGAATCCAA
GCCTTCGCCG CGGGGTTCGG CTTGGCGTCG AGTGGCGACA TGTGGAACAA CATGCACAAC
AAGCATCACG CGACGCCCCA AAAGGTGCGA CACGATATGG ATCTCGACAC CACTCCCACG
GTGGCGTTCT TCAACTCCGC GGTTGAAGAA AATCGCCCGC GGGGATTCAG TAAGTTGTGG
TTGCGCCTTC AAGCGTGGAC CTTCGTGCCC GTGACGTCCG GTATGGTTTT GTTCTTCTGG
ATGTTCGTCT TGCACCCGCG TAACGCGCTG CGACGCAAAA GCTTCGAAGA AGCGGCTTGG
ATGTTTTCCG CGCACGTCAT TCGCACGGCG GTTATCAAAG CCGTCACCGG CTACTCCTGG
ATCGCCTCGT ACGGCTTGTT CGCGGCGACG ATGTGGGCGA GCGGATGTTA CTTGTTCGCG
CACTTTTCCA CGTCTCACAC GCACTTGGAT GTCGTGCCGA GCGATAAACA CCTCTCGTGG
GTGCGATACG CCGTCGATCA CACGATCGAC ATCAATCCGA ACAACAGCGT CGTCAACTGG
TTGATGGGCT ACTTGAACTG CCAAGTCATC CATCACCTGT TCCCGGATAT GCCTCAGTTC
CGCCAACCCG AAGTCTCCCG CCGATTCGTC CCGTTTGCGA AGAAGTGGAA CTTAAACTAC
AAGGTCTTGA CGTATTATGG GGCCTGGAAG GCGACGTTCG GCAACTTGAA CGACGTCGGG
 
Protein sequence
ERRYVTIEGV EYDVTDFKHP GGSVIYYMLS NTGADATEAF KEFHYRSKKA RKALAALPHK 
PVDAATREPI EDEAMLKDFA QWRKELEREG FFKPSPAHVA YRFAELAAMF ALGTALMHAR
WHVASVIVYS CFFGARCGWV QHEGGHNSLT GNIWWDKRIQ AFAAGFGLAS SGDMWNNMHN
KHHATPQKVR HDMDLDTTPT VAFFNSAVEE NRPRGFSKLW LRLQAWTFVP VTSGMVLFFW
MFVLHPRNAL RRKSFEEAAW MFSAHVIRTA VIKAVTGYSW IASYGLFAAT MWASGCYLFA
HFSTSHTHLD VVPSDKHLSW VRYAVDHTID INPNNSVVNW LMGYLNCQVI HHLFPDMPQF
RQPEVSRRFV PFAKKWNLNY KVLTYYGAWK ATFGNLNDVG