Gene OSTLU_40179 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_40179 
Symbol 
ID4999402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009355 
Strand
Start bp459700 
End bp460827 
Gene Length1128 bp 
Protein Length376 aa 
Translation table 
GC content54% 
IMG OID640414823 
Productpredicted protein 
Protein accessionXP_001415501 
Protein GI145340791 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0156] 7-keto-8-aminopelargonate synthetase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.10614 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTACTTA GCTCGCAAGA TTTCTTAGAT TTGACCCTCG ACGAAACCAT GCGATCTCAG 
TGCGCTGAAA CTATTCATCG CTATGGTCTC GGATCGTGCT CACCTCGTGG TTTTTATGGC
ACATTCCGTC CGCACATGGA TTTGGAAGCC AAGATTGCGA AATTTCTCGG CGTGGGTGAA
GCTGTGCTTT ACTCTTTCGG CGTTTGCACT GCGTCCAGTG TCATTCAAGC TTTAGCGTCA
AAAAGTGATG TGGCTGTGGT CGATCGAGGC GTTGGACCGA GTATCATCGC TGGTTTACGT
TTGGCGAAGC TCGAAATTAG ATGGTACAAT CACGCCGATC CTGCTGATGC GGCGCGTGTT
TTTGCGCAAA TTGAAACCGA AGATGGTTCC ACGTCTGCGA GGCTCACCCG ACCTGTCAGA
CGTCGATGGT TGATTACTGA AGCATGTTTT GGCTCCACCG GTCGATGTGC GCCTCTCCGT
GAACTTGTAG CTTTGAAGGA TCATCACCAT GCACGAATGA TTCTCGATGA GTCGTTCTCC
TTTGGCGCCA TGGGTGAAAG TGGTCGTGGT CTGATTGAAC ACGTTGGACT ACCCAGCAGC
TCTGTTGATG TCATTTGTGC TTCGTTGGAG AACGCGTGCG CATCTGTGGG TGGTTTTGTG
GCCGGAGATA CGGGAGTCGT GGCTTACCAA CGCTTGATGG GAAGTGGTTA CGTCTTCTCA
GCGTCCTTAC CGCCCTACCT CGCGACGGCT TCTTTACACG CCATCAGCCG CATCGAAGCT
GAACCGGCCA TGGTTGAAAA GCTTCATGAC GCGGCGCGAC GTACTCGCAG CGCACTCGTC
AGTGGAGACA TTCCCGGTAT GACTACTGAT GCAGATGCCG ACTCGCCAGT CATCCCCGTC
AAGCTCTCGG CCGGCGTTGG GAGCGGGGAC GAGAACATGC TTCTGCATCG CATCGCTGCT
CGTATGCGAA GTAAAGGATT TGGTGTGTGC GTGGCTCGAG TCAGTCCTGT CATTTTACCG
TCTCACCGCC CCCCGCCGTC CCTTCGTCTA TATGCGCACG CCAGTCACAC GGCGGACAAG
ATTGACAAGA TGCTCACAGT GCTTCGAGAT GCTGCGTTGG ATATCCTC
 
Protein sequence
MVLSSQDFLD LTLDETMRSQ CAETIHRYGL GSCSPRGFYG TFRPHMDLEA KIAKFLGVGE 
AVLYSFGVCT ASSVIQALAS KSDVAVVDRG VGPSIIAGLR LAKLEIRWYN HADPADAARV
FAQIETEDGS TSARLTRPVR RRWLITEACF GSTGRCAPLR ELVALKDHHH ARMILDESFS
FGAMGESGRG LIEHVGLPSS SVDVICASLE NACASVGGFV AGDTGVVAYQ RLMGSGYVFS
ASLPPYLATA SLHAISRIEA EPAMVEKLHD AARRTRSALV SGDIPGMTTD ADADSPVIPV
KLSAGVGSGD ENMLLHRIAA RMRSKGFGVC VARVSPVILP SHRPPPSLRL YAHASHTADK
IDKMLTVLRD AALDIL