Gene NATL1_05021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_05021 
SymbolcyoE 
ID4780948 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp457511 
End bp458506 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content41% 
IMG OID640083777 
Productprotoheme IX farnesyltransferase 
Protein accessionYP_001014329 
Protein GI124025213 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0109] Polyprenyltransferase (cytochrome oxidase assembly factor) 
TIGRFAM ID[TIGR01473] protoheme IX farnesyltransferase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.196285 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTAGCT CTACATCAGA ATTAATTACA CAGCCTGTTA ATCGTGAGGA AGTAGTTCCT 
TCTAGAAAGC GTATTAAACT GCCAGCCTGG CTAGAAGTCG TCAAACCAAG ATTAATTCCT
CTTTTACTCG CAACAACTGT TGGAGGAATG GCTCTCTCGG AGGAATGGCC TTTACCCTCT
CCAAGATTGG CATGCACTTT AGGAGGAGGA GCCCTTGCTG CTGCGGCTGC AGGAGCTCTT
AATTGCTTAT GGGAACAAGA TCTTGATAAG CGAATGAAAC GAACCAGTAA TAGAGCATTA
CCTTCTGGTC GCCTTTCTCA GTCTTCTGTT TTTATTGGAG CAGTTGCTTG TACACTTGTT
TCTTCAGCCC TTTTAGTAAG TGGAGTTAAC TGTTTAGCAG CGGGACTTAC TCTTTTAGGA
CTTTGTAGTT ACGTACTTCT CTATACAGCG TTCTTAAAAC CTAGAACATC TCAAAATATA
GTTTTTGGAG GTGTCGCTGG AGCAATTCCC CCTCTTGTAG GAGCTTCAGC AGCGGCAGGA
CATATTGGGT TAGGTGGTTG GTGGCTATTC TCTTTGGTTA TGGTGTGGAC CCCAGCACAT
TTTTGGGCTT TGGCCATCTT GTTGAAAGAG GACTATCGAT CTGTTGGCAT TCCTATGCTT
CCTACAGTAA GTGGACCTTT TGTTACAGCT AAGGCTATCT CTGTGTATGG CTATCTAACT
GTATTTTTAA GCTTTTTAGG GTGCTTCGTT TTACCTGAAG GAGGTTTGTT ATACGGAATT
TTGTTGCTAC CTTATAATTC AAGACTTCTT CAATTGGTTT CCAGATTAAG AGATAATCCC
GAAGATTTAG ATCGTGCGAA GGGTTTATTT CGATGGTCTA TTCTTTACAT GTTTGGGGTT
TGTTTTTTGC TGGTTATTAG TAGATTACAA GTGTCAATTG TTTTTAATGA TCAGTTGATA
GCTTTAATTA AAGACTTCTC TATCGGATTT TCGTAA
 
Protein sequence
MVSSTSELIT QPVNREEVVP SRKRIKLPAW LEVVKPRLIP LLLATTVGGM ALSEEWPLPS 
PRLACTLGGG ALAAAAAGAL NCLWEQDLDK RMKRTSNRAL PSGRLSQSSV FIGAVACTLV
SSALLVSGVN CLAAGLTLLG LCSYVLLYTA FLKPRTSQNI VFGGVAGAIP PLVGASAAAG
HIGLGGWWLF SLVMVWTPAH FWALAILLKE DYRSVGIPML PTVSGPFVTA KAISVYGYLT
VFLSFLGCFV LPEGGLLYGI LLLPYNSRLL QLVSRLRDNP EDLDRAKGLF RWSILYMFGV
CFLLVISRLQ VSIVFNDQLI ALIKDFSIGF S