Gene Sala_1206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1206 
Symbol 
ID4080701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1246738 
End bp1247958 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content63% 
IMG OID638009567 
Product5-aminolevulinate synthase 
Protein accessionYP_616255 
Protein GI103486694 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0156] 7-keto-8-aminopelargonate synthetase and related enzymes 
TIGRFAM ID[TIGR00858] 8-amino-7-oxononanoate synthase
[TIGR01821] 5-aminolevulinic acid synthase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.41592 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATTACA ACGATATTTT CGCTCGCGCG ATCGACCGGC TGCATGAAGA GGGACGCTAC 
CGCGTCTTTA TCGATATCCT GCGCAACAAG GGGGCTTATC CCAACGCGCG CTGCTTTGCG
GGACACAACG GGCCAAAACC GATCACCGTG TGGTGCTCGA ACGACTATCT TGCGATGGGC
CAGCATCCGA AGGTCATCGA GGCGATGGAA GCGGCGCTGC ACGACGTCGG CGCCGGGTCG
GGCGGAACGC GCAACATCGG CGGCAACACG CACTATCACA TCGACCTCGA GGCCGAACTC
GCCGACCTGC ACGGCAAGGA GGGCGCGCTG CTCTTCACAT CGGGCTATAT CTCGAACGAG
GCGACGCTCG GCACGCTGGG CAAGCTGCTC CCGAACTGCA TCATCTATTC GGACGAACTC
AACCACGCTT CGATGATCGC GGGTATCCGC AATTCGGGCT GCGAAAAGCG CGTGTGGCGG
CACAACGACC TGGAGCATCT CGAGGAACTG CTCGCCGCCG ACGATCCCGA GGCGGCAAAG
CTGATCGCGT TCGAAAGCGT CTACTCGATG GACGGCGATG TCGCCCCGAT CCACGCGATC
TGCGATCTTG CAGAGAAATA TAACGCGCTT ACCTATTGCG ACGAAGTCCA TGCCGTCGGC
ATGTACGGCC CGCGCGGCGG CGGCATCACC GACCGCGACG AGGCCGCGCA CCGCGTGACG
ATCATCGAGG GGACGCTCGG CAAGGCGTTT GGCGTGATGG GCGGCTACAT CGCCGCCGAC
AGGAATATCG TCGACGTGAT CCGCTCCTAT GCGCCGGGCT TCATCTTCAC GACCAGCCTC
TCGCCCGTGC TTGTCGCCGG GGTGCTCGCG AGTGTGCGTC ACCTGAAGCA ATCGAGCGTC
GAGCGCGACG CGCAACAGGC AGCTGCCGCC TATCTCAAGC AATGTTTCCG CGACGCGGGC
CTGCCGGTGA TGGATTCGAC GACGCACATC GTCCCGCTGA TGGTTGGCGA CCCGGTGCGC
GCAAAGAAGA TCAGCGACAT ATTGCTCGCC GAATATGGCG TCTATGTCCA GCCGATCAAT
TTCCCGACCG TGCCGCGCGG CACCGAACGC CTGCGCTTCA CCCCCGGCCC CGCGCATGAC
GAAGCGATGA TGCGCGAGTT GACCAGCGCG CTCGTCGAAA TCTGGGACCG GCTGGAATTG
CAGCTCCAGA AGGCGGCGTG A
 
Protein sequence
MNYNDIFARA IDRLHEEGRY RVFIDILRNK GAYPNARCFA GHNGPKPITV WCSNDYLAMG 
QHPKVIEAME AALHDVGAGS GGTRNIGGNT HYHIDLEAEL ADLHGKEGAL LFTSGYISNE
ATLGTLGKLL PNCIIYSDEL NHASMIAGIR NSGCEKRVWR HNDLEHLEEL LAADDPEAAK
LIAFESVYSM DGDVAPIHAI CDLAEKYNAL TYCDEVHAVG MYGPRGGGIT DRDEAAHRVT
IIEGTLGKAF GVMGGYIAAD RNIVDVIRSY APGFIFTTSL SPVLVAGVLA SVRHLKQSSV
ERDAQQAAAA YLKQCFRDAG LPVMDSTTHI VPLMVGDPVR AKKISDILLA EYGVYVQPIN
FPTVPRGTER LRFTPGPAHD EAMMRELTSA LVEIWDRLEL QLQKAA