Gene Sare_2148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2148 
Symbol 
ID5706966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2470497 
End bp2471741 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content67% 
IMG OID641271633 
Producthypothetical protein 
Protein accessionYP_001537004 
Protein GI159037751 
COG category[I] Lipid transport and metabolism 
COG ID[COG0439] Biotin carboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.149546 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTCC TCACCATCGA AACCCAGCAG TACCTCTCGT ACTACATTTC CCGCTACCAG 
CAGGTCGAGG CGTTCGGTGT CGACCTGTAC GTGCTCAACG GAGAGGGCAC GCACGACTTC
TGGCCGTCGG AGCGGTATCG CCTGGTCGGG TCGAAAAAGA TCGACGACAT CGTCGCCCAG
GCCCGTCGGT GGCACGCCGA GGAACAGTTC GACGGCGTGA TCACATTCTC GGAGTCGGCC
GTCGTCACGG TGGCCGTGGT GGCCGAAGCC CTGGGCCTGC CCGGAATCCC GGTCGAGGCC
GCCGTGCGCA GTCGCAACAA GTACCTGATG CGACGAGCAT ACGAGGAAGG GGGCGTACCG
GTTCCCCGTT ACCGGCTGGT CGAGTCGGTG GGCGAGGCCC TCGCCGGGGG CGCCGAGTTC
GGCTACCCGC TGATCATCAA ACCCACCATG GGAGCGGGCA GCCACTTCGT CTTCCGAGTG
GACGACGCCC GTGCGTTGGA ACGCCGGTAT GTGCAGGCGG CGGCCGGGAT CCAGGACATG
TTCTGGGCGA CGTCCGAGGC CGACGGCATC GATCTCGGCC CGCAGGGCCT GCTGGTGGAG
TCCTTCCTCG ATGGGCGCGA GTATCTGATC GAGGCGTTGG CCTGGGACGA CGAGCTCTAC
CTCGGCTCGG TGGTGGACCG GATCACCGTC GAGGGTGGCA CCTTCGACGA CGATGTCCAT
CACGCGCCCA CCTCGTTGCC CGCCGAGGAC CTGGCCAAGG TGCACCGTGT GGTGGCGGCC
GCAGCCCGGG CCCAGGGCCT GCACCGCAGC GTCATGCACG CGGAGGTGCG GTTCCACCAG
GGCGAGCCGC ACCTGCTGGA GATCGCCGCC CGGGTCGGCG GCGGCGGGTT GGACCAGATC
GCCCGGCTGA CCGCCGAGTA CGACCCGATC CGTGCTGTGG TCGACGTTGG TCGGGGGGTC
AGACCCGTGG TGCGACACTA TCGCCCGACC GGCACCCACA TCGCCGCGAT GTGTCTCATC
AGCGATGCGG GCGTGGTCGA GCAGGTGCAC GTGCCGCCGG AGGTCAGCAC CTCCGACAAG
GTGTTCCTGT TGAAGATCAC CGCTCGGCCG GGTGACCTGA TCCGCCGCCC CCCTGACGGC
AACACCATCC TCGGCTTCCT GGGCACGACC GGACGCTCCG AGGCGGAGGC CAGGTCCACC
ATGAACGAAT TCGCTTCCAA GATCACGGTT CGGTTCACCC GCTGA
 
Protein sequence
MKLLTIETQQ YLSYYISRYQ QVEAFGVDLY VLNGEGTHDF WPSERYRLVG SKKIDDIVAQ 
ARRWHAEEQF DGVITFSESA VVTVAVVAEA LGLPGIPVEA AVRSRNKYLM RRAYEEGGVP
VPRYRLVESV GEALAGGAEF GYPLIIKPTM GAGSHFVFRV DDARALERRY VQAAAGIQDM
FWATSEADGI DLGPQGLLVE SFLDGREYLI EALAWDDELY LGSVVDRITV EGGTFDDDVH
HAPTSLPAED LAKVHRVVAA AARAQGLHRS VMHAEVRFHQ GEPHLLEIAA RVGGGGLDQI
ARLTAEYDPI RAVVDVGRGV RPVVRHYRPT GTHIAAMCLI SDAGVVEQVH VPPEVSTSDK
VFLLKITARP GDLIRRPPDG NTILGFLGTT GRSEAEARST MNEFASKITV RFTR