Gene Sare_4405 details

Gene Information

Locus tag: Sare_4405
Symbol:
ID: 5703454
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4977949
End bp: 4979157
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 76%
IMG OID: 641273824
Product: hypothetical protein
Protein accession: YP_001539173
Protein GI: 159039920
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3616] Predicted amino acid aldolase or racemase
TIGRFAM ID:
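
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the gene is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4977949
end_bp = 4979157
gene_length_bp = 1209
protein_length_aa = 402

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codons = span_bp // 3

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", span_bp % 3 == 0)
# Assumption: the last codon is a stop codon and adds no residue.
print("codons minus stop matches protein length:",
      codons - 1 == protein_length_aa)
```

Under these assumptions all three checks agree with the record: 1209 bp corresponds to 403 codons, or 402 residues plus one stop codon.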


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.443829
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0267897
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCCACCG ACAGTTCGGA TCTGCGCGCT CGGCTCGACC GGGCGACCGC TCACCTCGAC 
CCGCCGTACG CGGTGGTCGA CCTCAGGGCG TTCGATGCGA ACGCCGCCGC CCTCGCCAGT
CGCGCCGCCG GTAAGCCGGT CCGCGTCGCC AGCAAGTCGA TACGCTGCCG GACGCTGATC
TCCCGGGCCC TGACCGCCCC CGGCTGGGCG GGTGTGCTGA CGTTCACCCT GCCCGAGGCG
CTCTGGCTGG TCCGCTGCGG GGTAACCGAC GACGCGGTGG TGGCGTACCC CACGGCCGAT
CGAGCCGCGC TCGCCGAGCT GGCCGGTGAT CCGACGCTCG CCGCCGCGGT GACCCTGATG
GTCGACGACA CCGCCCAGCT CGACCTGGTG GACGCCGTCA GTGCCCCGGG GCAGCGGCCC
GAGCTGCGGG TCTGCCTCGA CCTGGACGCC TCCTGGCGAC CGCTGGGCGG CCGGCTGCAC
GTCGGGGTCC GCCGCTCGCC GGTGCACGAT CCGCGGGCGG CCGGCGCGCT CGCCGCCGCC
GTCGCCGCCC GGCCCGGGTT CCGGCTGGTC GGGCTCATGG CGTACGAGGC TCAGATCGCC
GGGCTGGGCG ACGCGCCACC GAAACGGGCA GTGCTCGGCG CGGCGATCCG ACTGGCCCAG
CGCGGGTCGT ACCGGGAGTT GCTGGCCCGC CGGAGTGCGG CGGTCGCGGC GGTACGCGAG
CACGCCGAGC TGGAGTTCGT CAACGGTGGC GGCACCGGCA GCGTGGCCGC CACCAGCGCC
GATCCCGCGG TCACCGAGGT GACCGCGGGG TCCGGGCTGT ACGGGCCGAC GCTGTTCGAT
GCCTACCGGG CCTGGCGCCC GACCCCCGCC GCGTACTTCG CCTGCTCGGT GGTCCGCCGG
CCAGCACCCG GCTACGCCAC TGTGCTCGGC GCCGGCTGGA TCGCCTCCGG ACCGGCCCAA
CGGAGTCGGC TTCCCCGCCC CGTCCTACCG GCCGGCCTCC AGTTGGTCGA CGCCGAGGGC
GCCGGCGAGG TGCAAACCCC GCTGACCGGC CGGGCAGCCG GCTCGCTACG GGTCGGCGAC
CGGGTCTGGT TCCGGCACGC CAAGGCCGGT GAACTCGCCG AGCACGTCAA CGAGCTGCAT
CTGGTGGAGG CCGACACCGC CGGGGCGGCC GCCGCCACGT ACCGGGGCGA GGGACGGGCG
TTCCTCTGA
 
Protein sequence
MATDSSDLRA RLDRATAHLD PPYAVVDLRA FDANAAALAS RAAGKPVRVA SKSIRCRTLI 
SRALTAPGWA GVLTFTLPEA LWLVRCGVTD DAVVAYPTAD RAALAELAGD PTLAAAVTLM
VDDTAQLDLV DAVSAPGQRP ELRVCLDLDA SWRPLGGRLH VGVRRSPVHD PRAAGALAAA
VAARPGFRLV GLMAYEAQIA GLGDAPPKRA VLGAAIRLAQ RGSYRELLAR RSAAVAAVRE
HAELEFVNGG GTGSVAATSA DPAVTEVTAG SGLYGPTLFD AYRAWRPTPA AYFACSVVRR
PAPGYATVLG AGWIASGPAQ RSRLPRPVLP AGLQLVDAEG AGEVQTPLTG RAAGSLRVGD
RVWFRHAKAG ELAEHVNELH LVEADTAGAA AATYRGEGRA FL
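
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is not the database's own tooling. It assumes that translation table 11 assigns amino acids exactly as the standard genetic code does and differs only in the set of permitted start codons, and that an alternative start codon such as GTG is read as methionine at the first position. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Only the first 60 bases of the gene sequence are used to keep the example short.

```python
# Amino-acid assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-base blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence from its first base up to the first stop."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        # Assumption: the sequence begins at a valid start codon,
        # which is read as methionine regardless of its usual meaning.
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
FIRST_LINE = (
    "GTGGCCACCG ACAGTTCGGA TCTGCGCGCT "
    "CGGCTCGACC GGGCGACCGC TCACCTCGAC"
)

print(translate(FIRST_LINE))  # MATDSSDLRARLDRATAHLD
print(round(gc_fraction(FIRST_LINE), 2))
```

The translated excerpt matches the first 20 residues of the protein sequence (MATDSSDLRA RLDRATAHLD). Pasting the full gene sequence into `FIRST_LINE` would allow the same comparison against the complete 402-residue protein and the reported GC content.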