Gene Sare_4075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4075 
Symbol 
ID5705370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4636214 
End bp4637389 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content72% 
IMG OID641273501 
Productmalate dehydrogenase 
Protein accessionYP_001538856 
Protein GI159039603 
COG category[C] Energy production and conversion 
COG ID[COG0281] Malic enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0926309 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATCGT CCCCTGTGGA CCCTGCTGAC CCGGTCTTCC GACGGCACCG GGGCGGCAAG 
ATGGCCGTTA CCTCGACCGT CCCGTTGACC AGCCGGGAGG ACCTCTCCCT CGCGTACACC
CCGGGGGTGG CCCGGGTGTG CGAGGCCATC GCCGCCGAGC CCCGCCTCGC CGACGACTAC
ACCTGGGCCG GGCACACAGT CGCGGTCGTC ACCGACGGTT CGGCCGTGTT GGGACTCGGC
AACATCGGCC CGCGCGCCGC GCTGCCGGTC ATGGAGGGCA AGGCCGTGCT GTTCAAGCAG
TTCGGCGGGG TGGACGCGGT GCCGGTCTGC CTGGACACGC AGGACGTGGA CGAGATCGTG
GCCGCGGTAC GGGCCCTCGC CCCGTCGTTC GGCGGCATCA ACCTGGAGGA CATCAGCGCG
CCGCGCTGCT TCGAAATCGA GCGGCGCCTG GATGAGGCGC TGGACATTCC CGTCTTCCAC
GACGACCAGC ACGGCACCGC CATCGTCGTA CTCGCCGCGC TACGCAATGC GGCGGCGCTA
CTCACCCGCA AGCTCGGCGA CCTCCAGGTG GCGGTCAGCG GCGCGGGTGC CGCCGGCGTG
GCGGTGACCA AGATGCTCGT CGCCGGCGGG GTCAATCCGG AACAGGTGGT CGTCTGCGAC
TCCCAGGGCG TCCTCGGGCG GCACCGCGAT GATCTCACCG GCACCAAGGC CGAGCTGGCC
GAGCTGACCA ACGGCGACGG CCGGCAGGGC GACATGGCCG CGGCGCTGCG GGGTGCCGAC
GTGTTGATCG GCGTCTCCGG CGGCCAGATT CCCGAGGCGG CGGTGGCCGG CATGGCCCGC
GGCGGGATCG TCTTCGCCCT GGCCAACCCC ACCCCCGAGG TGCACCCGGC AGTGGCGGCC
CGGCACGTCG CGGTGGTCGC CACCGGCCGC AGCGACCATC CCAATCAGAT CAACAACGTG
CTCGCCTTTC CCGGAGTCTT CCGCGGCGCG TTGGACGCCC GGGCCACCCG GATCACGGAT
GGCATGAAGG TGGCCGCGGC GGACGCGATC GCCGGGGTGG TGGCCGAGTC GCTGACACCC
GAGGCGATCG TCCCGTCACC GCTGGACCCA CGGGTCGCCC CCGCGGTGGC CGAGGCGGTC
GCGGAGGCCG CACGGCGTGA CAGCGTGACC CGGTGA
 
Protein sequence
MSSSPVDPAD PVFRRHRGGK MAVTSTVPLT SREDLSLAYT PGVARVCEAI AAEPRLADDY 
TWAGHTVAVV TDGSAVLGLG NIGPRAALPV MEGKAVLFKQ FGGVDAVPVC LDTQDVDEIV
AAVRALAPSF GGINLEDISA PRCFEIERRL DEALDIPVFH DDQHGTAIVV LAALRNAAAL
LTRKLGDLQV AVSGAGAAGV AVTKMLVAGG VNPEQVVVCD SQGVLGRHRD DLTGTKAELA
ELTNGDGRQG DMAAALRGAD VLIGVSGGQI PEAAVAGMAR GGIVFALANP TPEVHPAVAA
RHVAVVATGR SDHPNQINNV LAFPGVFRGA LDARATRITD GMKVAAADAI AGVVAESLTP
EAIVPSPLDP RVAPAVAEAV AEAARRDSVT R