Gene Sare_4247 details


Gene Information

Locus tag: Sare_4247
Symbol:
ID: 5708097
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4819147
End bp: 4820265
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 72%
IMG OID: 641273666
Product: alanine racemase
Protein accession: YP_001539019
Protein GI: 159039766
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0787] Alanine racemase
TIGRFAM ID: [TIGR00492] alanine racemase
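
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that contributes no residue. The variable names are hypothetical and not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 4819147
end_bp = 4820265

# Inclusive span on the replicon, in base pairs.
gene_length = end_bp - start_bp + 1

# Number of complete codons, including the terminal stop codon.
codons = gene_length // 3

# The stop codon encodes no amino acid, so it is excluded.
protein_length = codons - 1

print(gene_length, "bp")      # 1119 bp
print(protein_length, "aa")   # 372 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.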


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.170203
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00478121
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTGGCAGG CCGAGGTACG CGTCGACCTT GACGCGATCC GTGAGAACGT GAGTTGGCTG 
CGATCCGGTA GCGCCGCCGA GCTGATGGCG GTGGTCAAGG GCGACGGGTA CGGCCACGGC
ATGGTCCCGG CCGCCCTGGC CGCGCTGGAC GGTGGCGCCG ACTGGCTCGG TGTCTGCACC
CTCGACGAGG CACTCACGCT GCGCCGGGAG GGGATCACCG CGCCGATCCT GGCCTGGCTC
CTCGCCCCGG GCCTGCCGTT ACACGAAGGT GTGGCGGCCG GGATCGACCT CGGCGCGGCC
AGTGTGGCGC AGCTGGACGA GATGGTGCAG GCGGGCCGGA CCGCCGGCCG TCCGGCCCGG
CTGCACCTCA AGATCGACAC TGGGCTGTCC CGGGGTGGCG CCACCGTCTC GGACTGGCCG
GGGCTGCTCA CCGCCGCCGC GAAGGCGCAG GCCGACGGTA CGGTCGAGGT GGTCGGGGTG
TGGAGCCACT TCGTGTACGC GGACGCGCCC GGCCACCCGA CGACCGACCG GCAGCTCGCC
GTCTTCCACG AGGGCTTGGA CATGGTGGAG AAGGCGGGGC TGCGTCCGCG TTACCGCCAC
CTGGCCAACT CGGCAGCCAC GCTGACCCGG CCGGATGCCC ACTTCGACCT GGTCCGGCCC
GGGCTGGCCG TCTACGGCCT GTCTCCGGTG GCCGGCGAGA GCTTCGGGCT GCGGCCGGCG
ATGACCGCCC GCGCCCGCGT CATGCTCACC AAGCAGGTCC CGGCGGGCGC CGGGGTCTCC
TATGGCCACA CCTATACCAC CGAGCGGGAC AGCACCCTCG CCGTTATTCC GCTCGGGTAC
GCCGACGGGG TGCCCCGGAG CGCGTCCAAC AGCGGCCCGG TGCACCTGGG TGGCGTCCGG
CGCACCATCT CCGGCCGGGT CTGCATGGAC CAGTTCGTGC TCGACTGCGG CGACGATCCG
GTGGCGCCGG GGGACGTGGC CGTCCTGTTC GGCAGTGGGC GGAACGGGGA GCCAACAGCC
GACGACTGGG CCGAGGCGGT CGGCACGATC AACTATGAGA TCGTCACCCG ATTCGGCAGT
ACGCGGGTGC CCCGTAGCTA CGACGGCGAG CGTCCGTGA
 
Protein sequence
MWQAEVRVDL DAIRENVSWL RSGSAAELMA VVKGDGYGHG MVPAALAALD GGADWLGVCT 
LDEALTLRRE GITAPILAWL LAPGLPLHEG VAAGIDLGAA SVAQLDEMVQ AGRTAGRPAR
LHLKIDTGLS RGGATVSDWP GLLTAAAKAQ ADGTVEVVGV WSHFVYADAP GHPTTDRQLA
VFHEGLDMVE KAGLRPRYRH LANSAATLTR PDAHFDLVRP GLAVYGLSPV AGESFGLRPA
MTARARVMLT KQVPAGAGVS YGHTYTTERD STLAVIPLGY ADGVPRSASN SGPVHLGGVR
RTISGRVCMD QFVLDCGDDP VAPGDVAVLF GSGRNGEPTA DDWAEAVGTI NYEIVTRFGS
TRVPRSYDGE RP
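
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own tooling: it assumes the sequence is read in frame from its first base, and it builds the codon-to-residue mapping from the standard assignments, which translation table 11 shares with the standard code for elongation codons. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Table 11 uses these same assignments; it differs only in permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()   # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":                    # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and the final line of the gene sequence shown above.
print(translate("ATGTGGCAGG CCGAGGTACG CGTCGACCTT"))              # MWQAEVRVDL
print(translate("ACGCGGGTGC CCCGTAGCTA CGACGGCGAG CGTCCGTGA"))    # TRVPRSYDGERP
```

Both fragments match the corresponding ends of the listed protein sequence; the final TGA codon is the stop and yields no residue.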